VE3 | Full time

Observability Engineer

London Arena, United Kingdom | Posted on 24/12/2024

Job Information

  • Date Opened: 24/12/2024
  • Industry: IT Services
  • Job Type: Full time
  • Work Experience: 5+ years
  • City: London Arena
  • Province: City of London
  • Country: United Kingdom
  • Postal Code: E14

About Us

VE3 is a technology and business consultancy focused on delivering end-to-end technology solutions and products. We have served enterprises across multiple markets, in both the public and private sectors. Our services span all aspects of business, providing a holistic approach to managing an organization. We are committed to providing technical innovations and tools that give organizations the critical information they need for decision-making, leading to business transformation through cost savings and increased operational efficiency. Our commitment to quality runs throughout the organization and is the foundation for delivering our full suite of capabilities.


Job Description





About the Company


VE3 is a global technology company committed to delivering innovative digital solutions that help organizations unlock their full potential. We assist organizations in optimizing operations, enhancing customer experiences, unlocking data-driven insights, and fostering agile strategies. Our focus on delivering cutting-edge software solutions ensures that we meet the unique needs of our clients across various industries. We are a forward-thinking company, committed to helping organizations navigate and thrive in the digital era through innovative and sustainable technology solutions.



Position Overview:

We are seeking a skilled Observability Engineer to design, implement, and optimize observability solutions. This role involves working across various methodologies and integrating a diverse range of tools to enhance system monitoring, log aggregation, and performance optimization. The ideal candidate will have expertise in observability frameworks and tools, a strong understanding of data security principles, and the ability to train and empower teams.

Requirements



1) Observability Implementation & Development:

  • Design, develop, and implement observability solutions across diverse methodologies (Agile, SAFe, Waterfall, Kanban, DevOps).
  • Integrate tools such as DataDog, New Relic, ELK Stack, Kibana, and Grafana to create comprehensive monitoring and alerting capabilities.
  • Provide guidance on best practices for event management, log aggregation, and system performance monitoring.


2) Framework Development & Optimization:

  • Establish observability frameworks based on the Principle of Least Privilege, ensuring data security and compliance.
  • Optimize the cost-efficiency of observability tooling through strategic planning and implementation.


3) Operational Support & Team Enablement:

  • Roll out observability frameworks and capabilities to large-scale organizations, ensuring seamless integration into existing systems.
  • Upskill incumbent teams on new observability toolchains to enable effective dashboarding, incident resolution, and insight generation.
  • Lead training sessions and workshops to build team proficiency in monitoring and analytics.


4) Required Skills and Qualifications:

Experience Level:


  • 5+ years of experience in IT operations, monitoring, and observability.
  • Minimum 3 years of hands-on experience with tools such as DataDog, New Relic, ELK Stack, Grafana, or Kibana.


5) Technical Expertise:

  • Proficiency in observability platforms and log management tools.
  • Strong scripting and automation skills using Python, Bash, or PowerShell.
  • Hands-on experience with CI/CD pipelines and DevOps tools such as Jenkins, GitLab, and Azure DevOps.
  • Familiarity with cloud platforms, particularly AWS, Azure, and GCP.


6) Tools, Technologies & Frameworks:

  • Observability: DataDog, New Relic, ELK Stack (Elasticsearch, Logstash, Kibana), Grafana, Prometheus.
  • Logging and Metrics: Fluentd, Logstash, CloudWatch, Azure Monitor.
  • Incident Management: ServiceNow, PagerDuty.
  • Frameworks: Agile, SAFe, DevOps, ITIL.


7) Soft Skills:

  • Strong analytical and problem-solving skills with attention to detail.
  • Excellent communication skills for collaboration with technical and non-technical stakeholders.
  • Ability to mentor and train teams on observability tools and practices.



8) Monitoring & Incident Management:

  • Safeguard customer data within the observability ecosystem while adhering to security best practices.
  • Collaborate with cross-functional teams to troubleshoot and resolve incidents identified through observability tools.


9) Tool Evaluation & Prototyping:

  • Conduct proofs of concept (PoCs) for new observability tools and techniques.
  • Continuously evaluate and recommend enhancements to the observability ecosystem.


10) Education and Certification:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Certifications in tools like DataDog or cloud platforms (AWS, Azure, GCP) are highly desirable.



11) Preferred Qualifications:

  • Experience rolling out observability solutions in large-scale or complex environments.
  • Knowledge of hybrid cloud observability frameworks.
  • Expertise in cost optimization for observability tools and services.

Benefits


  • Competitive salary and benefits package.
  • Opportunities for professional development and certification.
  • Flexible working arrangements and a collaborative team environment.