Site Reliability Engineering (SRE) / Observability Technical Lead
JOB DESCRIPTION
The team you'll be working with:
We are seeking an experienced Site Reliability Engineer (SRE) / Observability Technical Lead to join our team and drive the strategy and execution of observability and reliability projects across our clients. The ideal candidate will have deep expertise in Application Performance Monitoring (APM), Infrastructure as Code (IaC), automation, and distributed tracing using OpenTelemetry. As a lead, you will guide the design, implementation, and continuous improvement of observability solutions, ensuring system reliability, performance, and scalability while fostering best practices in SRE and DevOps.
What you'll be doing:
- Lead the strategic development and management of observability and reliability frameworks across the organization, ensuring alignment with business goals and technical requirements.
- Design and implement monitoring and observability solutions, collaborating with engineering teams to define standards and best practices.
- Manage Infrastructure as Code (IaC) initiatives using Terraform, coordinating with cloud and infrastructure teams to ensure scalable and secure deployments.
- Drive automation strategies for monitoring, alerting, and logging pipelines, focusing on process improvements and operational efficiency.
- Develop and maintain comprehensive observability roadmaps, including distributed tracing, logging, and metrics collection strategies.
- Collaborate with product management, sales, and pre-sales teams to provide technical expertise and support during solution design and customer engagements.
- Lead cross-functional teams to enhance CI/CD pipelines and deployment reliability, ensuring smooth integration of observability tools and practices.
- Engage with vendors and strategic partners to evaluate, select, and integrate observability and monitoring solutions, ensuring alignment with organizational needs and fostering strong collaborative relationships.
- Mentor and develop junior engineers and analysts, fostering a culture of reliability, observability, and operational excellence.
What experience you'll bring:
- 5+ years of experience in SRE, Observability, or DevOps roles, with leadership responsibilities.
- Proven expertise with Application Performance Monitoring (APM) tools such as New Relic, Datadog, AppDynamics, or Dynatrace.
- Hands-on experience with OpenTelemetry (OTel) for distributed tracing and observability instrumentation.
- Strong proficiency in Infrastructure as Code (IaC) using Terraform.
- Solid understanding of cloud platforms including AWS, GCP, or Azure.
- Experience with automation/configuration management tools like Ansible, Chef, or Puppet.
- Deep knowledge of CI/CD pipelines and tools such as GitHub Actions, Jenkins, or Azure DevOps.
- Experience managing Kubernetes and containerized environments (Docker, Helm).
- Familiarity with log aggregation and analysis platforms like ELK Stack or Splunk.
- Excellent leadership, communication, and collaboration skills.
Who we are:
We’re a business with a global reach that empowers local teams, and we undertake hugely exciting work that is genuinely changing the world. Our advanced portfolio of consulting, applications, business process, cloud, and infrastructure services will allow you to achieve great things by working with brilliant colleagues, and clients, on exciting projects.
Our inclusive work environment prioritises mutual respect, accountability, and continuous learning for all our people. This approach fosters collaboration, well-being, growth, and agility, leading to a more diverse, innovative, and competitive organisation. We are also proud to share that we have a range of Inclusion Networks such as: the Women’s Business Network, Cultural and Ethnicity Network, LGBTQ+ & Allies Network, Neurodiversity Network and the Parent Network.
For more information on Diversity, Equity and Inclusion, see: Creating Inclusion Together at NTT DATA UK | NTT DATA
What we'll offer you:
We offer a range of tailored benefits that support your physical, emotional, and financial wellbeing. Our Learning and Development team ensure that there are continuous growth and development opportunities for our people. We also offer flexible work options.
We are an equal opportunities employer. We believe in the fair treatment of all our employees and commit to promoting equity and diversity in our employment practices. We are also a proud Disability Confident Committed Employer - we are committed to creating a diverse and inclusive workforce. We actively collaborate with individuals who have disabilities and long-term health conditions which have an effect on their ability to do normal daily activities, ensuring that barriers are eliminated when it comes to employment opportunities. In line with our commitment, we guarantee an interview to applicants who declare to us, during the application process, that they have a disability and meet the minimum requirements for the role. If you require any reasonable adjustments during the recruitment process, please let us know. Join us in building a truly diverse and empowered team.