Site Reliability Engineer (IT)
We are seeking an experienced and proactive Site Reliability Engineer (SRE) to join a team supporting multiple data product and platform groups. This role is focused on improving the reliability, scalability, observability, and operational performance of critical data-driven platforms and services across complex production environments. The successful candidate will work closely with engineering, platform, and support teams to strengthen monitoring and alerting capabilities, improve logging and traceability, troubleshoot production incidents, support deployments, and automate operational processes wherever possible. The environment includes Kubernetes, Helm, the ELK stack, and a strong focus on modern Site Reliability Engineering practices across cloud and platform services. This is a hands-on technical role suited to someone who thrives in fast-paced operational environments and is passionate about reliability engineering, automation, and continuous improvement. The role requires strong collaboration with both client stakeholders and engineering teams to ensure platform stability, operational excellence, and high service availability Candidate profile:
- Support, maintain, and improve highly available production platforms and services across cloud and containerised environments.
- Manage and support Kubernetes clusters and Helm-based deployments across multiple environments.
- Implement and enhance monitoring, alerting, logging, and observability solutions to improve platform reliability and operational visibility.
- Investigate incidents, analyse logs, identify root causes, and drive timely resolution of production issues.
- Participate in incident response, post-incident reviews, and continuous operational improvement initiatives.
- Automate operational tasks and repetitive support activities to reduce manual effort and improve platform efficiency.
- Work closely with engineering and data platform teams to improve system resilience, scalability, deployment reliability, and operational maturity.
- Develop and maintain operational documentation, support procedures, runbooks, and troubleshooting guides.
- Contribute to reliability engineering practices including proactive monitoring, service health management, and operational readiness.
- Support deployment activities, release processes, and production change management activities.
- Strong commercial experience in Site Reliability Engineering, Platform Engineering, DevOps, or Production Support environments.
- Strong hands-on experience with Kubernetes and Helm in enterprise or production environments.
- Proven experience supporting mission-critical production platforms and operational support functions.
- Strong hands-on experience with the ELK stack (Elasticsearch, Logstash, Kibana) for logging, monitoring, troubleshooting, and operational analysis.
- Demonstrated capability in log analysis, incident investigation, troubleshooting, and root cause analysis.
- Strong understanding and practical experience with core SRE practices including:
- Experience working with data platforms, analytics platforms, or data product teams would be highly advantageous.
- Experience with scripting and automation tools such as Bash, Python, or similar technologies is desirable.
- Exposure to CI/CD pipelines, Infrastructure as Code, and cloud-native environments would be beneficial.
- Strong communication, stakeholder engagement, and collaboration skills.
- Ability to work effectively in fast-paced support environments and manage competing priorities under pressure.
- Resource must be willing and able to work onsite at the client location five days per week.
- Candidate must already hold current HLC clearance (mandatory requirement).
- Previous experience working within secure, government, defence, or highly regulated environments will be highly regarded.
- Due to client security requirements, only candidates meeting the required clearance criteria will be considered.
#LI-CGISDI
Recommended Jobs
Geography Specialist | North London | Focus on Academic...
An "Outstanding" secondary school in Enfield is seeking a scholarly Geography Teacher for a full-time, permanent role. This position is ideal for a practitioner who prizes academic rigour and seeks t…
Part-time Nanny-Housekeeper in Kew Gardens, London, Job ID J209C7
This lovely family based in Kew Gardens, London, is looking for a Part-time Nanny-Housekeeper to care for their baby and toddler while maintaining their property. All general Nanny-Housekeeping dutie…
Year 4 Teacher — Good School — Merton — January 2026 start
An ambitious Good primary in Merton is recruiting a reflective Year 4 Teacher to join the KS2 team on a Full-Time basis from January 2026 . The Year 4 Teacher will begin pre-term planning …
DT Technician - Mixed Secondary School in Sutton
DT Technician – Mixed Secondary School in Sutton Location: Sutton Start Date: January 2026 Contract Type: Full-time, Permanent Salary: Competitive, dependent on experience A mixed …
Part-time Nanny in London, Job ID J1EC0DR
This Islington-based family is seeking a Part-time Nanny to care for their toddlers. All general Nanny duties are required in this role. The ideal candidate will be someone very interactive and engag…
Senior Platform Engineer
London’s digital backbone is being rebuilt. Not just upgraded, but completely reimagined. And behind every great revolution in tech is a rock solid platform. We are looking for a Senior Platform En…
Animal Management Lecturer
Animal Management Lecturer Job Type Teaching Job Category Various Employment Basis Established (salaried) Location John Ruskin College - Croydon, East Surrey College - Redhill Salary …
English Teacher - Sixth Form Excellence - Havering
English Teacher – Drive Excellence in A-Level Literature and Academic Scholarship – Havering A high-performing , academic secondary school in Havering with an outstanding Sixth Form is seeki…
Registered Mental Health Nurse (RMN)
Before responding to this job advertisement, please ensure that you have the correct ‘right to work’ in the UK. As an agency we cannot sponsor visa’s so any CVs sent without the correct ‘right to …
Senior 2D (AI) Artist
About us Arena is a fast-moving digital entertainment company creating platforms people genuinely love. Since 2021, we have launched bold brands including MetaWin, WOW Vegas, BetZoo Media, Hit.com…