Site Reliability Engineer
Paymentology is the first truly global issuer-processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 60 countries, at scale.
Our advanced, multi-cloud platform, offering both shared and dedicated processing instances, vast global presence and richer, real-time data, set us apart as the leader in payments.
We're on the hunt for an exceptional Site Reliability Engineer (SRE) to join our dedicated team. As an SRE at Paymentology, you'll be the superhero responsible for maintaining, improving, and ensuring the high availability, scalability, and performance of our platform.
What you get to do::Platform Reliability and Scalability:
- Build software that enhances Paymentology services' scalability and reliability.
- Ensure platform services meet required uptime and service quality levels.
- Contribute to the design of reliable cloud infrastructure and implement reusable cloud-uptime components as code.
- Regularly review and optimise SRE practices, tools, and methodologies to enhance overall system reliability and team efficiency.
Observability and Automation:
- Contribute to the design, implementation, and maintenance of observability and monitoring solutions to track the platform health, its cost-effectiveness, the reliability, and scalability, and identify potential issues which can be fed back to product and platform engineering in a continuous improvement loop.
- Develop and implement automation scripts and tools to streamline operations and reduce manual interventions.
- Enable product teams to self-serve by participating in the development of a developer platform.
Production Issue Resolution:
- Play an active role with the incident response teams, diagnosing and resolving production issues quickly to minimise downtime.
Standards Compliance:
- Support product teams in building services that adhere to our security and quality standards.
Cross-team Collaboration:
- Work closely with engineering, operations, and product teams to ensure reliability is considered throughout the end-to-end software development lifecycle. We seek to achieve this through advocacy and developing a culture of reliability.
At Paymentology we value making a difference to the lives of the people who work for us and who live in the communities where we operate. You can look forward to working with a diverse, global team where Paymentologists at all levels play an important part in our global mission to advance the world through payments and make a difference on a global scale.
Travel:< 5%
REQUIREMENTS
What it takes to succeed:
- Bachelor’s Degree in Computer Science, Information Technology, or related field.
- A minimum of 3 years in a dedicated SRE role, as well as 5+ years of prior software development experience.
- Comprehensive understanding of large-scale distributed platform architecture.
- Extensive hands-on cloud experience, particularly with AWS.
- Proven experience developing scalable, modular infrastructure-as-code projects using tools such as Terraform, CloudFormation, Puppet, and Ansible.
- Practical experience with Docker and container orchestrators, including AWS ECS & EKS, and Kubernetes.
- Experience in administering or integrating identity management systems for SSO, including AWS IAM, Okta, and Active Directory.
- Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments.
- Proficiency with leading monitoring tools, such as Datadog, Honeycomb.io, Splunk , Prometheus, Grafana, ELK Stack, and New Relic.
- Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL).
- Familiarity with industry-leading CI/CD tools such as Jenkins, GitHub Actions, Gitlab CI, CodePipelines, CircleCI, and ArgoCD.
- Track record of achieving platform-level and end-to-end SLIs, SLOs, and SLAs, and fostering accountability.
- Ability to navigate complex situations and lead effective post-incident reviews (PIRs).
- Knowledge of implementing solutions to reduce Mean Time to Identify (MTTI) and Mean Time to Resolve (MTTR).
- Expertise in implementing best practices for load balancing, fault tolerance, and resource allocation to maintain service quality and efficiency at scale.
- Understanding of security best practices within cloud environments.
You'll also need to bring a collaborative mindset, working seamlessly across teams to drive innovative solutions. And of course, your exceptional communication skills in English will allow you to clearly convey your ideas and recommendations.
As a key member of our technical team, you will be expected to maintain high availability and be ready to address critical incidents, ensuring the continuous performance of our systems. This includes being part of an on-call schedule to support 24/7 operations.
Why Paymentology?
- Full-time remote position with flexible hours.
- An inclusive and supportive work environment that values diversity.
- A chance to work on cutting-edge technology projects that make a difference.
- Opportunities for continuous learning and development.
Ready to Join Us?If you're a gadget guru who thrives on optimizing infrastructure, automating all the things, and delivering sky-high availability and performance, we want to hear from you! Apply now and be part of a company that values your skills and fosters your growth.
Recommended Jobs
Teacher of Mathematics - Enfield Independent School
School Status & Location Sector: Leading Independent School (with Sixth Form). Borough: Enfield (Outer London, England). Start Date: Permanent, full-time role commencing January 2026. T…
Events Senior Manager (Association)
Ready to step into a high-impact senior leadership role at the heart of the UK's private capital industry? We are seeking a Senior Events Manager to lead a talented team of four and deliver an excepti…
Head of Technical Design - Luxury F&B, Hospitality (Client-side)
Job description The client is seeking a technically focused Interior Designer / Architect to join their London-based team. The business owns and operates a portfolio of high-end private members’ c…
Payroll Administrator
Job Title: Payroll Administrator (Payroll and Compliance Coordinator) Location: Alperton, HA01HD Salary: £34,000.00 to £37,000.00 Per year Job Description We’re looking for a detail-oriented and o…
Managing Quantity Surveyor
2026 is continuing to provide sustainable growth within OCU Group. Due to this continued growth, an opportunity has arisen for a Managing Quantity Surveyor to join our busy Commercial Team based in Ru…
Commissioning Engineer (Electrical)
As a Senior Commissioning Engineer, you will be aligned to the Engineering Function throughout the project lifecycle. You will be responsible for developing and delivering the Commissioning work scop…
Python Quant Developer - RiskTech
Python Quant Developer – RiskTech £150,000 Quant Capital is urgently looking for a Python Quant Developer to join our high profile client. Our client is well funded Techstar Risk Te…
Finance Manager
Are you a standalone Finance Manager looking to progress your career with a growing small business? We are on the lookout for a passionate and experienced Finance Manager to join one of our favourite …
Surgery Centre Administrator/Receptionist
About the department Our Surgery Centre is a specialist private facility in central London, delivering high-quality surgical care across a range of specialties. The centre is supported by leading …
Housekeeping Supervisor
Housekeeping Supervisor Princess Louise of Kensington Nursing Home, Pangbourne, Westminster, London, W10 6DH £13.97 per hour 40 hours available Why work for us? We spend so mu…