Site Reliability Engineer at High Growth B2C Startup

Gizmo
London

Gizmo is an AI startup on a mission to make learning so easy that anyone can learn anything. We're building Duolingo for anything - a platform that uses gamification and social mechanics to make learning fun.

With over 1 million monthly active users and $4M in annual recurring revenue, we’re already one of the fastest-growing startups in the UK. Backed by leading investors, we recently raised $16M in Series A funding to accelerate our vision of helping 1 billion people learn.

Role Overview
Reporting to the founders, you will own capacity, performance and reliability for Gizmo’s full-stack platform as daily traffic climbs from hundreds of thousands to millions of users. You’ll write code across the stack, but your charter is classic SRE: defend SLOs , eliminate toil , and raise the ceiling on scale before it becomes a hard limit.

Key Responsibilities

  • Define SLIs/SLOs for latency, availability and error rate; codify error budgets and partner with product teams on trade-offs.
  • Perform load-testing, capacity modelling and up-front scalability design for PostgreSQL, OpenSearch, Redis, Hasura and CF Workers; produce data-driven scaling plans.
  • Extend metrics, structured logging and tracing; establish alert rules that page only on user-visible impact; build actionable runbooks .
  • Join the on-call rotation, lead blameless post-mortems, drive remediation work to closure and track MTTR/MTBF improvements.
  • Automate repetitive ops on Kubernetes and CI/CD; keep “toil” <50 % of your time by pushing fixes into code.
  • Coach full-stack engineers on query optimisation, schema design and back-pressure techniques; document patterns and anti-patterns by creating an SRE playbook

Requirements

  • Hands-on scale experience : you have run relational stores at 100 k+ TPS or 1 M+ concurrent users (e.g., multi-tenant PostgreSQL, sharded MySQL).
  • Strong backend fundamentals around concurrency, caching, indexing and distributed systems trade-offs.
  • Proven track record of setting SLOs, building dashboards (Prometheus/Grafana, OpenTelemetry, etc.) and tuning alerts.
  • Comfort with Kubernetes , IaC and cloud-native patterns; can debug from network to application layer.
  • Start-up bias for action: you prioritise high-leverage fixes, ship iteratively and own outcomes end-to-end.
  • Collaborative and feedback-driven; you welcome post-mortem culture and continuous improvement.
  • Driven by impact - you prioritise work that moves the needle!

Nice-to-haves: experience with Hasura internals, Cloudflare Workers edge optimisation, or running OpenSearch clusters at scale.

Benefits

  • Highly competitive salary.
  • You'll own a piece of what you're building - equity included.
  • Hybrid working model with 4 days in our East London office, ideally located between Shoreditch High Street, Old Street, and Liverpool Street stations.
  • The opportunity to become one of the earliest employees in one of the UK’s fastest-growing startups.
  • Private health insurance
Posted 2025-07-31

Recommended Jobs

Mandarin Speaking petrochemical sales

ELP Consult
London

Job Description Key Responsibilities: Responsible for the planning, distribution and execution of commodity products to companies and institutions that have exposure to commodities. - To prop…

View Details
Posted 2025-07-03

Senior WTG Engineer

City of London, Greater London

Our client Scottish Power Renewables are seeking a Senior WTG Engineer for an urgent role initially up until 31/12/2026 Main Purpose of the Job The Senior wind turbine Engineer is a key role wit…

View Details
Posted 2025-07-31

Head of Product Development, Colour & Complexion

Charlotte Tilbury
London

About Charlotte Tilbury Beauty Founded by British makeup artist and beauty entrepreneur Charlotte Tilbury MBE in 2013, Charlotte Tilbury Beauty has revolutionised the face of the global beauty ind…

View Details
Posted 2025-07-25

Payroll and Benefits Manager

Broadwick
London

Role: Payroll and Benefits Manager Reports into: Head of Group Reporting / Group People Director Location: London Contract Type: Full Time, Permanent (42.5 hours per week) Who we are:…

View Details
Posted 2025-08-07

C#.NET Developer Fintech

Quant Capital
London

C#.NET Developer Fintech Quant Capital is urgently looking for a C# .NET Developer to join our high profile client. My client is a rapidly growing specialist fintech software house pr…

View Details
Posted 2025-07-09

Revenue Management Accountant

Vitesse PSP
London

 ABOUT US Created by a team of proven FinTech entrepreneurs in 2015, Vitesse PSP is an FCA regulated business that provides global payment and treasury services to the insurance industry. Vitesse o…

View Details
Posted 2025-07-02

🚀 Join the Croud - Our Always-On Talent Pipeline for Paid...

Croud
London

&##128293; Paid Media Rockstars - We Want You &##128293; Are you ready to take your Paid Media Career to the next level? Whether you're an ambitious Account Exec or a seasoned Account Director, Cr…

View Details
Posted 2025-07-08

Band 6/7 Locum Neuro Physiotherapist - London

Pulse
London

Job Title: Locum Neuro Physiotherapist Banding: 6 Location: London Working Hours: Full-time (37.5 hours per week) Start Date: ASAP Duration: 3 months Rate: £24-27 per hour …

View Details
Posted 2025-07-31

Temporary Receptionist, Investment firm- Immediately Available

London

Do you love a variety and thrive in new environments? Well, temping could be your next adventure!If you enjoy returning to familiar faces, familiar places and love the buzz of a new challenge, temping…

View Details
Posted 2025-07-06

Real Estate Analyst (Hotels)

Michael Page
London

Liaise with brokers and obtain high level information for preliminary acquisition opportunities analysis. Review dataroom, run desktop valuation analysis and discuss with executive committee. M…

View Details
Posted 2025-06-04