Back to results

Site Reliability Engineer

Apply Now

Bh/8721_1773766433 Posted: 17/03/2026

£400 - £550 per day + Inside IR35
London
Contract

Site Reliability Engineer

Contract - 12 months

Inside IR35

Hybrid working

£400-550 per day depending on experience

Job Description
My client is looking for a skilled Senior Site Reliability Engineer to play a key role in improving the reliability, scalability, and operational performance of their production systems. This role works closely with product and engineering teams to enhance system reliability, architecture, deployment safety, and observability.

Role Summary

My client is seeking a Senior Site Reliability Engineer to join a centralized Technical Operations function, where you will lead reliability initiatives and support operations across a range of large-scale, customer-facing digital services.

Operating within a centralized SRE model, you will partner with product and engineering teams while maintaining shared responsibility for production reliability, resilience, and scalability. The role includes participation in an on-call rotation supporting critical services, with shared ownership of overall system health.

You will be responsible for defining reliability standards, influencing architectural improvements, managing complex incidents, and building automation to improve deployment safety and operational efficiency. Your work will directly support high-traffic systems used by a global audience.

Key Responsibilities

Reliability & Risk Engineering

My client is looking for someone who can:

Identify systemic reliability risks and drive long-term preventative improvements
Define and refine SLIs, SLOs, and error budgets aligned with business and customer outcomes
Lead complex incident management, post-incident reviews, and remediation planning
Depth at Networkign Fundamentals - trouble shoting network infrastructure is key
Experiecne working as senrio SRE particularly around AWS

Architecture & Resilience

You will:

Review and influence system architecture to improve scalability, availability, and fault isolation
Design strategies for high availability, graceful degradation, and disaster recovery
Evaluate trade-offs between performance, cost, and operational risk

CI/CD & Deployment Safety

The successful candidate will:

Improve deployment pipelines and implement automation to reduce risk and accelerate delivery
Implement safe deployment strategies such as canary releases and blue/green deployments
Ensure strong rollback and recovery mechanisms

Observability & Performance

You will be expected to:

Build and enhance observability solutions including metrics, logging, and tracing
Work with teams to reduce alert fatigue and improve signal quality
Diagnose performance bottlenecks across infrastructure and applications

Infrastructure & Automation

My client is seeking someone who can:

Design and operate cloud-native, containerised workloads at scale
Use Infrastructure as Code to build and manage resilient platforms
Develop automation to reduce manual effort and operational risk

Cross-Functional Leadership

You will:

Mentor engineers and promote SRE best practices across teams
Collaborate with engineering, product, and security stakeholders to improve system reliability

Required Qualifications

My client is looking for candidates with:

A degree in Computer Science, Engineering, or equivalent practical experience
Strong experience designing and operating CI/CD systems with deployment safety practices
Excellent communication skills with the ability to influence cross-functional teams
7+ years of experience in SRE, production engineering, or systems engineering roles
Strong knowledge of distributed systems concepts, including consistency and failure handling
Hands-on experience with major cloud platforms (e.g., AWS, GCP, Azure), including multi-region environments
Strong experience with Kubernetes and container orchestration at scale
Proficiency in at least one programming language such as Go, Python, or Java
Proven experience managing high-severity incidents and leading remediation efforts

Preferred Qualifications

Ideally, candidates will also have:

Experience with multi-region or multi-cloud architectures
Familiarity with observability tools such as Prometheus, Grafana, or Datadog
Previous mentoring or technical leadership experience
Experience with Infrastructure as Code tools such as Terraform or CloudFormation
Exposure to AI-assisted tooling for incident analysis or operational efficiency

Sphere Digital Recruitment is acting as an Employment Business in relation to this vacancy.

Bex Hudson-Dowdeswell Senior Client Partner

Apply for this role

First Name

Last Name

Telephone Number

Email Address

CV, LinkedIn or Dropbox URL

CV Upload

Choose File

LinkedIn / Dropbox URL

Message

By signing in to your account, you agree to our Terms of Service and consent to our Cookie Policy and Privacy Policy.

Not yet registered? Create an account today

Already have an account? Sign in now

Still looking? What about...

Featured Jobs

Posted: 17/03/2026

Senior Sales Director

e1221_1773766711

US$150000 - US$185000 per annum
San Francisco, California
Permanent

US Senior Sales DirectorLocation: Remote, North AmericaSalary: $150-185K, double OTE About the CompanyOu...

View Job

Posted: 17/03/2026

Site Reliability Engineer

Bh/8721_1773766433

£400 - £550 per day + Inside IR35
London
Contract

Site Reliability EngineerContract - 12 monthsInside IR35 Hybrid working£400-550 per day depending on exp...

View Job

Posted: 17/03/2026

Marketing Manager (B2B)

33472_1773761283

£35000 - £50000 per annum
London
Permanent

About the RoleA fast-growing global business is looking for a B2B Marketing Executive to join their expa...

View Job

Posted: 17/03/2026

Senior Paid Social Manager

BH34591_1773737196

£45000 - £50000 per annum + EXCELLENT BENEFITS
City of London, London
Permanent

Senior Paid Social ManagerMedia Agency | London (Hybrid) | Up to £50,000A fantastic opportunity to join ...

View Job

Posted: 17/03/2026

Graduate Account Executive - Media Planning and Buying

BH34432_1773736949

£25000 - £28000 per annum + excellent benefits
City of London, London
Permanent

Account Executive - Media Planning and Buying London (3 days in office) / Salary - £25,000An exciting e...

View Job

Posted: 17/03/2026

PPC Account Manager

17032026_1773736902

£45000 - £48000 per annum + EXCELLENT BENEFITS
City of London, London
Permanent

Paid Search Account ManagerLondon (Hybrid)£45,000Are you a data‑driven Paid Search specialist ready to t...

View Job

Posted: 16/03/2026

Head of Paid Social

HoPS1_1773682667

£45000 - £50000 per annum + + bonus
Manchester, Greater Manchester
Permanent

Head of Paid SocialManchester Based - 2-3 days a weekThe JobOwn the development of paid social strategie...

View Job

Posted: 16/03/2026

Head of Paid Media

HoPM_1773682650

£50000 - £55000 per annum + + bonus
Manchester, Greater Manchester
Permanent

Head of Paid MediaManchester Based - 2-3 days in the office per week.The JobOwn and optimise paid media ...

View Job

Posted: 16/03/2026

Senior Customer Success Manager

12/3/26_1773681862

+ 20% Bonus
London
Permanent

Senior Client Success Manager Location: Remote | London meetings twice per monthSalary: £50k - £55k + 20...

View Job

Posted: 16/03/2026

Support Engineer - Japanese Speaking

bhd-2928_1773672948

£20 - £30 per hour
London
Contract

Support Engineer- Developer relationsLondon - hybridInside IR35£20-30 per hour - 40 hour weekContract 6 ...

View Job

About Us

About Us

Case Studies View All

Our Services

Hiring

Case Studies View All

Our Industries

Industries

Case Studies View All

Quick CV Dropoff

Site Reliability Engineer

Apply for this role

Still looking? What about...

Featured Jobs

Senior Sales Director

Site Reliability Engineer

Marketing Manager (B2B)

Senior Paid Social Manager

Graduate Account Executive - Media Planning and Buying

PPC Account Manager

Head of Paid Social

Head of Paid Media

Senior Customer Success Manager

Support Engineer - Japanese Speaking

Contact Us

Connect with Us

Quick Links

Specialisms