Last updated: 2025-05-04

23 Site Reliability Engineer jobs in London.

Hiring now: Site Reliability Engr @ Palantir, Sr Site Reliability Engr @ Birdie, Principal Site Reliabilit @ Genomics E, Site Reliability Engr 3 @ Behavox, Infrastructure Proj Mgr @ Cloudflare, Site Reliability Engr @ Writer, Site Reliability Engr @ Toggle, Sre Contributor @ Axon, Sr Site Reliability Engr @ Blacklane, Staff Software Engr Ai Re @ Anthropic.Explore more at jobswithgpt.com.

🔥 Skills

Kubernetes (11) AWS (10) DevOps (9) observability (8) Site Reliability Engineering (8) Python (8) CI/CD (7) automation (6) Site Reliability Engineer (5) Terraform (5)

📍 Locations

London (23)

Palantir

Skills & Focus: Site Reliability Engineer, production infrastructure, cloud environments, on-prem environments, automate processes, systems design, diagnosing production issues, resolution of issues, partner teams, high-performance services
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time
Skills & Focus: Site Reliability Engineer, production infrastructure, cloud environments, on-prem environments, automate processes, systems design, diagnosing production issues, resolution of issues, partner teams, high-performance services
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time

Birdie

Skills & Focus: SRE, DevOps, AWS, CI/CD, incident management, reliability engineering, OpenTelemetry, cloud-native architecture, metrics, observability
About the Company: Birdie is the leading home healthcare technology platform that aims to radically transform the lives of older adults. Its all-in-one solution supports around 4…
Salary: £80k - £100k per annum
Type: Full-time
Benefits: 33 days of holiday, learning & development budget, work from home budget, private health insurance, parental leave, pen…

Genomics England

Skills & Focus: Site Reliability Engineering, Platform Engineering, AWS, Python, Infrastructure as Code, CI/CD, Monitoring, Release Automation, Collaboration, Stakeholder Engagement
About the Company: Genomics England is dedicated to advancing the field of genomics and improving healthcare through the use of cutting-edge technology and practices.

Behavox

Skills & Focus: DevOps, DevSecOps, SRE, AWS, GCP, CI/CD, automation, security best practices, observability, scripting
About the Company: Behavox is shaping the future for how businesses harness their most important raw material - data. Our mission is bold: Organize enterprise data into actionabl…
Experience: 5+ years experience in a DevOps/DevSecOps/SRE engineering role
Benefits: Benefits include great health coverage for employee and family, generous time-off policy and flexible work schedule.
Skills & Focus: site reliability, DevOps, production systems, cloud, automation, Python, Golang, CI/CD, monitoring, high-load
About the Company: Behavox is shaping the future for how businesses harness their most important raw material - data. Our mission is bold: Organize enterprise data into actionabl…
Experience: 5+ years of experience as an SRE/DevOps engineer responsible for deployment and maintenance of production systems
Benefits: Benefits include great health coverage for employee and family, generous time-off policy and flexible work schedule

Cloudflare

Skills & Focus: Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, DevOps, Network Protocols, Configuration management, Automation solutions, Documentation, Technical leadership
About the Company: At Cloudflare, we are on a mission to help build a better Internet. We run one of the world’s largest networks that powers millions of websites and other Inter…
Experience: Minimum of 3 years of prior relevant experience in Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, and/or DevOps experience
Type: Hybrid
Skills & Focus: Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, DevOps, Network Protocols, Configuration management, Automation solutions, Documentation, Technical leadership
About the Company: At Cloudflare, we are on a mission to help build a better Internet. We run one of the world’s largest networks that powers millions of websites and other Inter…
Experience: Minimum of 3 years of prior relevant experience in Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, and/or DevOps experience
Type: Hybrid

Writer

Skills & Focus: Site Reliability Engineering, cloud infrastructure, Terraform, Python, AWS, Azure, GCP, Kubernetes, monitoring, automation
About the Company: Writer is the full-stack generative AI platform delivering transformative ROI for the world’s leading enterprises. Named one of the top 50 companies in AI by F…
Experience: Minimum of 7 years hands-on experience in Site Reliability Engineering
Type: Full-time
Benefits: Generous PTO, medical, dental, and vision coverage, paid parental leave, flexible spending accounts, health savings acc…

Toggle

Skills & Focus: Site Reliability Engineering, Kubernetes, Distributed Systems, Cloud-native Applications, GitOps, Microservice Architectures, Automation, Performance, Data Structures, Networking
About the Company: Toggle's software engineers drive the development of cutting-edge technologies that transform how millions of users connect, explore information, and engage wi…
Experience: 4 years of experience as a software engineer.
Salary: £50,000 - £70,000 GBP
Type: Full-time

Axon

Skills & Focus: site reliability engineering, cloud-native, Kubernetes, Azure, AWS, Python, Terraform, CI/CD, observability, infrastructure as code
About the Company: At Axon, we’re on a mission to Protect Life. We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and clou…
Experience: 7+ years of applicable experience.
Benefits: Competitive salary and pension with employer match, 30 days holiday, paid parental leave for all, medical, dental, visi…

Blacklane

Skills & Focus: Site Reliability Engineer, SRE, system reliability, observability, mentoring, scalable systems, microservices, AWS, Kubernetes, Terraform
Benefits: Fair Pay & Shared Success, Blacklane Mystery Rides, Learning & Development, weekly homecooked office lunches, health an…

Anthropic

Skills & Focus: Service Level Objectives, monitoring systems, high-availability infrastructure, automated failover, incident response, cost optimization, distributed systems, SLO/SLA frameworks, chaos engineering, AI infrastructure
About the Company: Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a who…
Salary: Annual Salary: £255,000 - £390,000 GBP
Type: Full-time
Benefits: Competitive compensation, optional equity donation matching, generous vacation and parental leave, flexible working hou…

Vml Enterprise Solutions

Skills & Focus: New Relic, observability, monitoring solutions, cloud platforms, programming languages, stakeholder management, technical documentation, incident response, problem identification, training
About the Company: At VML, we are a beacon of innovation and growth in an ever-evolving world. Our heritage is built upon a century of combined expertise, where creativity meets …
Experience: Deep expertise in New Relic platform configuration and optimisation and extensive understanding of observability principles.
Benefits: Career development opportunities and inclusion initiatives.

Reddit

Skills & Focus: Site Reliability Engineering, software development, distributed systems, Kubernetes, Cloud systems, Go, Python, observability, DevOps, automation
About the Company: Reddit is a community of communities, built on shared interests, passion, and trust. It hosts one of the most open and authentic conversations on the internet.
Experience: 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
Benefits: Pension Scheme, Private Medical and Dental Scheme, Life Assurance, Income Protection, Workspace benefit for your home o…
Skills & Focus: Site Reliability Engineering, software development, distributed systems, Kubernetes, Cloud systems, Go, Python, observability, DevOps, automation
About the Company: Reddit is a community of communities, built on shared interests, passion, and trust. It hosts one of the most open and authentic conversations on the internet.
Experience: 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
Benefits: Pension Scheme, Private Medical and Dental Scheme, Life Assurance, Income Protection, Workspace benefit for your home o…

Palantir

Skills & Focus: Kubernetes, Production Infrastructure, Cloud Hyperscalers, K8s Clusters, Engineering Rigor, Operational Excellence, Automation, Self-Healing Systems, Compliance Regimes, CNCF Components
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Experience: Senior level experience in software engineering, particularly in infrastructure and Kubernetes.
Skills & Focus: Kubernetes, Production Infrastructure, Cloud Hyperscalers, K8s Clusters, Engineering Rigor, Operational Excellence, Automation, Self-Healing Systems, Compliance Regimes, CNCF Components
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Experience: Senior level experience in software engineering, particularly in infrastructure and Kubernetes.

Wheely

Skills & Focus: infrastructure, cloud-based solutions, AWS, Golang, Python, Ansible, Terraform, Docker, Kubernetes, SQL/NoSQL databases
About the Company: Wheely is not a traditional ride-hailing company. We are building a platform with user privacy at its core while successfully scaling a five-star service to mi…
Experience: Proven experience with cloud-based solutions (AWS/GCP); Ability to code in Golang, Python, etc.; Experience in infrastructure automation using Ansible, Terraform, Bash, Python, etc.
Type: On-site
Benefits: Flexible working hours, employee stock options plan, lunch allowance, medical insurance including dental, life and crit…

Blockchain

Skills & Focus: Site Reliability Engineer, infrastructure, monitoring tools, GCP, AWS, Terraform, GitOps, containerization, programming languages, configuration management
About the Company: Blockchain is the world's leading software platform for digital assets, offering the largest production blockchain platform globally and aiming to build an ope…
Experience: Experience with containerization and service orchestration; strong knowledge of at least one programming language; experience with cloud solutions; experience with modern monitoring tools; experience with infrastructure as code tools; solid background with configuration management tools; experience with GitOps and CI; experience with messaging systems; experience with database management.
Salary: Full-time salary based on experience and meaningful equity
Type: Full-time
Benefits: Hybrid model working from home & awesome office location; unlimited vacation policy; Apple equipment; flexible work cul…

Behavox

Skills & Focus: Site Reliability Engineer, DevOps, cloud computing, automation, Python, Golang, AWS, GCP, CI/CD, Kubernetes
About the Company: Behavox is shaping the future for how businesses harness their most important raw material - data. Our mission is bold: Organize enterprise data into actionabl…
Experience: 5+ years of experience as an SRE/DevOps engineer responsible for deployment and maintenance of production systems
Benefits: Great health coverage for employee and family, generous time-off policy and flexible work schedule.

Blacklane

Skills & Focus: Site Reliability Engineering, Developer Experience, Technical Leadership, Collaboration, Problem Solving, Mentorship, Scalability, Resilience, Security, Continuous Improvement
About the Company: Blacklane provides professional chauffeur services, prioritizing scalability, resilience, and security in their platform. They emphasize employee growth, commu…
Experience: Proven expertise in Site Reliability Engineering
Salary: Up to 120,000
Type: Full-time
Benefits: Fair Pay & Shared Success, Mystery Rides, Learning & Development, Office lunches, Mental Health support, Social respons…

Birdie

Skills & Focus: AWS, cloud-native architecture, Kubernetes, DevOps, CI/CD, SRE, security best-practices, micro-services, platform engineering, observability
About the Company: Birdie is the leading home healthcare technology platform that aims to radically transform the lives of older adults. Its all-in-one solution supports around 4…
Salary: £105k - £125k per annum
Type: Full-time
Benefits: Compensation includes competitive salary packages, generous stock options, L&D budget, Work From Home budget, 33 days o…