31 Site Reliability Engineering jobs in London.

Hiring now: Associate Site Reliabilit @ Servicenow, Site Reliability Engr @ Palantir, Site Reliability Engr @ Bumble, Sr Site Reliability Engr @ Birdie, Cloud Infrastructure Engr @ Contentful, Observability Engr @ Drw, Sr Software Engr Sre Clou @ Google, Site Reliability Engr @ Writer, Sr Site Reliability Engr @ Reddit, Site Reliability Engr @ Vitesse.Explore more at jobswithgpt.com.

🔥 Skills

Site Reliability Engineering (13) Kubernetes (12) automation (10) AWS (10) Terraform (8) distributed systems (8) observability (7) Python (6) Site Reliability Engineer (6) DevOps (6)

📍 Locations

London (30) Staines (1)

Servicenow

Skills & Focus: site reliability, infrastructure, automation, AI, software development, systems engineering, networking, Linux, Python, JavaScript
About the Company: It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today …
Experience: 0-2 years

Palantir

Skills & Focus: Site Reliability Engineer, production infrastructure, cloud environments, on-prem environments, automate processes, systems design, diagnosing production issues, resolution of issues, partner teams, high-performance services
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time
Skills & Focus: Site Reliability Engineer, production infrastructure, cloud environments, on-prem environments, automate processes, systems design, diagnosing production issues, resolution of issues, partner teams, high-performance services
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time

Bumble Inc.

Skills & Focus: Site Reliability Engineering, reliability, scalability, performance, software systems, infrastructure, automation, development, security, operations
About the Company: Bumble Inc. is an equal opportunity employer that strongly encourages people of all backgrounds to apply, including those who are LGBTQ+, veterans, parents, an…

Birdie

Skills & Focus: SRE, DevOps, AWS, CI/CD, incident management, reliability engineering, OpenTelemetry, cloud-native architecture, metrics, observability
About the Company: Birdie is the leading home healthcare technology platform that aims to radically transform the lives of older adults. Its all-in-one solution supports around 4…
Salary: ÂŁ80k - ÂŁ100k per annum
Type: Full-time
Benefits: 33 days of holiday, learning & development budget, work from home budget, private health insurance, parental leave, pen…

Contentful

Skills & Focus: cloud infrastructure, AWS, Terraform, Kubernetes, observability, distributed systems, metrics, logs, security, high availability
About the Company: Contentful is the intelligent composable content platform that unlocks all of an organization’s digital content to deliver impactful customer experiences, maki…
Experience: 3+ years of experience building and operating cloud infrastructure in high-availability environments
Type: Full-time
Benefits: Full-time employees receive Stock Options, fertility and family building benefits, generous paid time off, personal ann…

Drw

Skills & Focus: logging, monitoring, automation, CI/CD, git, troubleshooting, Splunk, Grafana, Prometheus, kubernetes
About the Company: DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets aro…
Experience: 5+ years of industry experience using various logging and monitoring tools

Google

Skills & Focus: Site Reliability Engineering, incident management, distributed systems, Cloud Support, GCP, reliability, automation, system design, capacity planning, process improvement
About the Company: Google is a global technology company specializing in internet-related services and products.
Experience: 5 years of software development and 3 years designing distributed systems.
Type: Full-time

Writer

Skills & Focus: Site Reliability Engineering, cloud infrastructure, Terraform, Python, AWS, Azure, GCP, Kubernetes, monitoring, automation
About the Company: Writer is the full-stack generative AI platform delivering transformative ROI for the world’s leading enterprises. Named one of the top 50 companies in AI by F…
Experience: Minimum of 7 years hands-on experience in Site Reliability Engineering
Type: Full-time
Benefits: Generous PTO, medical, dental, and vision coverage, paid parental leave, flexible spending accounts, health savings acc…

Google

Skills & Focus: software development, data structures, algorithms, distributed systems, networking, problem solving, automation, debugging, optimizing code, large-scale systems
About the Company: Google is a global technology company focused on providing Internet-related services and products.
Experience: 2 years of experience with software development in one or more programming languages; 2 years of experience with data structures or algorithms.
Skills & Focus: Site Reliability Engineering, software development, distributed systems, storage, networking, large-scale systems, optimization, debugging, automation, problem-solving
Experience: 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Reddit

Skills & Focus: Site Reliability Engineering, software development, distributed systems, Kubernetes, Cloud systems, Go, Python, observability, DevOps, automation
About the Company: Reddit is a community of communities, built on shared interests, passion, and trust. It hosts one of the most open and authentic conversations on the internet.
Experience: 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
Benefits: Pension Scheme, Private Medical and Dental Scheme, Life Assurance, Income Protection, Workspace benefit for your home o…
Skills & Focus: Site Reliability Engineering, software development, distributed systems, Kubernetes, Cloud systems, Go, Python, observability, DevOps, automation
About the Company: Reddit is a community of communities, built on shared interests, passion, and trust. It hosts one of the most open and authentic conversations on the internet.
Experience: 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
Benefits: Pension Scheme, Private Medical and Dental Scheme, Life Assurance, Income Protection, Workspace benefit for your home o…

Vitesse

Skills & Focus: Site Reliability Engineer, Cloud Platform Management, Infrastructure Design, Infrastructure as Code, Continuous Integration, Monitoring and Observability, Docker, Kubernetes, Azure, Terraform
About the Company: Vitesse – the treasury and payment partner of choice for insurance. Formed in 2014 by a team of proven FinTech entrepreneurs, we are an FCA-regulated business …
Experience: 3+ years of experience in a Site Reliability Engineer, DevOps, Platform Engineer, or similar role.
Benefits: 25 days Holiday per year, Hybrid working arrangements, Contributory pension scheme, Enhanced Parental leave, Cycle to W…

Google

Skills & Focus: Site Reliability Engineering, Software Development, Distributed Systems, Incident Management, Cloud Computing, Automation, Capacity Planning, Technical Leadership, Problem Solving, Telemetry Systems
About the Company: Google is a global technology company a leader in internet services, software, and hardware.
Experience: 5 years with software development and 3 years in designing and troubleshooting distributed systems.
Type: Full-time

Blacklane

Skills & Focus: Site Reliability Engineering, Developer Experience, Technical Leadership, Collaboration, Problem Solving, Mentorship, Scalability, Resilience, Security, Continuous Improvement
About the Company: Blacklane provides professional chauffeur services, prioritizing scalability, resilience, and security in their platform. They emphasize employee growth, commu…
Experience: Proven expertise in Site Reliability Engineering
Salary: Up to 120,000
Type: Full-time
Benefits: Fair Pay & Shared Success, Mystery Rides, Learning & Development, Office lunches, Mental Health support, Social respons…

Axon

Skills & Focus: cloud-native, site reliability, Azure, AWS, Kubernetes, Python, Go, CI/CD, Infrastructure as Code, observability
About the Company: At Axon, we’re on a mission to Protect Life. We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and clou…
Experience: 10+ years of applicable experience.
Benefits: Competitive Base Salary, Annual Bonus and Restricted Stock Unit Eligibility, Comprehensive Pension Plan with Matching C…

Gfk

Skills & Focus: cloud engineering, Google Cloud Platform, container orchestration, Kubernetes, Infrastructure as Code, Terraform, GitOps, monitoring, CI/CD pipelines, system reliability

Braze

Skills & Focus: Platform Software Engineers, Infrastructure as a Service, automation, high-scale data, Kubernetes, API-driven systems, operational discipline, incident management, monitoring and alerting, distributed systems
About the Company: Braze is the leading customer engagement platform that empowers brands to Be Absolutely Engaging.™ Braze allows any marketer to collect and take action on any …
Experience: 5+ years of full-stack development experience
Type: Full-time
Benefits: Competitive compensation that may include equity, Retirement and Employee Stock Purchase Plans, Flexible paid time off,…

Pleo

Skills & Focus: Grafana, AWS, GCP, Terraform, Kubernetes, Flux, GitOps, Istio, GitHub Actions, Go
About the Company: Pleo is a scale-up in the fintech industry, focused on providing the best possible experience for customers while managing expenses. Their mission is to help e…
Type: Full-time
Benefits: Your own Pleo card, catering for lunch in offices, private health insurance, 25 days of holiday + public holidays, flex…

Palantir

Skills & Focus: Site Reliability Engineering, high-performance services, scalable services, production infrastructure, automate processes, systems design, production issues, diagnosing problems, resolving issues, engineering experience
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time
Skills & Focus: Site Reliability Engineering, high-performance services, scalable services, production infrastructure, automate processes, systems design, production issues, diagnosing problems, resolving issues, engineering experience
About the Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
Type: Full-time

Cloudflare

Skills & Focus: Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, DevOps, Network Protocols, Configuration management, Automation solutions, Documentation, Technical leadership
About the Company: At Cloudflare, we are on a mission to help build a better Internet. We run one of the world’s largest networks that powers millions of websites and other Inter…
Experience: Minimum of 3 years of prior relevant experience in Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, and/or DevOps experience
Type: Hybrid
Skills & Focus: Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, DevOps, Network Protocols, Configuration management, Automation solutions, Documentation, Technical leadership
About the Company: At Cloudflare, we are on a mission to help build a better Internet. We run one of the world’s largest networks that powers millions of websites and other Inter…
Experience: Minimum of 3 years of prior relevant experience in Data Center Operations, Site Reliability Engineering, Linux systems administration, Network Engineering, and/or DevOps experience
Type: Hybrid

Ably

Skills & Focus: Site Reliability Engineering, Infrastructure, Cloud-native, Automation, Software development, Linux, AWS, Containerization, Observability, Cross-functional collaboration
About the Company: Ably provides a suite of products to build, extend, and deliver powerful digital experiences in realtime, delivering billions of messages for millions of devic…
Type: Full-time
Benefits: Enhanced holiday allowance, equity options, home workstation budget, personal development budget, private healthcare, e…

Blacklane

Skills & Focus: Site Reliability Engineer, SRE, system reliability, observability, mentoring, scalable systems, microservices, AWS, Kubernetes, Terraform
Benefits: Fair Pay & Shared Success, Blacklane Mystery Rides, Learning & Development, weekly homecooked office lunches, health an…

Pioneering Technology Company

Skills & Focus: Site Reliability Engineer, SRE, Language Models, Machine Learning, infrastructure, reliability, scalability, performance, monitor system health, cross-functional teams
About the Company: Specialising in cutting-edge Language Models (LLM) and Machine Learning solutions.

Anthropic

Skills & Focus: Service Level Objectives, monitoring systems, high-availability infrastructure, automated failover, incident response, cost optimization, distributed systems, SLO/SLA frameworks, chaos engineering, AI infrastructure
About the Company: Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a who…
Salary: Annual Salary: ÂŁ255,000 - ÂŁ390,000 GBP
Type: Full-time
Benefits: Competitive compensation, optional equity donation matching, generous vacation and parental leave, flexible working hou…

Wheely

Skills & Focus: infrastructure, cloud-based solutions, AWS, Golang, Python, Ansible, Terraform, Docker, Kubernetes, SQL/NoSQL databases
About the Company: Wheely is not a traditional ride-hailing company. We are building a platform with user privacy at its core while successfully scaling a five-star service to mi…
Experience: Proven experience with cloud-based solutions (AWS/GCP); Ability to code in Golang, Python, etc.; Experience in infrastructure automation using Ansible, Terraform, Bash, Python, etc.
Type: On-site
Benefits: Flexible working hours, employee stock options plan, lunch allowance, medical insurance including dental, life and crit…

Blockchain

Skills & Focus: Site Reliability Engineer, infrastructure, monitoring tools, GCP, AWS, Terraform, GitOps, containerization, programming languages, configuration management
About the Company: Blockchain is the world's leading software platform for digital assets, offering the largest production blockchain platform globally and aiming to build an ope…
Experience: Experience with containerization and service orchestration; strong knowledge of at least one programming language; experience with cloud solutions; experience with modern monitoring tools; experience with infrastructure as code tools; solid background with configuration management tools; experience with GitOps and CI; experience with messaging systems; experience with database management.
Salary: Full-time salary based on experience and meaningful equity
Type: Full-time
Benefits: Hybrid model working from home & awesome office location; unlimited vacation policy; Apple equipment; flexible work cul…

Birdie

Skills & Focus: AWS, cloud-native architecture, Kubernetes, DevOps, CI/CD, SRE, security best-practices, micro-services, platform engineering, observability
About the Company: Birdie is the leading home healthcare technology platform that aims to radically transform the lives of older adults. Its all-in-one solution supports around 4…
Salary: ÂŁ105k - ÂŁ125k per annum
Type: Full-time
Benefits: Compensation includes competitive salary packages, generous stock options, L&D budget, Work From Home budget, 33 days o…