Last updated: 2025-07-21
119 Site Reliability Engineering jobs in Remote - United States.
Anagram
Anagram is a digital assets holding company investing in and incubating innovative crypto projects. Our portfolio companies span multiple blockchain ecosystems…
United States
- Skills: Site Reliability Engineer, DevOps, Infrastructure as Code, Terraform, Kubernetes, CI/CD, Automation, Cloud-native environments, Blockchain node infrastructure, Web3 security considerations
- Level: mid
- Type: full_time
Ashby
Ashby builds software that lets talent teams build an efficient, delightful, respectful hiring process.
United States
- Skills: Platform Engineer, Site Reliability Engineer, infrastructure, SQL, data warehouse, automated guardrails, event-driven architecture, developer tooling, Kubernetes debugging, monitoring tools
- Level: mid
- Type: full_time
Assured
Assured is on a mission to modernize insurance by providing software solutions for claims processing. They focus on automating and enhancing claim filing and f…
United States
- Skills: Site Reliability Engineer, infrastructure, SaaS platforms, automation, scalable infrastructure, PostgreSQL, security regulations, monitoring, compliance, engineering
- Level: mid
- Type: full_time
Affirm
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compound…
United States
- Skills: cloud-native, incident response, observability, automation, system resilience, backend systems, high availability, observability stacks, devops, scalable
- Level: mid
- Type: contract
Conduit
At Conduit, we're building the rollup-native cloud platform that will scale Ethereum, combining web2 engineering best-practices with web3 rollup technology to …
United States
- Skills: onchain compute, rollup, cloud platform, technical support, solutions engineering, B2B SaaS, debugging, infrastructure, Kubernetes, Ethereum
- Level: mid
- Type: full_time
One
One’s mission is simple - to help customers achieve financial progress. We’re doing this by creating simple solutions to help our customers save, spend, borrow…
United States
- Skills: Site Reliability Engineer, SRE, distributed systems, observability, incident management, cloud native, CI/CD, automation, scalable systems, mentorship
- Level: senior
- Type: full_time
Cabify
Cabify aims to make cities better places to live by improving mobility, offering various transport solutions including scooters and mopeds.
United States
- Skills: Site Reliability Engineer, infrastructure, scalability, automation, observability, SLOs, SLIs, SLAs, programming languages, Kubernetes
- Level: mid
- Type: full_time
Canonical
Canonical is a pioneering tech firm that is at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important…
United States
- Skills: Linux, Infrastructure as Code, Automation, Gitops, Kubernetes, Cloud Infrastructure, Observability Tools, Python, Site Reliability Engineering, Open Source
- Level: mid
- Type: full_time
Paradigm
Paradigm is a zero-fee, institutional liquidity network for derivatives traders across CeFi and DeFi, providing unified access to multi-asset, multi-protocol l…
United States
- Skills: Site Reliability Engineering, Kubernetes, AWS, Docker, Terraform, Cloud Infrastructure, Multi-Cloud Environments, Incident Response, Team Leadership, Cloud Security
- Level: mid
- Type: full_time
MachineFi Lab
At MachineFi Lab, we're not just envisioning the future; we're actively building it—today. We power the new reward economy by fostering a fairer, safer, and mo…
United States
- Skills: DevOps, SRE, Blockchain, Infrastructure, Automation, Deployment Pipelines, System Performance, Scalability, Reliability, Cross-Functional Collaboration
- Level: mid
- Type: part_time
Tailscale
Tailscale is building the new Internet by delivering software that makes it easy to securely interconnect people and their devices, no matter where they are. F…
United States
- Skills: Software Engineer, CI/CD, Developer Tooling, Distributed Systems, SQL Databases, Infrastructure as Code, Observability, Remote Work, Full-time, Networking
- Level: mid
- Type: full_time
Regrello
Regrello is a 40-person startup reimagining automation in supply chains, focusing on building a global operating network that enables supply-chain companies to…
United States
- Skills: Site Reliability Engineer, cloud infrastructure, automation, supply chain, developer platform, security requirements, reliability, AI engine, GPU infrastructure, hybrid culture
- Level: mid
- Type: full_time
Sporty Group
Sporty is a 100% remote company providing online gaming brands, partnered with world-renowned champions and serving hundreds of millions of visitors globally.
United States
- Skills: AWS, Kubernetes, Infrastructure, DevOps, SRE, Cloud Computing, Monitoring, Deployment, Security, Python
- Level: mid
- Type: full_time
Spreedly
Spreedly is the world's leading Open Payments Platform, enabling and optimizing digital transactions with a complete payment services marketplace. Its PCI-comp…
United States
- Skills: Site Reliability Engineer, reliability, observability, scalability, cloud architectures, infrastructure, payments platform, software development, application stack, system performance
- Level: mid
- Type: full_time
Valarian Technologies
Valarian Technologies is a dual-use technology company building critical tools to safeguard the future in an era of evolving global security challenges.
United States
- Skills: Site Reliability Engineer, observability strategy, high availability, fault tolerance, disaster recovery plan, incident detection, performance tuning, capacity planning, Kubernetes, Docker Swarm
- Level: mid
- Type: full_time
Nagarro
Nagarro is a rapidly growing company focused on building new offerings targeted towards diverse market segments in the horizontal tech space, creating new comp…
United States
- Skills: Performance Tuning, Performance Testing, Cloud architecture, Observability, Java/.NET, SQL/NoSQL, Performance engineering, Scalability, Reliability, Architecture design
- Level: mid
- Type: full_time
Paxos
We’re on a mission to open the world’s financial system to everyone by enabling the instant movement of any asset, any time, in a trustworthy way. For over a d…
United States
- Skills: AWS, RDS, PostgreSQL, Aurora, site reliability engineering, automation scripts, infrastructure as code, Terraform, Docker, Kubernetes
- Level: mid
- Type: full_time
GoDaddy
GoDaddy is empowering everyday entrepreneurs around the world by providing the help and tools to succeed online, making opportunity more inclusive for all. GoD…
United States
- Skills: Ceph, site reliability engineering, storage solutions, performance tuning, monitoring and troubleshooting, OpenStack, Agile methodologies, scripting languages, Linux/Unix systems, problem-solving
- Level: senior
- Type: full_time
rackspace
United States
- Skills: MongoDB, database reliability, performance, scalability, automation, index management, monitoring, backup, disaster recovery, infrastructure-as-code
- Level: mid
- Type: full_time
Synopsys
At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving…
United States
- Skills: Linux, observability tools, DevOps, EDA, HPC, monitoring solutions, performance metrics, data analytics, Kubernetes, Ansible
- Level: mid
- Type: full_time