Last updated: 2025-07-21
35 Site Reliability Engineering jobs in New York City.
Alchemy
Alchemy is the only complete developer platform that offers the powerful APIs, SDKs, and tools necessary to build and scale onchain apps and rollups. Our infra…
New York City
- Skills: Reliability, Observability, Infrastructure Engineer, Production Systems, AWS, Docker, Kubernetes, CI/CD, Infrastructure-as-Code, Engineering Excellence
- Level: mid
- Type: full_time
Rogo
We're building Al thought partners to make people smarter and more creative, accelerating the creation and sharing of knowledge in financial services. Our team…
New York City
- Skills: AWS, Azure, Kubernetes, Infrastructure as Code, Terraform, Datadog, Cloud Infrastructure, CI/CD Pipelines, Linux Administration, Monitoring Tools
- Level: mid
- Type: full_time
Hedge Fund in NYC
A technology company focused on software development and infrastructure support.
New York City
- Skills: Reliability Engineer, software development, infrastructure support, Python, Java, C/C++, Go, distributed software applications, monitoring, performance tuning
- Level: mid
- Type: full_time
Hebbia
Hebbia is AI that works the way you work, designed to be generally capable and capable of tackling complex tasks by citing answers from various sources. Its mi…
New York City
- Skills: Site Reliability Engineer, DevOps Engineer, CI/CD pipelines, cloud platforms, AWS, monitoring tools, observability, infrastructure-as-code, Docker, Kubernetes
- Level: mid
- Type: full_time
Kontakt.io
Kontakt.io is building the platform that care operations run on. We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and…
New York City
- Skills: SRE (Site Reliability Engineering), cloud-based platform, automation, incident response, performance improvement, observability, self-healing automation, infrastructure management, healthcare technology, real-time systems
- Level: mid
- Type: full_time
Palantir
Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
New York City
- Skills: Kubernetes, K8s clusters, production infrastructure, automation, self-healing systems, cloud hyperscalers, engineering rigor, operational excellence, scale, security
- Level: mid
- Type: full_time
Vercel
Vercel’s Frontend Cloud provides the developer experience and infrastructure to build, scale, and secure a faster, more personalized web. Customers like Under …
New York City
- Skills: SRE, Site Reliability Engineering, cloud, incident management, disaster recovery, distributed system design, engineering management, operational efficiency, service quality, technical risk management
- Level: mid
- Type: full_time
Gusto
Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, …
New York City
- Skills: storage infrastructure, MySQL, Postgres, data streaming, Kafka, cloud platforms, AWS, Terraform, resiliency, automation
- Level: mid
- Type: full_time
Bumble Inc.
Bumble Inc. is an equal opportunity employer that encourages applications from diverse candidates, including various age groups, genders, and those with disabi…
New York City
- Skills: Site Reliability Engineering, reliability, scalability, performance, software systems, infrastructure management, automation, development, security, operations
- Level: mid
- Type: full_time
Virtu
Virtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutio…
New York City
- Skills: Site Reliability Engineering, Windows/Linux, High-Stress Environments, Micro-Market Structure, Technical Agility, Financial Systems, Programming Languages, Configuration Management, SQL, Networking
- Level: mid
- Type: full_time
NetApp
NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer. No matter the data type, workload or …
New York City
- Skills: Cloud, Software Engineering, SRE, Incident Management, Observability, Application Security, Python, Golang, DevSecOps, Virtualization
- Level: mid
- Type: full_time
The Trade Desk
The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. H…
New York City
- Skills: network reliability, software engineering, networking protocols, Kubernetes, cloud environments, network automation, resilient systems, infrastructure-as-code, troubleshooting, DevOps
- Level: mid
- Type: full_time
Celonis
Celonis is the global leader in Process Mining technology and one of the fastest-growing SaaS companies worldwide. We are on a mission to unlock unprecedented …
New York City
- Skills: Site Reliability Engineering, Software Engineering, Process Mining, Kubernetes, AWS, Azure, GCP, Cloud-based applications, Operational excellence, Automation
- Level: mid
- Type: full_time
Betterment
Betterment is a leading, technology-driven financial services company that offers investing and retirement solutions for retail investors and investment adviso…
New York City
- Skills: Site Reliability Engineering, Developer Experience, AWS, CI/CD pipelines, cloud native solutions, service-level objectives, GitHub, operational excellence, mentorship, agility
- Level: mid
- Type: full_time
Uniswap Labs
The Uniswap Labs team is building products to unlock value through universal exchange. We envision a future where digital economies flourish, and markets are t…
New York City
- Skills: reliability, performance, monitoring, cloud, automation, DevOps, architecture, scalability, incident response, best practices
- Level: mid
- Type: full_time
Ro
Ro is a direct-to-patient healthcare company offering nationwide telehealth, labs, and pharmacy services, focusing on patient-centric healthcare solutions.
New York City
- Skills: scalability, reliability, security, infrastructure, Site Reliability, Developer Workflow, Observability, Data Infrastructure, growth, strategy
- Level: mid
- Type: full_time
Farther
Farther is a rapidly growing RIA that combines expert advisors with cutting-edge technology - delivering a comprehensive, tailored wealth management experience…
New York City
- Skills: TypeScript, JavaScript, CI/CD, automation frameworks, test infrastructure, mocking, containerization, monitoring, observability, debugging
- Level: mid
- Type: full_time
Palantir
Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…
New York City
- Skills: site reliability engineering, production infrastructure, automate processes, scalable systems, network configuration, hardware setup, production issues, systems design, US Government, data-driven decisions
- Level: mid
- Type: full_time
Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the intern…
New York City
- Skills: Site Reliability Engineering, Distributed systems, Kubernetes, Cloud systems, Prometheus, Thanos, Grafana, DevOps, Automation, High-traffic backend systems
- Level: mid
- Type: full_time
Spotify
Spotify is a leading global platform for music streaming, offering millions of songs and podcasts to users worldwide.
New York City
- Skills: infrastructure, cloud, observability, security, developer experience, production environment, developer tooling, open-source, scalability, team collaboration
- Level: mid
- Type: full_time