Last updated: 2025-07-21

35 Site Reliability Engineering jobs in New York City.

Alchemy

Alchemy is the only complete developer platform that offers the powerful APIs, SDKs, and tools necessary to build and scale onchain apps and rollups. Our infra…

Infrastructure Engineer (Reliability Focus)

New York City

  • Skills: Reliability, Observability, Infrastructure Engineer, Production Systems, AWS, Docker, Kubernetes, CI/CD, Infrastructure-as-Code, Engineering Excellence
  • Level: mid
  • Type: full_time

Rogo

We're building AI thought partners to make people smarter and more creative, accelerating the creation and sharing of knowledge in financial services. Our team…

Cloud Infrastructure Engineer

New York City

  • Skills: AWS, Azure, Kubernetes, Infrastructure as Code, Terraform, Datadog, Cloud Infrastructure, CI/CD Pipelines, Linux Administration, Monitoring Tools
  • Level: mid
  • Type: full_time

Hedge Fund in NYC

A technology company focused on software development and infrastructure support.

Reliability Engineer

New York City

  • Skills: Reliability Engineer, software development, infrastructure support, Python, Java, C/C++, Go, distributed software applications, monitoring, performance tuning
  • Level: mid
  • Type: full_time

Hebbia

Hebbia is AI that works the way you work, designed to be generally capable and to tackle complex tasks by citing answers from various sources. Its mi…

Site Reliability Engineer (SRE)

New York City

  • Skills: Site Reliability Engineer, DevOps Engineer, CI/CD pipelines, cloud platforms, AWS, monitoring tools, observability, infrastructure-as-code, Docker, Kubernetes
  • Level: mid
  • Type: full_time

Kontakt.io

Kontakt.io is building the platform that care operations run on. We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and…

SRE Leader

New York City

  • Skills: SRE (Site Reliability Engineering), cloud-based platform, automation, incident response, performance improvement, observability, self-healing automation, infrastructure management, healthcare technology, real-time systems
  • Level: mid
  • Type: full_time

Palantir

Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…

Senior Software Engineer - Substrate

New York City

  • Skills: Kubernetes, K8s clusters, production infrastructure, automation, self-healing systems, cloud hyperscalers, engineering rigor, operational excellence, scale, security
  • Level: mid
  • Type: full_time

Vercel

Vercel’s Frontend Cloud provides the developer experience and infrastructure to build, scale, and secure a faster, more personalized web. Customers like Under …

SRE Manager

New York City

  • Skills: SRE, Site Reliability Engineering, cloud, incident management, disaster recovery, distributed system design, engineering management, operational efficiency, service quality, technical risk management
  • Level: mid
  • Type: full_time

Gusto

Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, …

Storage Infrastructure Engineer

New York City

  • Skills: storage infrastructure, MySQL, Postgres, data streaming, Kafka, cloud platforms, AWS, Terraform, resiliency, automation
  • Level: mid
  • Type: full_time

Bumble Inc.

Bumble Inc. is an equal opportunity employer that encourages applications from diverse candidates, including various age groups, genders, and those with disabi…

Site Reliability Engineer (SRE)

New York City

  • Skills: Site Reliability Engineering, reliability, scalability, performance, software systems, infrastructure management, automation, development, security, operations
  • Level: mid
  • Type: full_time

Virtu

Virtu is a leading financial firm that leverages cutting-edge technology to deliver liquidity to the global markets and innovative, transparent trading solutio…

Site Reliability Engineer

New York City

  • Skills: Site Reliability Engineering, Windows/Linux, High-Stress Environments, Micro-Market Structure, Technical Agility, Financial Systems, Programming Languages, Configuration Management, SQL, Networking
  • Level: mid
  • Type: full_time

NetApp

NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer. No matter the data type, workload or …

Software Engineer SRE (Observability, Incident Management)

New York City

  • Skills: Cloud, Software Engineering, SRE, Incident Management, Observability, Application Security, Python, Golang, DevSecOps, Virtualization
  • Level: mid
  • Type: full_time

The Trade Desk

The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. H…

Senior Software Engineer – Network Reliability Engineering

New York City

  • Skills: network reliability, software engineering, networking protocols, Kubernetes, cloud environments, network automation, resilient systems, infrastructure-as-code, troubleshooting, DevOps
  • Level: mid
  • Type: full_time

Celonis

Celonis is the global leader in Process Mining technology and one of the fastest-growing SaaS companies worldwide. We are on a mission to unlock unprecedented …

Site Reliability Engineer

New York City

  • Skills: Site Reliability Engineering, Software Engineering, Process Mining, Kubernetes, AWS, Azure, GCP, Cloud-based applications, Operational excellence, Automation
  • Level: mid
  • Type: full_time

Betterment

Betterment is a leading, technology-driven financial services company that offers investing and retirement solutions for retail investors and investment adviso…

Senior Site Reliability Engineer

New York City

  • Skills: Site Reliability Engineering, Developer Experience, AWS, CI/CD pipelines, cloud native solutions, service-level objectives, GitHub, operational excellence, mentorship, agility
  • Level: mid
  • Type: full_time

Uniswap Labs

The Uniswap Labs team is building products to unlock value through universal exchange. We envision a future where digital economies flourish, and markets are t…

Senior Site Reliability Engineer (SRE)

New York City

  • Skills: reliability, performance, monitoring, cloud, automation, DevOps, architecture, scalability, incident response, best practices
  • Level: mid
  • Type: full_time

Ro

Ro is a direct-to-patient healthcare company offering nationwide telehealth, labs, and pharmacy services, focusing on patient-centric healthcare solutions.

VP of Infrastructure

New York City

  • Skills: scalability, reliability, security, infrastructure, Site Reliability, Developer Workflow, Observability, Data Infrastructure, growth, strategy
  • Level: mid
  • Type: full_time

Farther

Farther is a rapidly growing RIA that combines expert advisors with cutting-edge technology - delivering a comprehensive, tailored wealth management experience…

Software Engineer, Test Infrastructure (SETI)

New York City

  • Skills: TypeScript, JavaScript, CI/CD, automation frameworks, test infrastructure, mocking, containerization, monitoring, observability, debugging
  • Level: mid
  • Type: full_time

Palantir

Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empo…

Forward Deployed Site Reliability Engineer

New York City

  • Skills: site reliability engineering, production infrastructure, automate processes, scalable systems, network configuration, hardware setup, production issues, systems design, US Government, data-driven decisions
  • Level: mid
  • Type: full_time

Reddit

Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the intern…

Senior Site Reliability Engineer

New York City

  • Skills: Site Reliability Engineering, Distributed systems, Kubernetes, Cloud systems, Prometheus, Thanos, Grafana, DevOps, Automation, High-traffic backend systems
  • Level: mid
  • Type: full_time

Spotify

Spotify is a leading global platform for music streaming, offering millions of songs and podcasts to users worldwide.

Site Reliability Engineer

New York City

  • Skills: infrastructure, cloud, observability, security, developer experience, production environment, developer tooling, open-source, scalability, team collaboration
  • Level: mid
  • Type: full_time