Last updated: 2025-06-06

377 AI Performance Engineering jobs in San Francisco.

Hiring now: Tech Lead Manager @ Baseten, Research Engineer @ Character.AI, Sr AI Performance Engineer @ Crusoe, Software Eng Manager @ Applied Intuition, Deep Learning Performance Engineer @ Genmo, Founding Engineer @ Outspeed, Sr GPU Optimization Engineer @ Succinct, Sr Firmware Architect @ Celestial AI, Software Engineer @ Descript, Cryptography Hardware Engineer @ Fabric. Explore more at jobswithgpt.com

🔥 Skills

CUDA (10) Python (10) performance optimization (9) Machine Learning (9) PyTorch (8) C++ (8) performance (8) cryptography (7) machine learning (6) Deep Learning (6)

📍 Locations

Santa Clara (27) San Francisco (17) Palo Alto (5) Menlo Park (4) Mountain View (4) Burlingame (4) Foster City (3) Sunnyvale (3) Oakland (2) San Leandro (1)

Baseten

Join our dynamic team at Baseten, where we’re revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors, we’re trus…

Tech Lead Manager

Oakland

  • Skills: ML model inference, performance optimization, technical leadership, cross-functional collaboration, TensorRT, PyTorch, CUDA, Docker, Kubernetes, large language models (LLMs)
  • Experience: 5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.

Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using…

Research Engineer

Menlo Park

  • Skills: Machine Learning, GPU Clusters, Model Development, Triton Kernels, Distributed Systems, Reinforcement Learning, PyTorch, LLM Inference, CUDA, Kubernetes
  • Experience: PhD (or equivalent) research experience

Crusoe

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company, pioneering vertically integrated AI infrastructure solutions trusted by Fortune …

Senior AI Performance Engineer

San Francisco

  • Skills: AI infrastructure, CUDA, deep learning, performance optimization, inference engines, Python, GPU architecture, scalable AI infrastructure, performance analysis, open-source contributions
  • Experience: Experience with deep learning frameworks, proficiency in Python, and expertise in CUDA or OpenCL.
  • Type: Full-time
  • Salary: $183,000 - $210,000 + Bonus

Applied Intuition

Applied Intuition is a vehicle software supplier that accelerates the adoption of safe and intelligent machines worldwide. Founded in 2017, Applied Intuition d…

Software Engineering Manager

Mountain View

  • Skills: software engineering, machine learning, neural simulation, computer vision, 4D reconstruction, Python, C++, Gaussian splatting, NeRFs, Diffusion
  • Experience: 4+ years in the machine learning field, 2+ years of people management experience
  • Type: Full-time
  • Salary: $204,000 - $343,000 USD annually

Genmo

A research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.

Deep Learning Performance Engineer

San Francisco

  • Skills: Deep Learning, Performance Optimization, CUDA, Multi-GPU, TensorFlow, PyTorch, Benchmarking, High-Performance Computing, Distributed Training, Generative AI
  • Experience: 5+ years in optimizing deep learning models, preferably in a production environment.

Outspeed

Outspeed is solving one of the biggest challenges with current AI systems - latency. We are building the infra for real-time AI for applications in gaming, AR/…

Founding Engineer

Oakland

  • Skills: real-time AI, GPU infrastructure, inference engines, distributed inference, voice models, video models, ML application development, PyTorch, Transformers, cloud features
  • Experience: Experience in end-to-end ML application development, contributing to the architecture of distributed ML systems, and strong understanding of ML development and inference technologies.
  • Type: Full-time
  • Salary: Competitive salary + Equity

Succinct

Succinct is focused on making zero knowledge proofs accessible to any developer, with infrastructure solutions that bridge the gap in blockchain scaling, inter…

Senior GPU Optimization Engineer

San Francisco

  • Skills: GPU performance optimization, CUDA, performance profiling, low-level GPU kernels, NVIDIA Nsight Systems, latency bottlenecks, GPU software stacks, compiler optimizations, Rust, C++
  • Experience: Expert understanding of GPU and CPU architecture; strong proficiency in GPU software stacks and performance optimization.
  • Type: Full-time
  • Salary: Above-market salary and generous equity compensation

Celestial AI

Celestial AI is focused on developing advanced interconnect technology for data centers, specifically through its innovation called Photonic Fabric™, which enh…

Senior Firmware Architect

Santa Clara

  • Skills: firmware architecture, AI platform development, C/C++ programming, Python, Rust, high-speed communication, memory controllers, debugging, agile team environment, performance characterization
  • Experience: 10+ years with Bachelor’s degree or 8+ years with Master’s degree in Computer Science, Electrical Engineering, Information Technology, or a related technical field
  • Type: Full-time
  • Salary: $210,000.00 - $240,000.00

Descript

Descript is building a simple, intuitive, fully-powered editing tool for video and audio — an editing tool built for the age of AI. We are a team of 150 — with…

Software Engineer

San Francisco

  • Skills: AI, video editing, streaming platform, high performance systems, media platform, digital video technologies, codecs, open-source, debug video systems, mentor engineering team
  • Experience: 5+ years of professional software development experience
  • Type: Full-time
  • Salary: $160,000 - $230,000/year

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Cryptography Hardware Engineer

Santa Clara

  • Skills: Cryptography, Hardware, Zero Knowledge Proofs, ZKP protocols, Framework Improvement, Customer Advisory, Programming Skills, Cryptocurrency, Cryptographic Products, LLVM
  • Experience: Deep knowledge of cryptography, programming skills, and experience working with cryptocurrency or cryptographic products and LLVM.

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: Compiler Engineer, code generation, massively-parallel computing, compiler technologies, hardware-aware optimizations, performance, custom hardware, cryptography, secure computation, zero knowledge proofs

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: compiler, code generation, massively-parallel computing, hardware-aware optimizations, custom hardware, advanced compiler technologies, performance, trust, privacy, identity

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: Compiler Engineer, code generation, massively-parallel computing, hardware-aware optimizations, advanced compiler technologies, custom hardware, performance, cryptography, secure computation, hardware design

Fabric

Building towards a future beyond trust with custom chips for advanced cryptography. Fabric is a fast-growing Series A deep tech company (full-stack across sili…

Design Verification Engineer

Santa Clara

  • Skills: Design Verification, Cryptography, Processors, Hardware Innovation, Functionality, Performance, Reliability, Computer Chip, Compiler Stack, Acceleration Libraries

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Systems Performance Engineer

Santa Clara

  • Skills: Systems Performance, low-level programming, assembly language, performance optimization, performance-critical systems, custom hardware platforms, cryptography, encryption, trust, privacy

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Design Verification Engineer

Santa Clara

  • Skills: Design Verification, GDDR, Graphics Double Data Rate, cryptographic processors, functionality, performance, reliability, hardware innovation, cryptography, memory

Fuse

An early-stage fusion company focused on accelerating the transition to fusion energy.

Computational Physicist

San Leandro

  • Skills: Computational Physicist, models, simulations, physics, mathematics, computational methods, complex physical systems, materials science, energy, quantum computing

Hive

Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and mos…

Data Center Technician

San Francisco

  • Skills: data center, hardware maintenance, server software, installation, troubleshooting, repair, infrastructure management, AI solutions, machine learning systems, high-performance GPU
  • Experience: hardware and facilities operations experience
  • Type: Full-time

Luma

Building some of the biggest & fastest AI supercomputing clusters in the world.

High-Performance Computing Engineer

Palo Alto

  • Skills: High-Performance Computing, AI supercomputing, CPU, GPU, network devices, Linux kernel optimization, user-space code, system automation, monitoring systems, large-scale deployment

Luma AI

Site Reliability Engineer (SRE)

Palo Alto

  • Skills: Site Reliability Engineer, SRE, Infrastructure, GPU clusters, H100 GPUs, Monitoring tools, Management tools, Performance problems, Maintenance problems, Data Processing

Quadric

Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices. The company was co-founded by technologists from MIT and …

Senior Software Engineer

Burlingame

  • Skills: neural networks, algorithmic optimization, deep learning, graph-based execution, C++, application-appropriate heuristics, NP-hard problems, classical algorithms, machine learning, DSP
  • Experience: Minimum of eight years in the industry, MS or Ph.D. in Computer Science or related field
  • Type: Full-time

Confidential

We are a team of mission-driven engineers with experience across aerospace, robotics, and self-driving cars working to build safety-enhancing technology for av…

Flight Software Infrastructure Engineer

Mountain View

  • Skills: flight software, infrastructure selection, cloud computing, continuous integration, hardware-in-the-loop, simulation infrastructure, collaborative environment, optimization, performance expectations, stakeholder coordination
  • Experience: 5+ years in software engineering with a focus on systems infrastructure
  • Type: Full-time

Renegade

Renegade is building an unstoppable network for the anonymous exchange of value. Our core permissionless protocol, the Renegade dark pool, solves many problems…

Systems Engineer

San Francisco

  • Skills: dark pool, decentralized exchange, trade privacy, network performance, throughput, off-chain networking, Indexer, caching layer, pubsub system, on-chain verification
  • Type: Full-time

Rigetti Computing

Rigetti is developing superconducting quantum computers with a focus on understanding and improving QPU performance and scale through advanced theoretical and …

Principal Device Theorist

Berkeley

  • Skills: superconducting quantum computers, theoretical tools, circuit QED modeling, qubit operations, gate operation fidelity, readout, coherence, noise reduction, quantum advantage, error correction
  • Experience: Experienced
  • Type: Full-time

Cutting-edge Hardware Startup

A startup focused on reimagining silicon and creating RISC-V based computing platforms.

Silicon Verification Engineer

Santa Clara

  • Skills: Silicon, Verification, RISC-V, Computing platforms, Performance, Energy efficiency, Scalability, Designs, Engineers, Startup
  • Type: Full-time

Unnamed Hardware Startup

A cutting-edge and well-funded hardware startup focused on reimagining silicon and creating transformative computing platforms.

Test Engineer

Santa Clara

  • Skills: hardware startup, silicon, computing platforms, test generation, performance validation, functionality validation, accelerator design, hardware teams, software teams, energy efficiency
  • Experience: 3-5 years in hardware testing or related fields
  • Type: Full-time

Rivos

Rivos is focused on developing cutting-edge accelerator designs for high-performance computing environments.

Post-Silicon Performance Engineer

Santa Clara

  • Skills: Post-Silicon Performance, Performance Validation, Server Workloads, Data Analytics, AI/ML, Silicon Performance Measurements, Power/Performance Ratios, Performance Benchmarks, Collaboration, Debugging
  • Experience: 5+ years in SoC bring-up and validation
  • Type: Full-time

Cutting-edge Hardware Startup

A well-funded hardware startup in Silicon Valley focused on RISC-V based accelerated computing platforms.

SoC Performance Architect

Santa Clara

  • Skills: SoC, Performance Architect, RISC-V, accelerated computing, energy efficiency, scalability, hardware design, silicon, engineering, performance
  • Experience: Experienced candidates
  • Type: Full-time

Cutting-edge hardware startup

Focused on reimagining silicon and creating RISC-V based accelerated computing platforms.

Junior Power-Management Architect

Santa Clara

  • Skills: Power/Performance modeling, RISC-V, Accelerated computing, Energy efficiency, Scalability, Silicon redesign, Design optimization, Hardware engineering, Cutting-edge technology, Performance enhancement
  • Type: Full-time

Cutting-Edge and Well-Funded Hardware Startup

A hardware startup in Silicon Valley focused on modeling and optimizing power/performance features on RISC-V based accelerated computing platforms.

Junior Power-Management Architect

Santa Clara

  • Skills: Power Management, Performance Optimization, RISC-V, Accelerated Computing, Hardware Design, Energy Efficiency, Scalability, Silicon Technology, Engineering, Startup Environment
  • Type: Full-time

Cutting-edge Hardware Startup

A well-funded hardware startup focused on reimagining silicon and creating RISC-V based accelerated computing platforms.

Deep Learning and Large Language Model Performance Architect

Santa Clara

  • Skills: Deep Learning, Large Language Models, Performance Analysis, Computer Architecture, AI Software Stack, C/C++ Programming, Hardware Modeling, Performance Validation, GPU Programming (CUDA), LLVM/MLIR Development
  • Experience: 5+ years working experience
  • Type: Full-time

xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motiv…

CUDA Kernel Engineer

San Francisco

  • Skills: CUDA, GPU, kernel optimizations, deep learning, Tensor cores, Nsight, memory-bound, compute-bound, pybind, high-performance
  • Experience: Experience with CUDA, high-performance computing, and low-level kernel optimizations.
  • Type: Full-time
  • Salary: $180,000 - $440,000 USD

XPENG Motors

XPENG Motors is one of China’s leading smart electric vehicle (“EV”) companies. We design, develop, manufacture and market smart EVs that are seamlessly integr…

Machine Learning Engineer - AI Foundation

Santa Clara

  • Skills: Machine Learning, deep learning models, PyTorch, transformer architecture, model training framework, distributed training, autonomous driving, TorchScript, NVIDIA TensorRT, training and inference
  • Experience: Master's in CS/CE/EE, or equivalent industry experience
  • Type: Full-time
  • Salary: $148,909 - $252,000

XPeng Motors

XPeng Motors is one of China’s leading smart electric vehicle (“EV”) companies. We design, develop, manufacture and market smart EVs that are seamlessly integr…

AI Performance Engineer

Santa Clara

  • Skills: AI Performance Engineer, Deep Learning, NVIDIA GPU systems, autonomous driving, C++, Python, Machine Learning, performance optimization, computer architecture, parallel programming
  • Experience: 4+ years of experience with C++ and Python programming; PhD in CS/CE/EE, or equivalent industry experience.
  • Type: Full-time
  • Salary: $244,140-$413,160

Waabi

Waabi, founded by AI pioneer and visionary Raquel Urtasun, is an AI company building the next generation of self-driving technology. With a world class team an…

Software Engineer - Autonomous Driving

San Francisco

  • Skills: autonomous driving, runtime performance, optimization, C++, Python, AI-first approach, GPU, simulation engine, multi-threaded programming, rendering optimization
  • Experience: Solid programming background in Python and C++. Extensive experience with runtime performance improvement projects. Educational background: BS or higher degree in EE/CS or equivalent industry experience.
  • Type: Full-time

Zoox

Zoox is a robotics company focused on developing autonomous vehicles, leveraging high-performance computing services to enhance the development of AI models an…

Software Engineer

Foster City

  • Skills: Software Engineer, High-Performance Computing, HPC services, Distributed system design, Algorithmic job scheduling, Adaptive cloud scaling, Autonomous Vehicle, Data engineering, AI models, Developer experiences
  • Experience: Experienced
  • Type: Full-time

NVIDIA

NVIDIA is widely considered to be one of the technology world's most desirable employers, featuring creative and autonomous personnel.

Senior System Software Engineer - Scientific Computing PaaS

Santa Clara

  • Skills: Scientific computing, Cloud-native, Microservices, APIs, Distributed systems, Parallel processing, AI platforms, Performance optimization, HPC clusters, Algorithmic thinking
  • Experience: 10+ years experience working on building and operating distributed compute and data-intensive platform as a service on cloud
  • Type: Full-time
  • Salary: 180,000 USD - 339,250 USD

Meta

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Ap…

Software Engineer, Systems ML - SW/HW Co-design

Menlo Park

  • Skills: AI Infrastructure, Machine Learning, Deep Learning, Hardware Accelerators, GPU Architecture, AI Algorithms, C/C++ Programming, Python, Performance Optimization, SW/HW Co-design
  • Experience: 7+ years with Bachelor's, 4+ years with Master's, or 3+ years with PhD in relevant technical field with experience in AI framework development or hardware acceleration.
  • Type: Full-time
  • Salary: $70.67/hour to $208,000/year + bonus + equity + benefits

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Cryptography Hardware Engineer

Santa Clara

  • Skills: cryptography, hardware, ZKP protocols, framework improvement, customer advisory, secure computation, programming skills, cryptocurrency, cryptographic products, LLVM
  • Experience: Deep knowledge of cryptography, programming skills, experience with cryptocurrency or cryptographic products, LLVM

Meta Platforms Inc.

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Ap…

ASIC Engineer, Machine Learning Architecture

Sunnyvale

  • Skills: ASIC, Machine Learning, SoCs, data center, architecture, performance models, C, C++, Python, object-oriented programming
  • Experience: 2+ years of experience with programming skills in C, C++, Python or other Object Oriented Programming Language.
  • Type: Full-time
  • Salary: $114,000/year to $166,000/year + bonus + equity + benefits