Last updated: 2025-06-06

303 AI Performance Engineering jobs in San Jose.

Hiring now: Tech Lead Manager @ Baseten, Research Engineer @ Character.AI, Software Eng Manager @ Applied Intuition, Founding Engineer @ Outspeed, Sr Firmware Architect @ Celestial AI, ARM SoC Architect @ E-Space, Cryptography Hardware Engineer @ Fabric, Compiler Engineer @ Fabric Cryptography, Compiler Engineer @ Fabric Cryptography, Computational Physicist @ Fuse. Explore more at jobswithgpt.com

🔥 Skills

Machine Learning (9), Python (9), PyTorch (7), CUDA (7), C++ (7), performance (7), machine learning (6), performance optimization (5), Kubernetes (5), custom hardware (5)

📍 Locations

Santa Clara (28), Palo Alto (8), Menlo Park (4), Mountain View (4), Burlingame (4), Foster City (3), San Jose (2), Oakland (2), Sunnyvale (2), Saratoga (1)

Baseten

Join our dynamic team at Baseten, where we’re revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors, we’re trus…

Tech Lead Manager

San Jose

  • Skills: ML model inference, performance optimization, technical leadership, cross-functional collaboration, TensorRT, PyTorch, CUDA, Docker, Kubernetes, large language models (LLMs)
  • Experience: 5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.

Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using…

Research Engineer

Menlo Park

  • Skills: Machine Learning, GPU Clusters, Model Development, Triton Kernels, Distributed Systems, Reinforcement Learning, PyTorch, LLM Inference, CUDA, Kubernetes
  • Experience: PhD (or equivalent) research experience

Applied Intuition

Applied Intuition is a vehicle software supplier that accelerates the adoption of safe and intelligent machines worldwide. Founded in 2017, Applied Intuition d…

Software Engineering Manager

Mountain View

  • Skills: software engineering, machine learning, neural simulation, computer vision, 4D reconstruction, Python, C++, Gaussian splatting, NeRFs, Diffusion
  • Experience: 4+ years in the machine learning field, 2+ years of people management experience
  • Type: Full-time
  • Salary: $204,000 - $343,000 USD annually

Outspeed

Outspeed is solving one of the biggest challenges with current AI systems - latency. We are building the infra for real-time AI for applications in gaming, AR/…

Founding Engineer

Oakland

  • Skills: real-time AI, GPU infrastructure, inference engines, distributed inference, voice models, video models, ML application development, PyTorch, Transformers, cloud features
  • Experience: Experience in end-to-end ML application development, contributing to the architecture of distributed ML systems, and strong understanding of ML development and inference technologies.
  • Type: Full-time
  • Salary: Competitive salary + Equity

Celestial AI

Celestial AI is focused on developing advanced interconnect technology for data centers, specifically through its innovation called Photonic Fabric™, which enh…

Senior Firmware Architect

Santa Clara

  • Skills: firmware architecture, AI platform development, C/C++ programming, Python, Rust, high-speed communication, memory controllers, debugging, agile team environment, performance characterization
  • Experience: 10+ years with Bachelor’s degree or 8+ years with Master’s degree in Computer Science, Electrical Engineering, Information Technology, or a related technical field
  • Type: Full-time
  • Salary: $210,000.00 - $240,000.00

E-Space

E-Space is bridging Earth and space to enable hyper-scaled deployments of Internet of Things (IoT) solutions and services. We are building a highly-advanced lo…

ARM SoC Architect

Saratoga

  • Skills: ARM architecture, SoC design, System on Chip, performance targets, power targets, area targets, cross-functional teams, SoC solutions, design principles, actionable intelligence

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Cryptography Hardware Engineer

Santa Clara

  • Skills: Cryptography, Hardware, Zero Knowledge Proofs, ZKP protocols, Framework Improvement, Customer Advisory, Programming Skills, Cryptocurrency, Cryptographic Products, LLVM
  • Experience: Deep knowledge of cryptography, programming skills, and experience working with cryptocurrency or cryptographic products and LLVM.

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: Compiler Engineer, code generation, massively-parallel computing, compiler technologies, hardware-aware optimizations, performance, custom hardware, cryptography, secure computation, zero knowledge proofs

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: compiler, code generation, massively-parallel computing, hardware-aware optimizations, custom hardware, advanced compiler technologies, performance, trust, privacy, identity

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Compiler Engineer

Santa Clara

  • Skills: Compiler Engineer, code generation, massively-parallel computing, hardware-aware optimizations, advanced compiler technologies, custom hardware, performance, cryptography, secure computation, hardware design

Fabric

Building towards a future beyond trust with custom chips for advanced cryptography. Fabric is a fast-growing Series A deep tech company (full-stack across sili…

Design Verification Engineer

Santa Clara

  • Skills: Design Verification, Cryptography, Processors, Hardware Innovation, Functionality, Performance, Reliability, Computer Chip, Compiler Stack, Acceleration Libraries

Fabric Cryptography

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Systems Performance Engineer

Santa Clara

  • Skills: Systems Performance, low-level programming, assembly language, performance optimization, performance-critical systems, custom hardware platforms, cryptography, encryption, trust, privacy

Fabric Cryptography

Fabric is a fast-growing Series A deep tech company (full-stack across silicon, hardware and cloud) that was founded to build towards a future beyond trust wit…

Design Verification Engineer

Santa Clara

  • Skills: Design Verification, Cryptography, Processors, Hardware Innovation, Functionality, Performance, Reliability, Custom Chips, Cryptographic Processor, VPU

Fabric Cryptography

We are building hardware for the next generation of cryptography because we believe in creating a more trustworthy world with secure, private computation at it…

Compiler Engineer

Santa Clara

  • Skills: Compiler Engineer, code generation, massively-parallel computing, compiler technologies, hardware-aware optimizations, performance, cryptographic algorithms, zero knowledge proofs, secure computation, custom hardware
  • Experience: Exceptional expertise in code generation for massively-parallel computing architectures
  • Type: Full-time

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Design Verification Engineer

Santa Clara

  • Skills: Design Verification, GDDR, Graphics Double Data Rate, cryptographic processors, functionality, performance, reliability, hardware innovation, cryptography, memory

Fuse

An early-stage fusion company focused on accelerating the transition to fusion energy.

Computational Physicist

San Leandro

  • Skills: Computational Physicist, models, simulations, physics, mathematics, computational methods, complex physical systems, materials science, energy, quantum computing

Luma

Building some of the biggest & fastest AI supercomputing clusters in the world.

High-Performance Computing Engineer

Palo Alto

  • Skills: High-Performance Computing, AI supercomputing, CPU, GPU, network devices, Linux kernel optimization, user-space code, system automation, monitoring systems, large-scale deployment

Luma AI

Site Reliability Engineer (SRE)

Palo Alto

  • Skills: Site Reliability Engineer, SRE, Infrastructure, GPU clusters, H100 GPUs, Monitoring tools, Management tools, Performance problems, Maintenance problems, Data Processing

Recogni

Recogni is a system solution company that specializes in the design of industry-leading high-performance, low-power AI inferencing. Their mission is to enable …

Hands-on Technology Leadership Role in Multimodal Generative AI Inference Produ…

San Jose

  • Skills: Multimodal Generative AI, systems engineering, machine learning, deep learning, Kubernetes, C++, Python, Linux OS, software development, distributed computing
  • Experience: 15+ years of hands-on systems engineering experience
  • Type: Full-time

Quadric

Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices. The company was co-founded by technologists from MIT and …

Senior Software Engineer

Burlingame

  • Skills: neural networks, algorithmic optimization, deep learning, graph-based execution, C++, application-appropriate heuristics, NP-hard problems, classical algorithms, machine learning, DSP
  • Experience: Minimum of eight years in the industry, MS or Ph.D. in Computer Science or related field
  • Type: Full-time

Confidential

We are a team of mission-driven engineers with experience across aerospace, robotics, and self-driving cars working to build safety-enhancing technology for av…

Flight Software Infrastructure Engineer

Mountain View

  • Skills: flight software, infrastructure selection, cloud computing, continuous integration, hardware-in-the-loop, simulation infrastructure, collaborative environment, optimization, performance expectations, stakeholder coordination
  • Experience: 5+ years in software engineering with a focus on systems infrastructure
  • Type: Full-time

Cutting-edge Hardware Startup

A startup focused on reimagining silicon and creating RISC-V based computing platforms.

Silicon Verification Engineer

Santa Clara

  • Skills: Silicon, Verification, RISC-V, Computing platforms, Performance, Energy efficiency, Scalability, Designs, Engineers, Startup
  • Type: Full-time

Unnamed Hardware Startup

A cutting-edge and well-funded hardware startup focused on reimagining silicon and creating transformative computing platforms.

Test Engineer

Santa Clara

  • Skills: hardware startup, silicon, computing platforms, test generation, performance validation, functionality validation, accelerator design, hardware teams, software teams, energy efficiency
  • Experience: 3-5 years in hardware testing or related fields
  • Type: Full-time

Rivos

Rivos is focused on developing cutting-edge accelerator designs for high-performance computing environments.

Post-Silicon Performance Engineer

Santa Clara

  • Skills: Post-Silicon Performance, Performance Validation, Server Workloads, Data Analytics, AI/ML, Silicon Performance Measurements, Power/Performance Ratios, Performance Benchmarks, Collaboration, Debugging
  • Experience: 5+ years in SoC bring-up and validation
  • Type: Full-time

Cutting-edge Hardware Startup

A well-funded hardware startup in Silicon Valley focused on RISC-V based accelerated computing platforms.

SoC Performance Architect

Santa Clara

  • Skills: SoC, Performance Architect, RISC-V, accelerated computing, energy efficiency, scalability, hardware design, silicon, engineering, performance
  • Experience: Experienced candidates
  • Type: Full-time

Cutting-edge hardware startup

Focused on reimagining silicon and creating RISC-V based accelerated computing platforms.

Junior Power-Management Architect

Santa Clara

  • Skills: Power/Performance modeling, RISC-V, Accelerated computing, Energy efficiency, Scalability, Silicon redesign, Design optimization, Hardware engineering, Cutting-edge technology, Performance enhancement
  • Type: Full-time

Cutting-Edge and Well-Funded Hardware Startup

A hardware startup in Silicon Valley focused on modeling and optimizing power/performance features on RISC-V based accelerated computing platforms.

Junior Power-Management Architect

Santa Clara

  • Skills: Power Management, Performance Optimization, RISC-V, Accelerated Computing, Hardware Design, Energy Efficiency, Scalability, Silicon Technology, Engineering, Startup Environment
  • Type: Full-time

Cutting-edge Hardware Startup

A well-funded hardware startup focused on reimagining silicon and creating RISC-V based accelerated computing platforms.

Deep Learning and Large Language Model Performance Architect

Santa Clara

  • Skills: Deep Learning, Large Language Models, Performance Analysis, Computer Architecture, AI Software Stack, C/C++ Programming, Hardware Modeling, Performance Validation, GPU Programming (CUDA), LLVM/MLIR Development
  • Experience: 5+ years working experience
  • Type: Full-time

xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motiv…

CUDA Kernel Engineer

Palo Alto

  • Skills: CUDA, GPU, kernel optimizations, deep learning, Tensor cores, Nsight, memory-bound, compute-bound, pybind, high-performance
  • Experience: Experience with CUDA, high-performance computing, and low-level kernel optimizations.
  • Type: Full-time
  • Salary: $180,000 - $440,000 USD

XPENG Motors

XPENG Motors is one of China’s leading smart electric vehicle (“EV”) companies. We design, develop, manufacture and market smart EVs that are seamlessly integr…

Machine Learning Engineer - AI Foundation

Santa Clara

  • Skills: Machine Learning, deep learning models, PyTorch, transformer architecture, model training framework, distributed training, autonomous driving, TorchScript, NVIDIA TensorRT, training and inference
  • Experience: Master's in CS/CE/EE, or equivalent industry experience
  • Type: Full-time
  • Salary: $148,909 - $252,000

XPeng Motors

XPeng Motors is one of China’s leading smart electric vehicle (“EV”) companies. We design, develop, manufacture and market smart EVs that are seamlessly integr…

AI Performance Engineer

Santa Clara

  • Skills: AI Performance Engineer, Deep Learning, NVIDIA GPU systems, autonomous driving, C++, Python, Machine Learning, performance optimization, computer architecture, parallel programming
  • Experience: 4+ years of experience with C++ and Python programming; PhD in CS/CE/EE, or equivalent industry experience.
  • Type: Full-time
  • Salary: $244,140-$413,160

Zoox

Zoox is a robotics company focused on developing autonomous vehicles, leveraging high-performance computing services to enhance the development of AI models an…

Software Engineer

Foster City

  • Skills: Software Engineer, High-Performance Computing, HPC services, Distributed system design, Algorithmic job scheduling, Adaptive cloud scaling, Autonomous Vehicle, Data engineering, AI models, Developer experiences
  • Experience: Experienced
  • Type: Full-time

NVIDIA

NVIDIA is widely considered to be one of the technology world's most desirable employers, featuring creative and autonomous personnel.

Senior System Software Engineer - Scientific Computing PaaS

Santa Clara

  • Skills: Scientific computing, Cloud-native, Microservices, APIs, Distributed systems, Parallel processing, AI platforms, Performance optimization, HPC clusters, Algorithmic thinking
  • Experience: 10+ years experience working on building and operating distributed compute and data-intensive platform as a service on cloud
  • Type: Full-time
  • Salary: 180,000 USD - 339,250 USD

Meta

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Ap…

Software Engineer, Systems ML - SW/HW Co-design

Fremont

  • Skills: AI Infrastructure, Machine Learning, Deep Learning, Hardware Accelerators, GPU Architecture, AI Algorithms, C/C++ Programming, Python, Performance Optimization, SW/HW Co-design
  • Experience: 7+ years with Bachelor's, 4+ years with Master's, or 3+ years with PhD in relevant technical field with experience in AI framework development or hardware acceleration.
  • Type: Full-time
  • Salary: $70.67/hour to $208,000/year + bonus + equity + benefits

Fabric

Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next generation of cry…

Cryptography Hardware Engineer

Santa Clara

  • Skills: cryptography, hardware, ZKP protocols, framework improvement, customer advisory, secure computation, programming skills, cryptocurrency, cryptographic products, LLVM
  • Experience: Deep knowledge of cryptography, programming skills, experience with cryptocurrency or cryptographic products, LLVM

Meta Platforms Inc.

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Ap…

ASIC Engineer, Machine Learning Architecture

Sunnyvale

  • Skills: ASIC, Machine Learning, SoCs, data center, architecture, performance models, C, C++, Python, object-oriented programming
  • Experience: 2+ years of experience with programming skills in C, C++, Python or other Object Oriented Programming Language.
  • Type: Full-time
  • Salary: $114,000/year to $166,000/year + bonus + equity + benefits