Zoox · via Lever
Senior AI Inference Engineer - Model Optimization & Deployment
Foster City, CA · Posted 1mo ago
Other · Senior · Full-time
About the Role
The Perception team is pioneering the development of a multimodal foundation model to drive the next generation of autonomous system intelligence.
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) on power- and thermal-constrained vehicle SoCs. You will optimize ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.