Cerebras Systems via Greenhouse

Staff DevRel Engineer - AI Inference

Sunnyvale, CA or Toronto, Canada · Posted 2mo ago
Other · Staff+ · Full-time · #ai-lab

About the Role

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.  

Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra-high-speed inference.

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order-of-magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

About the Team

The Inference Ecosystem Engineering team’s mission is to show—not tell—the power of the Cerebras Inference API. We build open-source integrations, reference architectures, and polished demo apps that developers can clone, run, and extend in minutes. From LangChain agents to partner plug-ins and end-to-end “weekend projects,” our code is often the first (and most lasting) impression customers have of Cerebras. 

Responsibilities: 

  • Design, develop, and maintain open-source libraries, SDKs, and sample repos that make Cerebras the easiest-to-adopt inference platform. 
  • Create production-quality demo applications that highlight low latency, high generation speed, and cost advantages.
  • Build and own CI/CD pipelines, tests, and release automation for all public repos. 
  • Collaborate with partner engineering teams to embed Cerebras inference into their products and publish joint reference architectures. 
  • Collect developer feedback, identify usability gaps, and influence the Cerebras API roadmap. 
  • Contribute to engineering blogs, tutorials, and conference talks to grow community awareness and adoption. 

Skills & Qualifications: 

  • Bachelor’s or Master’s degree in computer science or a related field, or equivalent practical experience.
  • 4+ years of professional software engineering experience (or an equivalent open-source track record).