White Circle via Ashby
MLE/MLOps
Paris · $100K - $150K/yr · Posted 4mo ago
MLOps · Mid Level · Full-time
About the Role
TLDR: We're looking for an MLE / MLOps engineer to own our inference stack – from optimizing serving engines to building vector search pipelines – bridging Research and Product to ship models that are fast, cheap, and production-ready.
ABOUT US
White Circle is an AI Safety company building the safety, reliability, and optimization layer for AI systems. At the core of our platform are policies – simple natural-language rules that define what an AI model should and shouldn't do. We automatically test, enforce, and continuously improve these policies at scale.
- We've raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, Hugging Face, Mistral, DeepMind, Datadog, Sentry, and others
- We process over one hundred million API calls every month
- We fine-tune and train our own LLMs so they run faster and cheaper than any open or proprietary model
WHAT YOU'LL DO
- Own inference infrastructure end-to-end: optimize latency, throughput, and cost across our model fleet.
- Build and scale model serving with TensorZero, vLLM/SGLang/TRT, and Kubernetes.
- Design and maintain vector search pipelines on top of vector stores.
- Define service health KPIs, drawing on familiarity with support metrics (SLAs, FCR, deflection).
- Turn research into product: grab experimental models from the research team, figure out what's production-ready, and ship it - formatting, sampling parameters, deployment, the whole thing.
WHO YOU ARE
- 3+ years shipping high-performance ML systems in production, not just training notebooks
- Deep hands-on experience with inference optimization - you've debugged latency spikes and know the difference between theoretical and real-world throughput
- Comfortable across the stack: from CUDA kernels to Kubernetes manifests to Grafana dashboards
A big plus: experience with Rust, custom Triton kernels, benchmarks
WHY WHITE CIRCLE
- Salary of $100,000 to $150,000 + equity
- Paid time off in line with your local regulations, no matter where you work from
- Work from Paris (hybrid) + relocation package
- Best medical insurance in France
- All the hardware, tools, and services you need
- Covered subscriptions for AI agents and IDEs
- Team off-sites twice a year: we've recently been to the Alps and to Saint-Tropez
HOW WE HIRE
1. Intro call with one of our colleagues
2. Complete the take-home exercise
3. Show your best during the technical interview
4. Final call with our CEO and CTO