AgriGator via Indeed

Senior Engineer: AI/ML Infrastructure & Generative Systems

Cambridge, MA, US · $110K - $120K/yr · Posted 3mo ago

MLOps · Senior · Full-time · #python #pytorch #tensorflow #huggingface #langchain #llm #azure

About the Role

Overview
We are a team out of MIT, incubated by UM6P Foundry, reinventing how organizations capture and leverage their institutional knowledge. Our platform transforms fragmented information into a trusted resource that powers faster decisions and long-term innovation. We are now hiring an experienced engineer to lead the build-out of our AI/ML infrastructure and generative systems (with a possible path to CTO). This is a hands-on role at the cutting edge of LLM deployment, GPU optimization, and retrieval-augmented generation (RAG). You’ll own core components of the platform and collaborate directly with the founding team to shape the technical roadmap.

In this role you will

Design, build, and deploy retrieval-augmented generation (RAG) pipelines using LLMs and vector databases.
Develop secure backend APIs for data ingestion, indexing, and semantic search across enterprise systems (e.g., SharePoint, Teams, SQL).
Manage GPU-based inference environments optimized for scalability, latency, and cost.
Implement MLOps best practices for training, fine-tuning, evaluation, and deployment of generative AI models.
Collaborate with the founders on architecture and build-vs-buy decisions to accelerate the roadmap.
Own the full lifecycle from prototype → MVP → production, ensuring security, compliance, and enterprise readiness.
Support prototyping of lightweight front-end interfaces to showcase platform capabilities.

**This role is based in Cambridge, MA**

Required Qualifications

3+ years of experience in ML infrastructure, backend engineering, or AI platform development.
Experience deploying LLMs and generative AI models in production, with fluency across multiple frameworks such as PyTorch, TensorFlow, Hugging Face, and Azure OpenAI.
Hands-on expertise in LLM post-training, alignment, fine-tuning, and deployment.
Strong backend development skills in Python (FastAPI, Flask, or Django) and REST/GraphQL APIs.
Hands-on experience with GPU inference and performance tuning.
Familiarity with vector databases (Pinecone, Weaviate, Milvus, or FAISS) and semantic search.
Comfort working in an early-stage startup environment and delivering under ambiguity.

Preferred Qualifications

Master’s or PhD in Computer Science, ML, or related field.
Experience fine-tuning and aligning LLMs (RLHF, LoRA, adapters, prompt tuning).
Experience with knowledge graphs, enterprise knowledge management, or large-scale search systems.
Familiarity with LLM orchestration frameworks (LangChain, LlamaIndex) or the Model Context Protocol (MCP).
Prior experience as a founding/early engineer at a startup.

Job Type: Full-time

Pay: $110,000.00 - $120,000.00 per year

Benefits:

Flexible schedule
Health insurance
Paid time off

Education:

Master's (Required)

Experience:

AI: 5 years (Required)
ML: 5 years (Required)

Ability to Relocate:

Cambridge, MA 02142: Relocate before starting work (Required)

Work Location: In person
