TixelJobs
S
Santexvia Greenhouse

Senior AI Engineer - Tech Lead

Argentina / PerúPosted 2d ago
ML EngineerLeadFull-time

Not sure if you're a good fit?

Upload your resume and TixelJobs AI will compare it against Senior AI Engineer - Tech Lead at Santex. Get a match score, missing keywords, and improvement tips before you apply.

Free preview · Your resume stays private

About the Role

 

Santex is a global Certified B technology company as well as one of North America's fastest-growing companies, according to the Inc. 5000. It’s present in 18 countries and over 100 cities, and headquartered in Argentina with offices in the United States, Mexico, and Peru.

Its two service lines — AI Tech Consulting and AI-Optimized Engineering — help organizations comprehensively adopt technology, expand their capabilities, and improve business outcomes at scale.

The same approach defines its culture: Santex has 80.4% employee engagement (global average: 21%), 8% annual turnover (tech sector: 20–25%), 9,369 training hours invested in 2025 — more than double the global average —, and a 26.5% year-over-year reduction in carbon footprint, achieving carbon neutrality through offsetting.

Santex. A company with purpose and judgment. Yes, it's possible.

 

We are an Equal Opportunity Employer and are committed to fostering an inclusive and diverse workplace. We do not discriminate on the basis of race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, veteran status, genetic information, or any other characteristic protected by applicable law. All qualified applicants will receive consideration for employment without regard to any of these factors. We strongly encourage candidates from all backgrounds to apply.

 

Job Description: We are looking for a Senior AI Developer to join our team. The ideal candidate will have extensive experience in developing and deploying AI and machine learning models.

About the role
Over the next 14 months we're replacing Solr with an AI-native retrieval and generation platform on AWS Bedrock + OpenSearch, and rolling out 9 AI use cases on top of it — AI Augmented Search, Chair Assistant, Ask member-facing, Meeting Prep, Topic Tagging, Content Customization, Digest Emails, People-Like-Me, and a Strategic Planning Wizard.
You're the technical lead for everything that touches retrieval and generation. You set the patterns in Foundation, then carry them through the eight feature phases. The choices you make in the first 16 weeks — chunking strategy, hybrid retrieval design, evaluation harness, prompt registry — propagate to every downstream use case. We need someone who has been here before.
You'll work alongside an external architecture partner (Santex Lab) during Phase 1 for knowledge transfer. By Week 17 you own the platform.

What you'll own
Retrieval architecture
Hybrid retrieval design on OpenSearch (BM25 + k-NN) with RBAC pre-filters across 28 content types.
Per-content-type chunking strategies (paragraph, section, thread-aware, comments-as-array). Tier-A content types include articles, best practices, PeepSo discussions, calendar events, and LMS lessons.
Embedding pipeline on Bedrock (Titan v2 default). Batch + incremental flows. Multilingual fallback path.
Index design — content / profile / metadata indices, mapping per index, k-NN tuning.

Generation layer
Prompt registry — versioned per use case, rollback-ready, with per-field prompts for the SP Wizard (~60 fields).
LLM routing through Bedrock — model selection per intent tier (name lookup, navigation, factual, RAG, advisory).
Bedrock Agents for Chair Assistant + member-facing knowledge bases. Multi-turn orchestration, moderator handoff.
Bedrock Guardrails configuration — PII filters, denied topics, sensitivity classifications.

Evaluation and observability
Search-quality validation dataset (200–500 query/answer pairs seed → ongoing growth). Regression harness run on every prompt / model / index change.
RAG eval metrics — retrieval@k, answer faithfulness, citation accuracy, hallucination rate, suppression precision.
A/B testing framework — cohort assignment, metric collection, toggle infrastructure for safe rollout.
Prompt/response capture for Product team visibility (a non-negotiable Phase 1 deliverable).

Cross-cutting
Cost guardrails — monthly ceilings, model downgrade triggers, throttle rules. The stated cost ceiling is ~$10–18k/yr at 10x current Solr volume; the design must hold.
Knowledge transfer — co-author ADRs, document patterns across five dimensions (what / why / operate / extend / debug), pair-program with mid engineers so the team owns the reasoning, not just the result.
 
Must-have qualifications
5+ years backend / ML engineering experience. 2+ years of which is shipping production RAG systems serving real users — not POCs, not demos.
Hands-on experience with at least one production vector store (OpenSearch k-NN, Pinecone, Weaviate, pgvector, Vespa, or comparable). OpenSearch experience strongly preferred.
Production embedding pipelines — chunking strategy design, batch + incremental ingest, dealing with the realities of mixed c

Share