Nicevia Greenhouse

AI Quality Architect

Israel - RaananaPosted 3w ago

OtherLeadFull-time

Not sure if you're a good fit?

Upload your resume and TixelJobs AI will compare it against AI Quality Architect at Nice. Get a match score, missing keywords, and improvement tips before you apply.

Free preview · Your resume stays private

About the Role

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.

So, what’s the role all about?

As AI Quality Architect, you are the person who makes quality native to the agentic SDLC. You design the systems, standards, and intelligence layers that ensure every stage of an AI-accelerated pipeline — from requirement ingestion to autonomous deployment — is observable, trustworthy, and continuously improving. You don't retrofit testing onto AI workflows; you architect quality into them from the ground up.

How will you make an impact?

Agentic Quality Architecture

Design the end-to-end quality architecture for agentic SDLC pipelines — spanning requirement analysis, code generation, test creation, execution, triage, and release gates
Define how quality agents are orchestrated: which decisions they own autonomously, which require human-in-the-loop checkpoints, and how confidence thresholds govern both
Architect multi-agent quality workflows: requirement validation agents, test generation agents, failure triage agents, and regression analysis agents working in coordinated pipelines
Establish trust and verification models for agent-produced artifacts — test code, assertions, coverage reports, and defect analyses must all be auditable and traceable
Own the architectural patterns for quality feedback loops between agents: how a deployment agent learns from a triage agent's findings, and how that signal improves future generation

AI-Native Test Engineering Platform

Design and own the LLM-powered test generation platform — from natural language requirement ingestion to executable, maintainable test output
Architect the evaluation harness that continuously measures test generation quality: coverage delta, false-positive rates, assertion accuracy, and maintenance burden over time
Build the self-healing test infrastructure layer — agents that detect broken selectors, drifted APIs, or changed behaviors and propose or apply fixes autonomously
Define the prompt engineering standards, context injection patterns, and RAG architectures that ground test generation agents in real codebase context
Architect test artifact governance: versioning, ownership attribution (human vs. agent), rollback capability, and confidence scoring for every generated artifact

Quality Gates in Autonomous Pipelines

Design intelligent, adaptive quality gates that operate at the speed of agentic CI/CD — gates that reason about risk, not just pass/fail thresholds
Build risk-scoring models that dynamically adjust gate strictness based on change scope, code origin (human vs. AI-generated), historical failure patterns, and deployment context
Architect the observability layer for agentic pipelines: what signals indicate a pipeline agent is making poor quality decisions, and how are those signals surfaced in real time
Define the integration patterns between quality gates and orchestration platforms (LangChain, LlamaIndex, custom agent frameworks) used across the engineering org
Establish rollback and circuit-breaker patterns for autonomous deployments triggered by quality signal degradation

AI Model & Agent Validation

Build behavioral testing frameworks for validating AI agents and LLM-powered features in production — testing non-deterministic outputs with statistical rigor
Design evaluation benchmarks for internal AI tooling: measuring agent task completion accuracy, hallucination rates, context retention, and decision quality over time
Architect drift detection systems that identify when agent behavior changes between model versions, prompt updates, or context window shifts
Define adversarial and edge-case testing methodologies for AI features: prompt injection resistance, boundary condition handling, and graceful degradation under distribution shift
Partner with ML platform and data science teams to establish quality acceptance criteria for every model and agent promoted to production

Have you got what it takes?

12+ years in software engineering with strong depth across both development and quality engineering
4+ years as a hands-on principal architect or distinguished engineer with cross-org technical scope
Demonstrated experience designing quality infrastructure used at scale — 50+ engineers, high-velocity pipelines, enterprise SLAs
Direct production experience building or operating systems that incorporate LLMs or AI agents — not evaluations, but shipped systems
Background in large-scale CI/CD architecture and the performance engineering domain
Enterprise SaaS or platform engineering background; familiarity with regulated, high-uptime environments strongly preferred

Agentic AI & LLM Proficiency

Deep, hands-on understanding of agentic AI patterns: tool use, multi-agent orchestration, planning loops, memory architectures, and human-in-the-loop design
Expert-level prompt engineering including chain-of-thought, few-shot, RAG, and structured output techniques applied to code and test generation
Experience designing evaluation harnesses for non-deterministic AI systems — statistical confidence, behavioral consistency, and regression detection
Familiarity with agent frameworks (LangChain, LlamaIndex, AutoGen, CrewAI, or equivalents) and their tradeoffs in production quality pipelines

Technical Depth

Expert in Python and TypeScript; proficient in at least one compiled language (Java, Go, C#)

Deep knowledge of modern test frameworks across UI, API, and contract layers: Playwright, Stryker, pytest, REST Assured, Pact

Strong CI/CD architecture skills: Jenkins, GitHub Actions, pipeline-as-code, artifact management

Proficient with cloud-native infrastructure: Docker, Kubernetes, AWS, serverless execution environments

Leadership & Influence

Proven ability to drive technical consensus and architectural adoption across multiple teams without direct authority
Exceptional written communication — your ADRs, RFCs, and design docs set the standard others follow
Effective technical mentor to senior and staff engineers; you expand the thinking of those around you
Comfortable and credible presenting to VP and C-level stakeholders; you make complex architecture legible to executives

Ready to apply?

This job is active. Apply now to get in early.

Similar Jobs

Engineering Lead

Caribou

Senior Software Engineer, Hub Team

Weekend (fmr. Volley)

AI Architect

Avahi

Senior Engineering Manager

Just Appraised

View all jobs