NYC Health + Hospitals via Indeed

Dir, AI Platform Engineering

New York, NY, US · $175K - $210K/yr · Posted 2mo ago
MLOps · Mid Level · Full-time
#python #llm #kubernetes #docker #azure #mlflow

About the Role

About NYC Health + Hospitals


NYC Health + Hospitals is the largest public health care system in the United States. We provide essential outpatient, inpatient and home-based services to more than one million New Yorkers every year across the city’s five boroughs. Our large health system consists of ambulatory centers, acute care centers, post-acute care/long-term care, rehabilitation programs, Home Care, and Correctional Health Services. Our diverse workforce is uniquely focused on empowering New Yorkers.

At NYC Health + Hospitals, our mission is to deliver high-quality health care services, without exception. Every employee takes a person-centered approach that exemplifies the ICARE values (Integrity, Compassion, Accountability, Respect, and Excellence) through empathic communication and partnerships between all persons.

Work Shifts


9:00 A.M. – 5:00 P.M.

Duties & Responsibilities


Purpose of Functional Assignment:
The Director of AI Platform Engineering provides strategic leadership for the cloud, platform, and deployment infrastructure supporting Artificial Intelligence (AI) across the System. This role ensures that AI systems used in clinical workflows operate safely, reliably, and securely, and comply with applicable laws and NYC Health + Hospitals rules and regulations. The Director leads platform engineering, cloud architecture, Continuous Integration and Continuous Delivery/Deployment (CI/CD) modernization, and reliability functions, ensuring that AI tools enhance clinical excellence and protect patient safety.


Essential Duties and Responsibilities:
1. Provides strategic leadership for cloud, platform, and infrastructure engineering, developing and leading multi-year roadmaps, standards, and strategies for the secure and scalable deployment of AI products.

2. Oversees architecture and governance of CI/CD pipelines, infrastructure‑as‑code (Terraform), and Kubernetes/Azure Kubernetes Service (AKS) orchestration to support reliable AI deployment.

3. Defines and oversees the infrastructure for the high-volume, low-latency data pipelines, feature stores, and data access layers required for training and real-time serving of AI models.

4. Establishes enterprise-wide reliability and monitoring frameworks to ensure stable and safe operation of AI systems used by clinicians and care teams.

5. Implements platform controls and audit trails to monitor and ensure Responsible AI practices and model explainability (XAI), and to check for model drift and bias on an ongoing basis.

6. Partners with product management, product development, cybersecurity, Machine Learning Operations (MLOps) engineering, and interoperability teams to ensure AI platform readiness and safe integrations.

7. Leads incident management and root‑cause analysis to minimize disruptions to clinical workflows and drive reliability improvements.

8. Ensures the AI platform and infrastructure provide the necessary controls, logging, and audit capabilities to meet compliance requirements and support AI safety frameworks.

9. Develops long‑term platform resilience, disaster recovery, and cost optimization strategies to support System‑wide AI expansion.

10. Defines and standardizes the platform's toolchain and Application Programming Interface (API) for the Machine Learning (ML) lifecycle, including model experimentation tracking (e.g., MLflow, ClearML), model registry, and automated testing/validation frameworks.

11. Manages a team of platform and infrastructure engineers.

12. Performs other duties as assigned.


Minimum Qualifications


1. Master's Degree from an accredited college or university in Computer Science, Information Systems or Technology, Cybersecurity, Hospital Administration, Health Care Planning, Business Administration, Mathematics, Engineering or Public Administration; and three (3) years of progressively responsible experience in health care information security, multifaceted information technology, health and medical service administration, public administration, or a related discipline with an emphasis on systems programming, systems engineering, software development, or providing technical support as a specialist; two (2) years of which must have been in a related administrative, managerial or supervisory capacity; or,
2. Bachelor’s Degree from an accredited college or university in disciplines, as listed in “1” above; and five (5) years of progressively responsible experience in health care information security, multifaceted information technology, health and medical service administration, public administration, or a related discipline with an emphasis on systems programming, systems engineering, software development, or providing technical support as a specialist; two (2) years of which must have been in a related administrative, managerial or supervisory capacity.

Assignment Qualification Preferences:
1. Master’s degree from an accredited college or university in Computer Science, Engineering, Information Systems, or related discipline; and,
2. Five (5) years of experience in Machine Learning Operations (MLOps), Machine Learning (ML) engineering, Artificial Intelligence (AI) platform engineering, or operating production Machine Learning (ML) / Large Language Model (LLM) systems; or ten (10) years of experience in Software and Data Engineering.

Certifications Preferred:
1. Professional certifications in cloud architecture, ML/AI engineering, or DevOps from leading cloud platforms.

Preferred Knowledge Areas, Skills, Abilities, and other Qualifications:
1. Figma, Sketch, Adobe XD, or similar design and prototyping tools.
2. Expertise in Azure architecture, Kubernetes/Azure Kubernetes Service (AKS), Terraform, Continuous Integration and Continuous Deployment (CI/CD), and automation frameworks.
3. Experience supporting production AI/ML systems or mission‑critical workloads.
4. Knowledge of observability tools, monitoring frameworks, and reliability engineering practices.
5. Understanding of security and compliance standards including Health Insurance Portability and Accountability Act of 1996 (HIPAA) and National Institute of Standards and Technology (NIST).
6. Demonstrated leadership, cross-functional collaboration, and technical communication skills.
7. Strong stakeholder engagement and change‑management skills.
8. Experience in healthcare, public sector, or other regulated environments.
9. Experience deploying or supporting AI/ML, LLM, or agentic AI systems in production.
10. Familiarity with Site Reliability Engineering (SRE) or platform engineering frameworks.
11. Experience Using the Following Software and/or Platforms:
  • Python, Bash/Shell, HashiCorp Configuration Language (HCL, for Terraform), and YAML/JavaScript Object Notation (JSON), with familiarity in Go preferred for Kubernetes-based platform tooling.
  • Azure cloud services, Docker, Kubernetes/AKS, Terraform, CI/CD platforms (Azure DevOps, GitHub Actions, Jenkins), monitoring/observability tools (Grafana, Azure Monitor), and secrets/IAM security tooling.

Benefits


NYC Health + Hospitals offers a competitive benefits package that includes:

  • Comprehensive Health Benefits for employees hired to work 20+ hrs. per week
  • Retirement Savings and Pension Plans
  • Paid Holidays and Vacation in accordance with employees' collectively bargained contracts
  • Loan Forgiveness Programs for eligible employees
  • College tuition discounts and professional development opportunities
  • College Savings Program
  • Union Benefits for eligible titles
  • Multiple employee discount programs
  • Commuter Benefits Programs

How To Apply


If you wish to apply for this position, please apply online by clicking the "Apply for Job" button.

Note: Candidates selected for a position are required to come to NYC as part of their onboarding.