TixelJobs

DevOps and SRE Jobs at AI Companies

DevOps and SRE engineers at AI companies manage some of the most demanding infrastructure in tech — GPU clusters for model training, low-latency inference serving, and highly available API platforms serving millions of requests. These roles combine traditional infrastructure expertise with the unique challenges of AI systems.

Last updated: June 27, 2026

1,458
Open positions
$60K+
Avg salary
15
Companies hiring

Latest DevOps & SRE Jobs at AI Companies

View all jobs
Zoom
ZoomNew21h ago

DevOps Engineer

REMOTEFull-time
#remote
L
LeidosNew1d ago

Cloud Engineer

REMOTEFull-time
#remote
M
MediumNew1d ago

Senior Data Platform Engineer

REMOTEFull-time
#remote
M
MonzoNew1d ago

Machine Learning Platform Engineer

REMOTEFull-time
#remote
Zoom
Zoom2d ago

DevOps Engineer

REMOTEFull-time
#remote
H
Honeycomb2d ago

Senior Site Reliability Engineer

REMOTEFull-time
#remote
H
Honeycomb2d ago

Senior Site Reliability Engineer

REMOTEFull-time
#remote
M
Mozilla2d ago

Senior Platform Engineer, WebExtensions

REMOTEFull-time
#remote
I
Inato2d ago

Senior Cloud Infrastructure Engineer

REMOTE$70K - $90K/yrFull-time
#remote
D
Dnb2d ago

Principal Power Platform Engineer (R-19417)

REMOTEFull-time
#remote
M
Mozilla2d ago

Senior Platform Engineer, WebExtensions

REMOTEFull-time
#remote
M
Mozilla2d ago

Senior Platform Engineer, WebExtensions

REMOTEFull-time
#remote
M
Mozilla2d ago

Senior Platform Engineer, WebExtensions

REMOTEFull-time
#remote
G
Gitlab3d ago

Principal Solutions Architect, AI / Core DevOps SME

REMOTEFull-time
#remote
K
Keepersecurity3d ago

Senior Demo Platform Engineer

REMOTEFull-time
#remote
L
Lemon.io3d ago

Senior DevOps Engineer

REMOTE$50K - $150K/yrFull-time
#remote
K
Kyndryl3d ago

Cloud Engineer - AWS

REMOTEFull-time
#remote
B
Bright Vision Technologies3d ago

Site Reliability Engineer (SRE)

REMOTEFull-time
#remote
I
IO Global3d ago

DevOps Engineer - Midnight Foundation

REMOTEFull-time
#remote
A
Agilent Technologies3d ago

DevOps & Platform Engineer (AWS / CI/CD)

REMOTEFull-time
#remote

Frequently Asked Questions

What does a DevOps/SRE engineer do at an AI company?

DevOps and SRE engineers at AI companies manage GPU clusters for model training, build and maintain inference serving infrastructure, design CI/CD pipelines for ML workflows, and ensure high availability for AI APIs that serve millions of users. Unlike traditional DevOps roles, you'll work with specialized hardware (NVIDIA GPUs, TPUs), manage large-scale distributed training jobs, and optimize infrastructure costs that can run into millions per month. Core tools include Kubernetes, Terraform, Docker, and cloud platforms (AWS, GCP, Azure) with deep expertise in GPU orchestration.

What is the salary for DevOps engineers at AI companies?

DevOps and SRE engineers at AI companies earn $140K-$210K at mid-level and $190K-$320K+ for senior and staff roles. The premium over traditional DevOps reflects the specialized skills needed for GPU infrastructure, large-scale distributed systems, and the critical nature of AI inference availability. Companies competing for cloud infrastructure talent in the AI space — particularly those managing large GPU clusters — often pay at the top of the market to attract and retain engineers who can keep their systems running reliably.

What skills do I need for DevOps at an AI company?

Core skills include Kubernetes (especially GPU scheduling), Terraform/Pulumi for infrastructure-as-code, CI/CD pipelines, and deep experience with at least one major cloud provider (AWS, GCP, or Azure). Experience with GPU workloads, NVIDIA CUDA, and container orchestration for ML training is highly valued. Monitoring and observability skills (Prometheus, Grafana, Datadog) are essential since AI systems have unique failure modes. You don't need to understand ML algorithms, but knowing how model training and inference work at an infrastructure level helps you make better architectural decisions.

AI Job Insights for DevOps & SRE Jobs at AI Companies

Salary Range (Yearly, USD)

$130 - $999K

Median $165K from 32 listings with salary data

Top Companies Hiring

Mozilla (4)Zoom (2)Honeycomb (2)Leidos (1)Medium (1)Monzo (1)

Based on recent listings shown on this page.

Common Roles

Senior Platform Engineer, WebExtensions (4)DevOps Engineer (2)Senior Site Reliability Engineer (2)Cloud Engineer (1)Senior Data Platform Engineer (1)Machine Learning Platform Engineer (1)

Counts reflect recent listings, not total market size.

In-Demand Skills

Remote (20)

Derived from tags on recent listings.

DevOps & SRE Jobs at AI Companies | TixelJobs — Jobs at AI Companies