TixelJobs
Z
Zetaglobalvia Greenhouse

Senior Data Engineer

San Francisco, California, United States$165K - $175K/yrPosted 5d ago
Data EngineerSeniorFull-time

Not sure if you're a good fit?

Upload your resume and TixelJobs AI will compare it against Senior Data Engineer at Zetaglobal. Get a match score, missing keywords, and improvement tips before you apply.

Free preview · Your resume stays private

About the Role

WHO WE ARE 

Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and trillions of consumer signals to make it easier for marketers to acquire, grow, and retain customers more efficiently. Through the Zeta Marketing Platform (ZMP), our vision is to make sophisticated marketing simple by unifying identity, intelligence, and omnichannel activation into a single platform – powered by one of the industry’s largest proprietary databases and AI. Our enterprise customers across multiple verticals are empowered to personalize experiences with consumers at an individual level across every channel, delivering better results for marketing programs. Zeta was founded in 2007 by David A. Steinberg and John Sculley and is headquartered in New York City with offices around the world. To learn more, go to www.zetaglobal.com.

The Role 

We’re looking for a Senior Data Engineer to design, build, and operate the data pipelines and aggregates that power Zeta’s AdTech platform. This is a hands-on individual contributor role focused on high-scale batch + streaming data processing, reliable data products, and analytics-ready datasets that enable prediction, agentic workflows, BI reporting, and measurement. You will partner closely with backend, ML, and product teams to deliver trusted, well-modeled data with strong performance, quality, and observability. 

What You’ll Do 

  • Build data pipelines: Develop robust batch and streaming pipelines (Kafka/Kinesis) to ingest, transform, and enrich large-scale event data (impressions, clicks, conversions, costs, identity signals). 
  • Create data aggregates & marts: Design and maintain curated aggregates and dimensional models for multiple consumers—prediction models, agents, BI dashboards, and measurement workflows. 
  • Data modeling & contracts: Define schemas, data contracts, and versioning strategies to keep downstream systems stable as sources evolve. 
  • Data quality & reliability: Implement validation, anomaly detection, backfills, and reconciliation to ensure completeness, correctness, and timeliness (SLAs/SLOs). 
  • Performance & cost optimization: Optimize compute/storage for scale (partitioning, file sizing, incremental processing, indexing), balancing latency, throughput, and cost. 
  • Orchestration & automation: Build repeatable workflows with scheduling/orchestration (e.g., Airflow, Dagster, Step Functions) and CI/CD for data pipelines. 
  • Observability for data systems: Instrument pipelines with metrics, logs, lineage, and alerting to accelerate detection and root-cause analysis of data issues.
  • Share