Senior SRE Engineer
2024-11-11
USA
Zeta Global
WHO WE ARE
Zeta Global (NYSE: ZETA) is the Data-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and trillions of consumer signals to make it easier for marketers to acquire, grow, and retain customers more efficiently. Through the Zeta Marketing Platform (ZMP), our vision is to make sophisticated marketing simple by unifying identity, intelligence, and omnichannel activation into a single platform – powered by one of the industry’s largest proprietary databases and AI. Our enterprise customers across multiple verticals are empowered to personalize experiences with consumers at an individual level across every channel, delivering better results for marketing programs. Zeta was founded in 2007 by David A. Steinberg and John Sculley and is headquartered in New York City with offices around the world.
Job Overview: We are looking for a dynamic and highly skilled Senior SRE Engineer to join the team.
Key Responsibilities:
Implement and manage SLOs, SLIs, and error budgets.
Lead and promote postmortems, ensuring robust root cause analysis to drive continuous system improvement. Analyze historical data to identify improvement areas.
Implement full observability across systems using modern tools like OpenTelemetry (OTEL), Honeycomb, New Relic, Datadog, or similar.
Reduce toil through runbook automation.
Record and track key MTTx metrics (MTTA, MTTR, MTTF, etc.).
Lead design sessions on capacity planning, reliability by design, automation, and alerting.
Collaborate with product teams to enhance system reliability.
Engage in strategic initiatives for capacity, reliability, and automation, ensuring alignment with business goals.
What We're Looking For:
7+ years of experience as an SRE.
3+ years of software development experience, with a strong emphasis on automation.
Hands-on experience with Infrastructure as Code (IaC) tools.
Experience with distributed systems and microservices architecture (MSA).
Production experience with distributed tracing.
Proven skills in Python and Bash scripting.
Solid understanding of SLIs, SLOs, and error budgets.
Hands-on experience with CI/CD platforms (GitOps, GitLab, Jenkins, ArgoCD, etc.).
Expertise in incident management and root cause analysis.
Knowledge of modern deployment strategies (Canary, Blue-Green, etc.).
Familiarity with resiliency patterns (circuit breakers, retry mechanisms, load balancing, etc.).
Experience with SQL and NoSQL databases and understanding of distributed systems.
Proficiency in statistical analysis applied to metrics.
Experience with high-performance, low-latency systems.
Proven experience in cloud cost optimization strategies.
Experience with Kafka or other distributed messaging systems.
Strong understanding of security and compliance standards within DevOps/SRE environments.
BENEFITS & PERKS
Unlimited PTO
Excellent medical, dental, and vision coverage
Employee Equity and Stock Purchase Plan
Employee Discounts, Virtual Wellness Classes, Pet Insurance, and more!
PEOPLE & CULTURE AT ZETA
Zeta considers applicants for employment without regard to, and does not discriminate on the basis of, an individual’s sex, race, color, religion, age, disability, status as a veteran, or national or ethnic origin; nor does Zeta discriminate on the basis of sexual orientation, gender identity, or gender expression.
We’re committed to building a workplace culture of trust and belonging, so everyone feels invited to bring their whole selves to work. We provide a forum for employees to celebrate, support and advocate for one another. Learn more about our commitment to diversity, equity and inclusion here: https://zetaglobal.com/blog/a-look-into-zetas-ergs/
ZETA IN THE NEWS!
https://zetaglobal.com/press/?cat=press-release