Senior DevOps Engineer - Product Metrics

2024-07-31
Germany
ClickHouse
About ClickHouse
We are the company behind the popular open-source, high-performance columnar OLAP database management system for real-time analytics. ClickHouse works 100-1000x faster than traditional approaches. As a true column-based DBMS, it lets systems generate reports from petabytes of raw data with sub-second latencies. With an amazing community already adopting our open-source technology, we are now embracing our journey of delivering cloud-first solutions to delight our customers.
With top adopters such as Lyft, Cisco, and eBay - not only do our products work at lightning speed, so do we.
We are an open and collaborative company. Our colleagues are curious, engaged and excited about what they do. If you want to work in an environment where you can learn, grow, be an agent of change and have your voice heard - then please read on!
The Observability team owns the observability platform that monitors the internal services of ClickHouse Cloud as well as customers' ClickHouse instances. As part of the team, you will be responsible for designing, building, operating and maintaining components of a petabyte-scale platform that stores trillions of events and processes tens of millions of new events every second, owning its reliability, performance and availability. Our stack is built with open source technologies such as OpenTelemetry and Grafana and is, of course, based on ClickHouse.
In addition to these core responsibilities, the Observability team also manages the Product Metrics component of the platform. This function is essential for gathering and processing data for the internal billing and accounting system as well as for customer-facing dashboards that give our customers immediate insights and analytics. The Product Metrics system is designed to prioritize delivery guarantees, precision, and accuracy when handling large data volumes. Its primary goal is to provide data that is both prompt and accurate, supporting reliable operational decisions and enhancing the overall customer experience.
We are looking for highly skilled software and site reliability engineers to join our team.
What will you do?

Take an active part in determining the roadmap for the Product Metrics component
Work closely within the team to deliver new features, iterate and improve them
Design, build, operate, and maintain business-critical petabyte-scale systems
Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics component
Mentor and support other team members, participate in design discussions and collaborate with the team
Be a part of on-call rotation and take ownership of the services you're running
Educate and lead efforts to improve observability among all engineering teams

What you bring along:

You demonstrate strong initiative and a preference for action, along with a high level of responsibility, ownership and accountability
You prioritize customer needs, ensuring that our products are designed with the user in mind
You are able to take on complex challenges and break them down to achieve short feedback loops: to analyze, design, and build modular solutions, deliver MVPs, gather data and feedback and then progress iteratively
You have a strong problem-solving mindset and solid production debugging skills
You have excellent communication skills and the ability to work well within a team and across engineering teams in a fully remote environment
You thrive in a fast-paced environment and see yourself as a partner to the business, with the shared goal of moving it forward

Requirements:

5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
Solid experience with at least one programming language. We use Go, but familiarity with Python, C, C++, Rust or similar translates well
Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
Experience with data streaming/message brokering systems such as Kafka, RedPanda or similar
Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform

Bonus Points:

Experience with ClickHouse
Familiarity with open source or enterprise observability technologies; familiarity with solutions in metrics, logs, and tracing domains

Compensation
This role offers cash compensation and a stock options grant. For roles based in the United States, you can find above our typical starting salary ranges for this role, depending on your specific location.
The positioning of offers within a certain range depends on various factors, including: candidate experience, qualifications, skills, business requirements and geographical location.
If you have any questions or comments about compensation as a candidate, please get in touch with us at [email protected].
Perks

Flexible work environment - ClickHouse is a distributed company offering remote-first work to all employees
Healthcare - Employer contributions towards your healthcare.
Equity in the company - Every new team member who joins our company receives stock options.
Time off - Flexible time off in the US, generous entitlement in all countries.
Home office setup - $500 if you’re a remote employee.
Employee-driven international mobility - We enable you to relocate internationally if you wish (within certain countries and timelines and subject to role requirements, time zones and work permit considerations).

Culture - We All Shape It
As part of our first 200 employees, you will be instrumental in shaping our culture.
We look for candidates who are:

Motivated by doing great work as part of a team :)
Open to learning from others and sharing with others
Team Players: helpful, resourceful, responsive
Respectful and see feedback as an opportunity to grow

Are you interested in finding out more about our culture? We are a one-year-old company, so we are excited to be building it together right now. Our first 200 employees are the culture shapers of our future. Check out our blog posts or follow us on LinkedIn to find out more about what’s important to us, and whether you’d like to come and help build our culture with us!
Please see here for our Privacy Statement.