Site Reliability Engineer

2024-08-31
India
OpenTable
OpenTable, part of Booking Holdings, Inc. (NASDAQ: BKNG), powers reservations for the hospitality industry. OpenTable’s software seats more than 1 billion people per year and helps more than 50,000+ restaurants, bars, wineries and other venues attract guests, manage capacity, improve operations and maximize revenue.
As an employee of OpenTable, you will be part of a global network that includes OpenTable and KAYAK's portfolio of metasearch brands including Swoodoo, checkfelix, momondo, Cheapflights, Mundi and HotelsCombined. Many employees are dedicated to one brand, but we all benefit from using each other's strengths and insights. Together, we help people experience the world through dining and travel.
Site Reliability Engineer (SRE)
The Serving Platforms team is a DevOps organization integral to OpenTable’s Infrastructure Engineering group. We are responsible for the entire lifecycle of the container stack, which powers the OpenTable business applications. We are a group of engineers with skills in many areas, such as Kubernetes, Service Mesh, Puppet, Networking, Linux, DNS, and Kafka. To provide a platform as a service, we automate our processes with tools written in-house and by 3rd party vendors. We value reliability, efficiency, and security while providing service to our customers, the OpenTable application engineering group.
We provide the following services across the company


Administering our container platform (Kubernetes/Istio)


Config management infrastructure administration (Puppet, Ansible)


SSL certificate management


Vault and Consul


CDN & Message Bus (Akamai, Kafka)


Cloud service operation (AWS)


Various security initiatives


About This Role
The Site Reliability Engineer supports OpenTable’s development and production container infrastructure. In this role, you will work with multiple engineering teams across the globe as an SME for Kubernetes and other technologies owned by the Serving Platforms team. You will participate in high-impact projects, work closely with other team members, support team priorities, and help foster good communication with stakeholders. You can expect to help build greenfield projects, mitigate infrastructure incidents, and participate in on-call rotation. Our infrastructure is self-hosted in our data centers. You will get maximum low-level exposure and experience. We're looking for someone outstanding to join our team!
About You
You love working in a small, agile, highly productive, and focused environment. You enjoy building automation and self-service tools. You are curious and like learning. Picking up new languages or skills and sharing your findings with others is second nature to you. You’re detail-oriented, enjoy writing code, and implement DevOps principles via automation. You like to create repeatable processes that do not require human babysitting. When asked, you always have an opinion or can quickly form one. You seek to understand multiple perspectives and points of view and find the most optimal solution for everyone.
Does this sound like something you'd excel at? If so, keep reading.

Required Experience:


Minimum 5+ years of hands-on Linux experience (Ubuntu, CentOS, Etc.)


Understanding of systems administration concepts and patterns


Experience and proficiency with scripting languages such as GoLang, Python, Ruby, Perl, or Bash


Working expertise with Kubernetes and Docker in on premise environment


Experience in incident response and root cause analysis of service disruptions


3+ years experience with config management tools such as Puppet, Chef, Ansible, or SaltStack


Ability to quickly learn new technologies, frameworks, and architectures, as well as participate in technical conversations with external stakeholders and your team


Experience with operating messaging systems such as Kafka or RabbitMQ in production


Nice to have:


Understanding or experience with cloud computing - AWS, GCE, Azure


Familiarity with CI/CD Pipelines using tools like GitHub, Artifactory, CircleCi, Jenkins, TeamCity, Docker registry, etc.


Experience working with K/V stores such as Zookeeper, Redis, etcd, or Consul in production


Experience with virtualization technologies such as VMware, ESX, Xen, and OpenStack


Experience working with monitoring and alerting systems such as Sensu, Graphite, Prometheus, and Nagios


Applied knowledge of working and communicating with a globally distributed team


Experience with Windows Server OSs


Benefits:


Paid Vacation


One Celebration Day per calendar year


Focus on mental health and well-being


Company-wide weeks off a year - the whole team fully recharges (and returns without a pile-up of work!)


Generous paid parental leave


Focus on your career growth


Work from (almost) anywhere ; wherever you do your best work


Employee Assistance Program (EAP)


Pension Fund


Diversity and Inclusion
We aspire to have a workplace that reflects all of the diverse communities we serve. We know that when we have diverse teams we produce more creative ideas, products, and better outcomes for our team members. OpenTable/KAYAK is proud to be an Equal Opportunity Employer, and we welcome and encourage candidates from all backgrounds and experiences to apply for roles on our team. Whoever you are, just be you.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform job responsibilities, and to receive other benefits and privileges of employment. Please contact us to request accommodation.