Site Reliability Engineer

Remote, Europe

We are looking for Site Reliability Engineers who are passionate about working with data and excited to solve complex, high-scale (billions of records) data challenges with the most innovative companies and startups in the world today. This person will help us build our new SRE team in a startup-like agile environment.

About DoubleCloud and our project

We are the creators of the first managed ClickHouse service, launched in 2018, which has more than 500 customers. Our engineers are significant contributors to leading open-source technologies like ClickHouse, PostgreSQL, Odyssey, WAL-G, and others.

Since 2021, we have worked with more than 100 companies that are crunching analytics with various data tools, including ClickHouse, BigQuery, Redshift, MySQL, Postgres and Kafka.

As a result, we created a data platform specifically to help businesses build an end-to-end modern data stack and real-time analytics with fully managed open-source technologies, like ClickHouse, Kafka, etc.

With our platform, data engineers can focus on building what they love instead of spending time on scaling up or down, installing updates, deploying additional software and other administrative work that open-source technologies require.

As a company, DoubleCloud is an early-stage startup incorporated in Germany (Berlin) and the USA (Boston).

We are currently over 40 people, and the team is growing fast.

What you will do:

  • improving availability, performance, monitoring, emergency response, and capacity planning for DoubleCloud
  • rolling out cutting-edge cloud technologies to meet infrastructure needs
  • implementing and improving CI/CD processes
  • growing L3 support competency within the team
  • partnering with development and support L1/L2 teams as well as the CTO and other leaders at DoubleCloud

What we expect from you:

  • 3+ years of experience operating large distributed systems
  • prior experience designing and deploying infrastructure, including skill with "infrastructure as code" tools, e.g. Terraform
  • practical experience designing and improving monitoring and alerting systems
  • expertise in Linux networking and container technologies
  • good understanding of at least one DBMS, e.g. PostgreSQL, MySQL, ClickHouse, or Redis
  • on-call availability

What would be nice to have:

  • software engineering experience in Python and/or Go
  • in-depth knowledge of multiple database management systems

DoubleCloud Culture

We are here to build the best possible product and want our customers to get the most value out of it. To achieve this objective, we work as a team in a startup-like agile rhythm. We help and inspire each other, try new things and learn new lessons. We are here for each other, and we ensure each individual has everything they need to reach their goals.

DoubleCloud is proud to be an equal opportunity employer. Simply put, we do not discriminate, which means we treat everyone with respect. Diversity, equity and inclusion are not only important to our Talent Team but are DoubleCloud’s fundamental principles.

We’re a global and diverse team full of positive vibes, and we love it that way.

Benefits & Perks

Our Talent Team is working vigorously to provide the best working experience possible. At a minimum, you get:

  • Exceptional medical benefits with 100% employer-paid premiums and well-being perks
  • Paid parental leave
  • Personal and career development courses
  • For WFH: Home office expenses reimbursement options
  • For remote coworking: office space or coworking expenses reimbursement
  • Flexible vacation and paid sick leave
  • Subsidized retirement plan
  • And plenty more…

