Site Reliability Engineer

Remote, USA

RetailNext is looking to expand our SRE team. We need people who have the skillset of good backend developers to focus on the operation and reliability of our SAAS retail analytics solution.

We pull in and process data from thousands of brick and mortar stores to help our customers better understand and serve their customers. We actively develop in Go and use technologies like Cassandra, Redis, Elasticsearch, gRPC, Kafka, PubSub/SQS, and more. We maintain legacy Ruby, NodeJS, Java, and C++ code.
You will be helping us operationalize new features, maintain the stability of the application, and improve how we develop and deploy it. This role includes being part of our on-call rotation, along with the backend team.

Past SRE projects have included bringing cloud resources under Terraform management, migrating from StatsD to Prometheus, re-writing how our application collects diagnostic telemetry from deployed sensors, and much more.
This is a remote role in the United States. Our headquarters is in San Jose and you are welcome to work there if you prefer to work from an office, but most of the people you will be working with are elsewhere.

Who you are:

● Strong in at least one backend programming language (Go, NodeJS, Ruby, etc...)
● Familiar with Linux (You know what the FHS, cgroups, etc... are)
● Able to teach yourself new technologies and programming languages
● Able to debug and fix issues in third-party open-source software
● Meticulously diligent about security and reliability
● Experience in an SRE, DevOps, or Release Engineering role

Bonus points:

● Experience (re-)architecting distributed applications to fix scalability and reliability issues
● Experience building and maintaining CI/CD processes
● Experience operating Cassandra in a production environment
● Experience with any cloud IAAS provider (we use both GCE and EC2)
● Experience with infrastructure-as-code tools such as Terraform

Our technical interviews involve writing and running real code, using your own familiar
development tools while sharing your screen in an online meeting with some of our engineers.
The exercises include implementing functionality within provided scaffolding, and finding/fixing
bugs in implemented code.



Apply for this job
Share this job opening

DevOps and Dev jobs in your inbox every week.

Thank you! You'll receive a confirmation shortly
Oops! Something went wrong while submitting the form.
Made with love️ by Mohamed Labouardy.