Careers


Engineering

Site Reliability Engineer

21 February 2023
Full-timeRemote | UK

Tech Stack: AWS, GCP, Azure, Hashistack (Vault, Nomad, Consul, Terraform, Packer), Pulumi, EventstoreDB, CockroachDB, Grafana, Vector, Linux, ZFS, Tailscale, Github, Google Workspaces, Slack

Languages:  Go, Rust, and Typescript

Location:
We are 100% remote but it is important to have significant overlap with the US Central time zone

Event Store is a rapidly expanding Open Source Software Database company with staff across the world, embarking on an exciting growth journey to revolutionise how modern distributed systems are built. 

The users of Event Store products and services are software architects, senior developers, and operators who strive to build and maintain world-class software applications. Event Store is poised to become a leading database technology for modern distributed applications and these architects, developers, and operators require a high performing database to leverage for their applications.

We are looking for a strong  Site Reliability Engineer to help manage Event Store’s multi-cloud provisioning platform. 

The location for this role is remote within UK. 

 

Responsibilities

  • You will work as part of a team dedicated to the management of the Event Store Cloud platform, and customer provisioned systems to AWS, Azure, and GCP
  • A significant focus of the position will be ensuring availability of provisioned product services in a 24/7 environment
  • Implementing new business functionality with an eye toward resilience, operability, and performance through automation, observability, and continual improvement
  • Working across core infrastructure components i.e. middleware, databases
  • Managing services across AWS, Azure, and GCP cloud platforms
  • Participating in incident responses and subsequent blameless post-mortems/root cause analysis

Experience

  • Go and Typescript experience required
  • Rust experience preferred
  • Distributed database/systems expertise required
  • Experience working in team environments and providing guidance and support to others in the team
  • Excellent experience supporting production services, ensuring they are running reliably, and automating tasks to improve efficiency and safe operations
  • Experience working in DevOps environments with automated CI / CD
  • Strong understanding of Infrastructure as Code
  • Strong understanding of TCP/IP V4 and V6 networks and multi-cloud VPN and network topologies
  • Understanding of Linux internals in the context of troubleshooting performance issues and performance optimization
  • Experience with Hashicorp Vault, Consul, Nomad, and Terraform
  • Experience using ZFS in a production environment highly desirable
  • Experience implementing effective system observability (logging, metrics, tracing, and alerting)
  • Strong experience managing security in AWS, GCP, and Azure (IAM and network)
  • SQL proficiency with either Postgres or Cockroach databases
  • General training on cybersecurity

Qualifications

  • At least three years of experience in programming languages such as Go and Rust
  • At least two years of experience automating production solutions with Terraform or Pulumi
  • A bachelor’s degree in computer science or a related field, or equivalent experience
  • Extensive experience with open-source technology, software development, and system engineering
  • Skilled at working in a remote environment, in tandem with a team of engineers, or alone as required
  • Excellent communication and organisational skills, and the ability to stay focused on completing tasks and meeting goals within a busy workspace

 

All About You

The ideal candidate for this position:

  • Is passionate about the DevOps philosophy and championing the practice
  • Loves creating innovative solutions to problems in a collaborative fun environment
  • Has advanced knowledge and understanding of modern software and platform engineering concepts and methodologies
  • Is passionate about software quality and comfortable in a fluid environment
  • Shows initiative and demonstrate a willingness to take on challenging opportunities
  • Possesses excellent communication (verbal and written) and strong collaboration skills
  • Must be driven, highly motivated and a strong team-player, but able to work independently in a high trust environment to do the right things
  • Is interested in mentoring and helping grow the skills of the next generation of software engineers