Site Reliability Engineer
Tech Stack: AWS, GCP, Azure, Hashistack (Vault, Nomad, Consul, Terraform, Packer), Pulumi, EventstoreDB, CockroachDB, Grafana, Vector, Linux, ZFS, Tailscale, Github, Google Workspaces, Slack
Languages: Go, Rust, and Typescript
Location: We are 100% remote but it is important to have significant overlap with the US Central time zone
Event Store is a rapidly expanding Open Source Software Database company with staff across the world, embarking on an exciting growth journey to revolutionise how modern distributed systems are built.
The users of Event Store products and services are software architects, senior developers, and operators who strive to build and maintain world-class software applications. Event Store is poised to become a leading database technology for modern distributed applications and these architects, developers, and operators require a high performing database to leverage for their applications.
We are looking for a strong Site Reliability Engineer to help manage Event Store’s multi-cloud provisioning platform.
The location for this role is remote within UK.
Responsibilities
- You will work as part of a team dedicated to the management of the Event Store Cloud platform, and customer provisioned systems to AWS, Azure, and GCP
- A significant focus of the position will be ensuring availability of provisioned product services in a 24/7 environment
- Implementing new business functionality with an eye toward resilience, operability, and performance through automation, observability, and continual improvement
- Working across core infrastructure components i.e. middleware, databases
- Managing services across AWS, Azure, and GCP cloud platforms
- Participating in incident responses and subsequent blameless post-mortems/root cause analysis
Experience
- Go and Typescript experience required
- Rust experience preferred
- Distributed database/systems expertise required
- Experience working in team environments and providing guidance and support to others in the team
- Excellent experience supporting production services, ensuring they are running reliably, and automating tasks to improve efficiency and safe operations
- Experience working in DevOps environments with automated CI / CD
- Strong understanding of Infrastructure as Code
- Strong understanding of TCP/IP V4 and V6 networks and multi-cloud VPN and network topologies
- Understanding of Linux internals in the context of troubleshooting performance issues and performance optimization
- Experience with Hashicorp Vault, Consul, Nomad, and Terraform
- Experience using ZFS in a production environment highly desirable
- Experience implementing effective system observability (logging, metrics, tracing, and alerting)
- Strong experience managing security in AWS, GCP, and Azure (IAM and network)
- SQL proficiency with either Postgres or Cockroach databases
- General training on cybersecurity
Qualifications
- At least three years of experience in programming languages such as Go and Rust
- At least two years of experience automating production solutions with Terraform or Pulumi
- A bachelor’s degree in computer science or a related field, or equivalent experience
- Extensive experience with open-source technology, software development, and system engineering
- Skilled at working in a remote environment, in tandem with a team of engineers, or alone as required
- Excellent communication and organisational skills, and the ability to stay focused on completing tasks and meeting goals within a busy workspace
All About You
The ideal candidate for this position:
- Is passionate about the DevOps philosophy and championing the practice
- Loves creating innovative solutions to problems in a collaborative fun environment
- Has advanced knowledge and understanding of modern software and platform engineering concepts and methodologies
- Is passionate about software quality and comfortable in a fluid environment
- Shows initiative and demonstrate a willingness to take on challenging opportunities
- Possesses excellent communication (verbal and written) and strong collaboration skills
- Must be driven, highly motivated and a strong team-player, but able to work independently in a high trust environment to do the right things
- Is interested in mentoring and helping grow the skills of the next generation of software engineers
Find out more or apply
Download the job description for this vacancy by clicking the link below.
Interested in applying for this vacancy? Send us your CV using the button below:
Apply now