Job Summary
A website management platform company has an open position for a Remote Site Reliability Engineer.
Core Responsibilities of this position include:
- Implementing shared infrastructure used by all engineering teams
- Improving visibility into how distributed services interact and scale in production
- Executing disaster recovery drills
Applicants must meet the following qualifications:
- Experience with analyzing and troubleshooting systems
- Experience programming in one or more of the following: Go, Python, Ruby, C, C++, Java, etc
- Experience with Infrastructure as code tooling (e.g., Terraform, Chef, Puppet, Ansible, Pulumi, Vault, etc)
- Knowledge of large-scale, high traffic platforms and the design of scalable, robust services in the real world
- Experience with Unix/Linux operating systems internals (e.g., filesystems, system calls, namespaces, containers)
- Knowledge of large-scale, high traffic platforms and the design of scalable, robust services in the real world