Job Summary
A technology company has an open position for a Telecommute Lead Site Reliability Engineer.
Core Responsibilities Include:
- Building and deploying monitoring infrastructure
- Defining Service Level Objects and availability guarantees for software
- Planning capacity, provisioning, and change management
Qualifications for this position include:
- Able to work 4 overlapping business hours with CEST (Central European Time)
- 7 years experience in a tech ops / SRE role
- Proficiency with Linux systems and at least one dynamic programming language
- Proven experience building, scaling and operating software in AWS
- A working understanding of Docker, AWS, and MySQL/Postgres
- Proficiency in configuration management tools (e.g. Ansible, Chef, Puppet)A