Job Summary
A cloud solutions company is seeking a Telecommute Site Reliability Engineer in Northern California.
Core Responsibilities of this position include:
- Designing, building, running and monitoring production infrastructure
- Triaging and troubleshooting complex production issues
- Identifying and automating manual processes
Must meet the following requirements for consideration:
- Experience with Linux systems administration including strong scripting skills
- In-depth knowledge and experience supporting web applications running on Java / Apache / Tomcat
- Experience with running Docker containers in a production environment and/or on AWS
- Experience running production services in AWS (EC2, ECS, KMS, Kinesis, CloudWatch)
- Experience automating systems and infrastructure via Ansible, Chef or Terraform
- Solid understanding of networking concepts and IP protocols