Job Summary
A software developer has an open position for a Telecommute Senior Site Reliability Engineer.
Individual must be able to fulfill the following responsibilities:
- Architect and run a high availability, global scale infrastructure
- Evaluate, deploy and support new technologies
- Manage vendor management and selection
Skills and Requirements Include:
- At least 7 years experience with highly-available enterprise servers
- Experience with orchestration (e.g. Puppet, Chef, Salt, Ansible)
- Experience with Linux shell scripting, automation, troubleshooting, and performance tuning
- Experience with one or more of MySQL, Mongo (or other NoSQL), HDFS
- Experience with big data tools (e.g. Hadoop, Kafka, Storm, Spark)
- Experience with icinga, nagios, PTRG, Sentry or other monitoring tools