Job Summary
An information technology company has a current position open for a Remote Incident Response Engineer.
Candidates will be responsible for the following:
- Providing real-time monitoring, triage & escalation of critical & major issues & incoming alarms within the environment
- Participating in incident management calls & coordinating response, triage, recovery, and reporting of incidents
- Actively engaging through the service restoration and ensure senior leadership is aware of activities being carried out
Must meet the following requirements for consideration:
- 3+ years of combined experience in DevOps, SRE, and/or Unix/Linux system administration and monitoring tools
- Hands on experience with cloud architecture and deploying infrastructure in a cloud environment
- Solid networking experience (TCP/IP, BGP routing, load balancing, DNS)
- Experience with Linux distributions (CentOS, Ubuntu, and Amazon Linux)
- Experience with Amazon Web Services (EC2, VPC, ELB, S3, CloudFormation, etc)
- Experience with configuration management (Chef, Puppet, Ansible, and/or Salt)