Job Summary
A risk analytics, finance, and technology advisory firm has an open position for a Remote Mid and Senior Level Observability and Incident Response Site Reliability Engineer.
Candidates will be responsible for the following:
- Creating resilient and reliable architecture
- Developing and deploying tools and utilities
- Independently determining the needs of the customer
Qualifications Include:
- Bachelor’s degree and 4+ years of relevant professional experience
- 3-4 years of experience in designing / implementing cloud applications
- 2 years of experience establishing Service Level Indicators and Objectives
- 2 years of experience in building observability / monitoring dashboards and alerts in Splunk
- Current or recent Site Reliability Engineer experience (2+ years)
- 2 years of experience in disaster recovery planning and failover testing