Job Summary
An information technology company has an open position for a Remote Senior Site Reliability Engineer Remote.
Individual must be able to fulfill the following responsibilities:
- Modify existing systems to detect and report symptoms in addition to disruptions
- Manage low and mid-level severity incidents; escalate high severity incidents to resolution team
- Design and maintain monitoring, log centralization, and facilitate observability and incident management
Skills and Requirements Include:
- 5+ years’ experience in a Site Reliability Engineering (SRE) role / Software Engineering Role
- Experience mentoring and leading a team in a high-growth organization
- Experience with SRE topics like SLOs, Error Budgets, resiliency, auto-scaling, self-healing, performance, and more
- Experience in one or more of the following: node.js, Java, Linux, Python, Go, PHP, or Scala
- Understanding of Monitoring & Alerting tools: Datadog, Pagerduty, Alert Manager, and more