Sr Site Reliability Engineer

Location: Remote
Compensation: To Be Discussed
Staff Reviewed: Mon, Feb 12, 2024
This job expires in: 19 days

Job Summary

A company is looking for a Sr. Site Reliability Engineer.

Key Responsibilities:
  • Responsible for availability, latency, performance, efficiency, monitoring/observability, emergency response, capacity planning, setting and maintaining SLOs, SLIs and Error Budgets, creating dashboards
  • Analyze, troubleshoot and resolve operational challenges contributing to defined SLO's
  • Manage site stability, performance, reliability, and maintain uptime for production environments

Required Qualifications, Training, and Education:
  • Strong background as a SRE supporting a 24x7 highly available production environment for a SaaS or cloud service provider
  • Solid experience with Monitoring/APM/Observability tools (Splunk, New Relic, Prometheus, Grafana etc.,)
  • Experience implementing observability plans around logs, metrics, and traces
  • Experience in an agile development team developing software
  • Experience with cloud infrastructure environments, preferably AWS, and Infrastructure as code (Terraform, CloudFormation)

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

BECOME A PREMIUM MEMBER TO
UNLOCK FULL JOB DETAILS & APPLY

  • ACCESS TO FULL JOB DETAILS AND APPLICATION INFORMATION
  • HUMAN-SCREENED REMOTE JOBS AND EMPLOYERS
  • COURSES, GROUP CAREER COACHING AND RESOURCE DOWNLOADS
  • DISCOUNTED CAREER SERVICES, RESUME WRITING, 1:1 COACHING AND MORE
  • EXCELLENT CUSTOMER SUPPORT FOR YOUR JOB SEARCH