Job Summary
An information technology security firm is in need of a Remote Site Reliability Engineer.
Must be able to:
- Interact with customers to ensure the health and maintenance of customer’s stack
- Perform deep dives into both systemic and latent reliability issues
- Troubleshoot issues across the entire stack: hardware, software, application and network
Qualifications for this position include:
- Minimum 5 years of managing services in an internet scale *nix environment
- Sound fundamentals in operating systems, networking, and distributed systems
- Strong familiarity with Linux systems administration and management / best practices
- Familiarity with OS container technology: Docker, LXC, namespaces/cgroups
- Strong understanding of: Ethernet, VLAN, IPv4/IPv6, ARP, DHCP, DNS, and TCP
- Familiarity with distributed system problems: leader election, consensus, etc