Home/Resume Examples/Site Reliability Engineer
Software Engineering

Site Reliability Engineer Resume Example

Use this site reliability engineer resume example as a reference. Our AI tailors it to any job description in seconds.

Avg. Salary

$130,000 - $195,000

Level

Mid-Senior Level

1

Professional Summary

Site reliability engineer with 6+ years ensuring the availability, performance, and scalability of large-scale distributed systems. Expert in observability, incident response, and infrastructure automation, with a proven record of maintaining 99.99% uptime for platforms serving millions of users.

2

Key Skills

KubernetesPrometheus/GrafanaTerraformPythonGoAWS/GCPIncident ManagementSLO/SLI DesignChaos EngineeringLinux SystemsPagerDuty
3

Sample Experience Bullets

  • Kept 99.99% uptime for a platform with 8M daily active users. Built automated failover and self-healing for the core infrastructure
  • Got MTTR from 45 minutes down to 8 minutes by automating runbooks and cleaning up noisy alerts that were hiding real issues
  • Wrote the SLO framework that 15 services now use. Each team has clear reliability targets and error budget policies
  • Automated certificate rotation, capacity planning scripts, and deployment rollbacks. Eliminated about 200 hours of monthly toil
  • Ran the chaos engineering program - injected 50+ failure scenarios per quarter and found 30 failure modes nobody knew about
  • On-call for all production infrastructure. Responded to pages, coordinated incident response, and wrote postmortems for every P0/P1
  • Managed Kubernetes clusters across 3 AWS regions. Handled upgrades, node scaling, and troubleshooting pod scheduling issues
  • Built Grafana dashboards and Prometheus alerting rules for 40+ services. Became the go-to person for observability questions
  • Worked with dev teams to define resource requests and limits for their services. Prevented about 15 OOM-related outages per quarter
4

ATS Keywords

Include these keywords in your resume to pass Applicant Tracking Systems.

site reliability engineerSREinfrastructure automationobservabilityincident responseSLOuptimedistributed systemstoil reductionchaos engineering
5

Recommended Certifications

  • Google Professional Cloud DevOps Engineer
  • Certified Kubernetes Administrator (CKA)

Build your Site Reliability Engineer resume

Paste a job description and get a tailored, ATS-optimized resume in 20 seconds.

Generate Resume Free

No credit card required