This site uses cookies. To find out more, see our Cookies Policy

Site Reliability Engineer in Chicago, IL at Solution Partners

Date Posted: 2/17/2019

Job Snapshot

  • Employee Type:
  • Location:
    Chicago, IL
  • Job Type:
  • Experience:
    Not Specified
  • Date Posted:

Job Description

As a Site Reliability Engineer, you will champion our client's SRE practice and own the reliability of our client's services and applications. Site Reliability Engineers will work closely with their engineering teams to build mature, production-ready services and applications. As part of the SRE team, you will help define their standards for monitoring, alerting, scalability, and production-readiness. You will monitor and report on the uptime of their systems and services, the performance of their applications, and the capacity of their platform.

The SRE team owns their incident response process. As an SRE, you will be a front-line responder to production incidents. They like writing runbooks to make operations and on-call easier. They get excited about things like runbook automation, autoscaling, and metrics.

To be successful, you'll need
:A proven career working in a Linux-based environment
:Experience monitoring, operating and tuning production applications (their teams write Java)
:Experience operating and scaling services in AWS using technologies such as: EC2, ALB, VPC, RDS, and Aurora
:Experience with automation and infrastructure management tools such as Ansible, Terraform, and Docker
:Familiarity with popular monitoring tools such as: New Relic, DataDog, Prometheus, CloudWatch, and the ELK stack
:Prior experience participating in an on-call rotation
:A passion for automation and optimization and an unrelenting commitment to a good customer experience