drjobs Senior Site Reliability Engineer - Incident Management/Resiliency (Hybrid)

Senior Site Reliability Engineer - Incident Management/Resiliency (Hybrid)

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Chicago, IL - USA

Monthly Salary drjobs

$ 85000 - 115000

Vacancy

1 Vacancy

Job Description

We are interested in every qualified candidate who is eligible to work in the United States. However we are not able to sponsor visas or take over sponsorship at this time.

About Resilience Engineering

Resilience Engineering is a subset of the Site Reliability Engineering team that strives to foster a culture of continuous improvement through incident analysis process evolution and problemsolving. We work closely with teams across Tech Product and Operations through our Production Incident process to uncover system weaknesses learn from failures and make our technology more reliable.

What Youll Be Doing

In this role youll play a key role in enhancing the resiliency of our systems. Your work will focus on our incident response reporting and analysis processes enabling the organization to better prepare for and respond to complex system failures.

Youll drive efforts to optimize how we manage unexpected outages from leading realtime incident response to facilitating postincident reviews. Youll identify patterns across incidents uncover contributing factors and work across teams to recommend longterm solutions that improve our systems resilience.

Your core priorities will be to:

  • Lead production incidents as part of our PI PIC (or Incident Commander) rotation after completing training ensuring clear communication and resolution.
  • Capture and maintain detailed documentation of incidents contributing factors and learnings in formal incident reports.
  • Facilitate and document blameless postincident reviews that promote learning and continuous improvement.
  • Collect and analyze incident data to identify systemic issues risks and trends.
  • Collaborate with engineering product and operations teams to address vulnerabilities and build more resilient systems.
  • Drive improvements to how we collect analyze and learn from system failures.
  • Design and run failure simulations (e.g. mock incidents disaster recovery exercises) to proactively identify weak points.
  • Champion a culture of operational excellence and resilience across the organization.
  • Continuously evolve our incident management processes to ensure they scale with our technology and business needs.

What you should have:

  • 3 years of experience in a technology or analyst role (e.g. Software Engineering Systems Operations SRE or Product).
  • A strong interest in how complex distributed systems operateand how to make them more reliable.
  • Excellent analytical and problemsolving skills with a systemsthinking mindset.
  • Strong communication skills both verbal and written with the ability to tailor messaging to technical and nontechnical audiences.
  • Experience querying and analyzing data (e.g. SQL PostgreSQL Kafka).
  • Comfort with ambiguity and the ability to turn vague problems into actionable insights.
  • Demonstrated maturity sound judgment and organizational awareness.
  • Ability to lead crossfunctional teams during highpressure situations such as incident response and reviews.

Nice to have:

  • Experience leading resolution of major system outages or production incidents.
  • Experience driving largescale technical or process changes.

Compensation:

This position includes various levels within our career ladder. The actual annual salary will be determined based on qualifications skills experience and level assessed during the hiring process and may fall outside of the ranges shown.

Budgeted annual salary ranges:

Senior Site Reliability Engineer I: $85000 $115000
Senior Site Reliability Engineer II: $94000 $125000

Additional compensation for this role may include a bonus. All fulltime employees are eligible to participate in Company benefits described in more detail here.

#BIHybrid #LIHybrid

Benefits & Perks:

About Enova

Enova International is a leading financial technology company that provides online financial services through our AI and machine learningpowered Colossusplatform. We serve nonprime consumers and businesses alike while offering worldclass technology and services to traditional banksin order to create accessible credit for millions.

Being a valuesdriven organization is at the core of Enovas success. We live our values by listening to our customers challenging assumptions thinking big setting high expectations and hiring and developing the best. Through our values and our commitment to making Enova an awesome place to work we maintain an environment of inclusion and culture where our employees can thrive. You can learn more about Enovas values and culture here.

It is our policy to provide equal employment opportunity for all persons and not discriminate in employment decisions by placing the most qualified person in each job without regard to any other classification protected by federal state or local law. California Applicants: Click here to review our California Privacy Policy for Job Applicants.


Required Experience:

Senior IC

Employment Type

Full Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.