Director, Site Reliability (SRE, SLI/SLO, Monitoring, Automation)

Vertafore

Job Location: Hyderabad - Pakistan
Monthly Salary: Not Disclosed
Posted on: 4 days ago
Vacancies: 1 Vacancy

Job Summary

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families. Directors will manage multiple teams and collaborate with Product Development, Cloud Operations, Information Security, and other SRE leaders to ensure operational excellence.

Key Responsibilities

  • Product Reliability Leadership
      • Define and enforce SLIs/SLOs for a subset of Vertafore flagship products.
      • Drive observability strategy across application and infrastructure layers.
  • Release Engineering & Automation
      • Oversee CI/CD pipelines for product deployments using tools like GitLab, Jenkins, Ansible, and LaunchDarkly.
      • Implement Infrastructure-as-Code (Terraform, AWS CloudFormation/CDK) for application provisioning.
  • Incident Management
      • Define 24x7 on-call rotations for assigned products; ensure rapid resolution and blameless postmortems.
  • Cross-Functional Collaboration
      • Partner with Cloud Ops on capacity planning, OS patching (app tier), and load balancing (ALB, F5).
      • Align reliability goals with product roadmaps and customer SLAs.
  • Team Leadership
      • Manage a group of Managers and Engineers; mentor teams on automation, observability, and reliability best practices.

Qualifications

  • Bachelor's degree in Computer Science, Information Systems, or a related field.
  • 18 years in Software Engineering, SRE, DevOps, or reliability roles; 5 years in leadership (Director).
  • Proven ability to leverage software engineering principles and practices to solve reliability and operational challenges.
  • Expertise in CI/CD, observability, and incident response.
  • Strong AWS knowledge and experience with container orchestration.
  • Proven ability to lead reliability programs across multiple SaaS products.
  • Experience architecting applications or infrastructure for high-growth cloud platforms.
  • Experience in B2B SaaS environments involving large-scale distributed systems.
  • Proven leadership communicating and influencing at team, peer, and leadership levels.
  • Demonstrated experience driving operational excellence through metrics and KPIs.
  • (Preferred) Background supporting financial services, healthcare, or regulated industries.

Required Experience:

Director

About Company

Looking to start your career in Technology? We have opportunities right here in mid-Michigan! Vertafore is looking for talented people to join our team in Michigan. Our dynamic environment provides professional development, fast upward mobility, and e
