Principal Site Reliability Engineer
Job Summary
Who we are
DigiCert is a global leader in intelligent trust. We protect the digital world by ensuring the security, privacy, and authenticity of every interaction. Our AI-powered DigiCert ONE platform unifies PKI, DNS, and certificate lifecycle management to secure infrastructure, software, devices, messages, AI content, and agents. Learn why more than 100,000 organizations, including 90% of the Fortune 500, choose DigiCert to stop today's threats and prepare for a quantum-safe future.
Job summary
The Platform Ops team within CloudOps is responsible for the reliability, scalability, and modernization of DigiCert's cloud infrastructure. As a Principal SRE, you will own the intersection of software engineering and operations: driving automation-first practices, reducing toil, and accelerating our cloud transformation across AWS, Azure, and GCP environments.
You will be a technical force multiplier: raising reliability standards across the organization, defining SLOs that matter, and building the internal platforms and tooling that enable product teams to ship with confidence.
What you will do
Reliability Engineering
- Define, implement, and own SLIs, SLOs, and error budgets for critical platform services
- Lead blameless post-mortems and drive systemic reliability improvements across the platform
- Design and implement observability pipelines (metrics, logs, traces) using tools such as Splunk, Prometheus, Grafana, or OpenTelemetry
- Participate in on-call rotation and serve as an incident commander for P0/P1 events
Cloud Modernization
- Architect and execute migration strategies from legacy infrastructure to cloud-native patterns (containers, serverless, managed services)
- Champion adoption of Kubernetes, service mesh, and managed cloud services (EKS, GKE, AKS)
- Evaluate and introduce emerging cloud technologies that improve availability, cost efficiency, and developer experience
- Partner with architecture and security teams to embed reliability and compliance into platform design
Automation & Platform Development
- Build and maintain infrastructure-as-code using Terraform across multi-cloud environments
- Develop internal tooling, self-service platforms, and golden-path templates that reduce operational burden for development teams
- Automate operational workflows, including provisioning, scaling, patching, and secret rotation
- Contribute to and maintain CI/CD pipelines (GitHub Actions) to enable safe, frequent deployments
Engineering Leadership
- Mentor mid-level engineers on SRE principles, distributed systems, and infrastructure best practices
- Collaborate cross-functionally with product, security, and compliance teams to deliver on platform roadmap commitments
- Document architectural decisions, runbooks, and platform standards; raise the engineering bar through code and design reviews
What you will have
- 5 years of experience in SRE, platform engineering, or infrastructure engineering roles
- Deep proficiency in at least one major cloud provider (AWS, GCP, or Azure), with working knowledge of multi-cloud environments
- Strong software engineering skills in Python, Go, or Bash; comfortable writing production-grade automation and tooling
- Hands-on Kubernetes experience: cluster operations, workload management, networking (CNI/service mesh), and security (RBAC, pod security)
- Infrastructure-as-code expertise with Terraform or equivalent; experience with GitOps workflows
- Proven experience designing and operating observability systems and responding to production incidents at scale
- Strong understanding of networking fundamentals: DNS, TLS/PKI, load balancing, and zero-trust networking concepts
Nice to have
- Experience in PKI, certificate lifecycle management, or security-adjacent infrastructure
- Familiarity with compliance frameworks such as SOC 2, FedRAMP, or ISO 27001 in cloud environments
- Prior experience driving cloud migration or modernization programs at scale
- Contributions to open-source infrastructure or platform projects
- AWS/GCP/Azure professional-level certifications (e.g., AWS Solutions Architect Professional, CKA/CKS)
What success looks like
In your first 90 days, you'll have a deep understanding of our platform's reliability posture, have contributed to at least one automation or modernization initiative, and be a trusted voice in incident response. Within a year, you'll have measurably reduced toil, improved SLO attainment across key services, and delivered at least one major platform capability that enables product teams to move faster.
Working at DigiCert CloudOps
- Greenfield modernization: we are actively migrating workloads and building new platform capabilities; you'll shape the architecture, not just maintain it
- Engineering-first culture with a strong bias toward automation, GitOps, and platform thinking
- Cross-functional visibility: Platform Ops partners directly with product, security, and compliance; your work has organization-wide impact
- Competitive compensation, equity, and comprehensive benefits, including flexible PTO and remote-first flexibility
Benefits
- Competitive compensation and comprehensive health, dental, and vision coverage
- Retirement savings programs with company matching (401(k) or RRSP)
- Generous paid time off, including holidays and vacation
- Paid parental leave and family support benefits
- Life and disability coverage
- Flexible spending and health savings options (where applicable)
- Health and wellness support, including gym reimbursement and wellness programs
- Employee Assistance Program with 24/7 confidential support for employees and families
- Education assistance and professional development opportunities
- Access to LinkedIn Learning and continuous learning resources
- Employee referral bonus program and additional company perks and discounts
- Internal rewards and recognition platform (Motivosity) to celebrate and acknowledge project wins, milestone achievements, and the outstanding contributions of our colleagues
- Business travel insurance and global employee support programs
To protect candidate information and maintain a secure hiring process, all applications must be submitted through our careers portal. Resumes or CVs sent directly via email will not be reviewed or considered.
DigiCert is an Equal Opportunity employer and is committed to diversity. In compliance with applicable federal and state laws, DigiCert prohibits discrimination on the basis of race or ethnicity, religion, color, national origin, sex, age, sexual orientation, gender identity/expression, veteran status, status as a qualified person with a disability, or genetic information. Individuals from historically underrepresented groups, such as minorities, women, qualified persons with disabilities, and protected veterans, are strongly encouraged to apply.
Compensation Transparency:
The annualized base salary range for this position is outlined below.
Each candidate's compensation offer will be determined based on factors including experience, skills, qualifications, job duties, business needs, and location. For roles that include additional compensation components, total compensation may include base pay, bonus, equity, or other incentives.
This role may also be eligible for benefits, which will be discussed during the hiring process. We are committed to fair and transparent pay practices and comply with all applicable pay transparency requirements. If you would like more information about compensation or benefits, we are happy to provide additional details during the hiring process.
For more information regarding our comprehensive benefits, see the benefits section.
Base Salary
$160 - $190 USD
Required Experience:
Staff IC
About Company
DigiCert is the leading TLS/SSL Certificate Authority specializing in digital trust for the real world through PKI, IoT, DNS, Document & Software security solutions.