drjobs Site Reliability Engineer, Enterprise Technology Services

Site Reliability Engineer, Enterprise Technology Services

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Austin - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Architect Scalable Infrastructure: Design evolve and review highly reliable performant and cost-efficient cloud-native and hybrid infrastructure using IaC containers and micro services Cryptographic Systems at ScaleDesign and operationalize scalable secure integrations with Hardware Security Modules (HSMs) for sensitive workloads key management and cryptographic SRE Best Practices: Define and implement service-level indicators (SLIs) objectives (SLOs) and agreements (SLAs) to guide engineering teams towards reliability and observability Architecture & Prevention: Serve as a technical lead during major incidents. Partner with security and platform teams to conduct deep post-incident reviews drive systemic improvements and establish preventive architectural Design & Tooling: Build and maintain reusable tooling automation frameworks and reliability platforms (observability alerting chaos testing auto-scaling failover).Reliability as Code: Champion resilience engineering via automation pipelines CI/CD integrations canary releases and chaos engineering -Cloud and Hybrid Systems: Design assess and guide architecture decisions across AWS GCP AliCloud and on-premises infrastructure. Ensure consistency interoperability and regulatory & Compliance: Ensure architectural patterns are aligned with security standards compliance requirements and audit readiness.


  • 7 years of experience in SRE DevOps or Infrastructure Engineering roles with 2 years in an architectural or principal engineering capacity.
  • Deep expertise in cloud infrastructure (AWS GCP or AliCloud) and container orchestration (Kubernetes EKS).
  • Proven experience with Infrastructure as Code (Terraform Pulumi CloudFormation).
  • Strong understanding of distributed systems networking and systems design at scale.
  • Proficiency in at least one programming or scripting language (Python Go Bash or similar).


  • Experience designing observability stacks (Prometheus Grafana Datadog OpenTelemetry ELK etc.).
  • Solid background in CI/CD tools and modern deployment strategies (ArgoCD Spinnaker GitOps).
  • Familiarity with security best practices in cloud and containerized environments.
  • Familiarity with HSMs and crypto operations at scale will be a plus.

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.