Site Reliability Engineer, Apple Ads

Job Location

Austin - USA

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

As an SRE in Apple Ads, you will own the health, performance, and scalability of ad-serving infrastructure and associated platform tooling. Your focus will be on building automation that eliminates manual processes, improves service resilience, and enables teams to move faster with [...]

The person in this role will:

  • Build and operate distributed systems using AWS managed services such as EKS, MSK, and ElastiCache.
  • Develop internal tooling and automation frameworks to improve infrastructure reliability, cost-efficiency, and operational visibility.
  • Collaborate with engineering teams to define infrastructure architecture, troubleshoot complex issues, and drive production excellence.
  • Design and manage Infrastructure as Code with Terraform, ensuring repeatable, secure, and scalable deployments.
  • Lead or participate in incident response, postmortems, and continuous improvement cycles to reduce future [...]

This is not a DevOps-only or CI/CD-focused role. We are looking for engineers who build platform solutions, not just configure pipelines.


  • 5 years of experience supporting internet-facing production systems and distributed cloud infrastructure.
  • Strong programming skills in at least one of: Python, Go, or Java.
  • Proven expertise with AWS-managed infrastructure, especially:
  • Hands-on experience with Linux systems and deep knowledge of Linux internals.
  • Demonstrated experience with Infrastructure as Code, especially Terraform.
  • Strong foundation in SRE concepts: monitoring, alerting, and observability; incident response and root cause analysis; error budgets, SLAs/SLOs, and system reliability.


  • Built tools or services that automate platform operations, reduce toil, or improve cost efficiency.
  • Experience managing Kubernetes clusters at scale in production environments.
  • Hands-on experience troubleshooting distributed systems under real-world load.
  • Clear communication skills and comfort collaborating across engineering, infrastructure, and product teams.
  • AWS certifications or broad experience across multiple AWS services is a plus.

Employment Type

Full Time
