Principal Platform Site Reliability Engineer (SASE Central Cloud Platforms)

Palo Alto Networks

Not Interested
Bookmark
Report This Job

profile Job Location:

Santa Clara County, CA - USA

profile Monthly Salary: Not Disclosed
Posted on: 3 hours ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

Your Career

Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer you will be part of a team supporting the services running on this infrastructure. This includes automation architecture performance metrics troubleshooting security and reliability.

Central Infrastructure & Platform Engineering Team Santa Clara CA (Hybrid/Onsite as applicable)

Were hiring a Sr Staff Platform SRE for our SASE central cloud platform team. Were looking for a well-rounded platform SRE who can architect build and operate cloud-native infrastructure at very large scale across GCP AWS and OCI.

This is a unique opportunity to operate at a humongous scalethe platforms youll influence are tied to hundreds of millions of dollars of annual cloud spend and the work you do will directly impact reliability efficiency developer velocity and operational excellence across the organization

Your Impact

  • Act as an architect for infrastructure owned by the teamplan ahead and design in line with scale requirements.

  • Design develop and execute infrastructure components for the platforms owned by the team.

  • Own Infrastructure as Code(IaC) Monitoring as Code(MaC) Policy as Code(PaC) components and build the golden path for future platforms with best practices

  • Strive for autonomy with an automation-first mindset including modern AI-driven approaches.

  • Redefine and continuously update modern CI/CD practices for cloud-native workloads

  • Perform on-call duties and reduce on-call toil through automation AI agents analyzers and self-healing patterns

  • Support internal platform users as a forward-deployed engineer close the feedback loop and modernize the platform based on user needs

  • Maintain a security-first mindset without compromising reliability and operability

  • Design cost-effective infrastructure solutions across AWS GCP and OCI including cost governance capacity planning and efficiency improvements


Qualifications :

Your Experience 

  • BS or MS in Computer Science a related field or equivalent professional experience

  • Expert knowledge of Kubernetes and CNCF ecosystem tools such as Helm Prometheus Backstage Istio and Crossplane.

  • Strong mastery of Terraform: building reusable modulesdesigning complex infrastructure offerings operating in protected / restricted environments

  • Strong foundational knowledge of operating and scaling cloud-native workloads using KEDA Karpenter NAP etc.

  • Ability to architect CI/CD infrastructure for cloud-native workloadsprimarily Golang and Pythonand build DevSecOps pipelines.

  • Programming skills with GoLang & Python scripting experience with bash

  • Strong knowledge of Argo CD including controlling and scaling thousands of deployments across Kubernetes and multiple clouds.

  • Deep experience in cost governance and optimization at scale including allocation models anomaly detection efficiency recommendations and guardrails across cloud and Kubernetes workloads.

  • Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions

  • Excellent written and verbal communication able to collaborate and rally support

  • Self-disciplined self-managed self-motivated and strong sense of ownership urgency and drive

  • Strong communication skills and the ability to partner across platform security and application engineering teams


Additional Information :

The Team

Our engineering team is at the core of our products and connected directly to the mission of preventing cyberattacks. We are continually innovating challenging the way we and the industry think about cybersecurity. Our engineers dont shy away from building products to solve problems no one has pursued before.

We define the industry instead of waiting for directions. We need individuals who feel comfortable in ambiguity excited by the prospect of a challenge and motivated by the unknown risks facing our everyday lives that are only mitigated by a secure digital environment.

Compensation Disclosure

The compensation offered for this position will depend on qualifications experience and work location. For candidates who receive an offer at the posted level the starting base salary (for non-sales roles) or base salary commission target (for sales/commissioned roles) is expected to be between $140000 - $230000YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

#LI-TD1

Our Commitment

Were problem solvers that take risks and challenge cybersecuritys status quo. Its simple: we cant accomplish our mission without diverse teams innovating together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need please contact us at  .

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace and all qualified applicants will receive consideration for employment without regard to age ancestry color family or medical care leave gender identity or expression genetic information marital status medical condition national origin physical or mental disability political affiliation protected veteran status race religion sex (including pregnancy) sexual orientation or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.


Remote Work :

No


Employment Type :

Full-time

Your CareerPalo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer you will be part of a team supporting the services running on this infrastructure. This includes automation architecture performance metrics troubleshooting securi...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

Our enterprise security platform detects and prevents known and unknown threats while safely enabling an increasingly complex and rapidly growing number of applications. Come be part of the team that redefined the firewall industry and is now the fastest-growing security company in hi ... View more

View Profile View Profile