Senior Staff Site Reliability Engineer (Cortex Observability)

Palo Alto Networks

Posted on : 14-07-2025

1 Vacancy

Job Location

Santa Clara - USA

Monthly Salary

Not Disclosed

Job Description

Your Career

The Cortex team builds and delivers the industry's most advanced SecOps platform, consisting of XDR, XSIAM, XSOAR, and XPANSE. As a member of the Cortex DevOps team, you will operate and maintain a large-scale GCP environment, including the design, implementation, and continuous enhancement of our comprehensive observability systems. To succeed in this role, you will need deep knowledge of modern observability and monitoring tools and practices, having managed high-cardinality metrics, implemented tracing, and operationalized large-scale logging solutions. You will collaborate closely with our engineering teams to develop innovative solutions that provide clear and actionable insights into our systems' performance and health.

Your Impact

As a Senior Staff SRE with the Cortex Observability team, you will:

Cloud Expertise: Utilize your expertise in monitoring cloud platforms, particularly GCP, to optimize our infrastructure, leveraging cloud-native technologies
Monitoring Expertise: Improve monitoring processes, alerts, and metrics. Work with development teams to ensure that all of our services have the right monitoring and metrics in place so that we detect problems before our customers do
Incident Management: Leverage incident management processes to ensure efficient resolution of system issues and minimal impact on services
Automation: Automate complex monitoring and alerting tasks by building tools for cloud operations, such as automated remediation of known issues and auto-scaling
Continuously Improve: Stay up to date with cutting-edge technologies, evaluate their potential impact on our operations, and implement them when appropriate
On-Call: Provide follow-the-sun operational coverage for our production Observability infrastructure
Collaborate: Work with our Engineering team to influence the operability of the product and ensure the reliability and availability of our services

Qualifications:

Your Experience

DevOps/SRE Expertise: 5 years of experience as a DevOps/SRE engineer, with a passion for technology and a strong motivation for high reliability at the service level
Observability Tools: High proficiency with Thanos, Prometheus, Grafana, OpenTelemetry, and other monitoring tools
Incident and Alerts Management: Clear understanding of incident and alerts management using tools like PagerDuty and Prometheus Alertmanager
Cloud Proficiency: High proficiency in either Google Cloud Platform or Amazon Web Services
Kubernetes and Docker: High proficiency with Kubernetes and Docker for container orchestration
Scripting and Automation: High proficiency in Python programming and Linux shell commands. Experience with Ansible and Terraform for infrastructure as code
Communication Skills: Effective communication and interpersonal skills, with the ability to work and coordinate between multiple teams in different time zones
Troubleshooting: Ability to effectively troubleshoot and address emerging and complex problems
Independence: Ability to operate independently, make decisions, take action, and take responsibility

Additional Information:

The Team

We're trailblazers who dream big, take risks, and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating together.

Compensation Disclosure

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary plus commission target (for sales/commissioned roles) is expected to be between $126,000/YR and $203,500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

Our Commitment

We're problem solvers who take risks and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at .

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Employment Type :

Full-time