Cloud Operations Engineer - Observability

Splunk

Posted on : 22-04-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Hyderabad - India

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 22-04-2025

Job Description

Description

Join us as we pursue our groundbreaking vision to make machine data accessible usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk we are committed to our work customers having fun and most significantly to each others success.

The Splunk Observability Cloud provides fullfidelity monitoring and fixing across infrastructure applications and user interfaces in realtime and at any scale to help our customers keep their services reliable innovate faster and deliver great customer experiences.

Role

You will help us run one of the largest and most sophisticated cloudscale bigdata and microservices platforms in the world. You will be responsible to monitor and resolve issues that affect the availability and performance of critical components of Splunk Observability Cloud. You will use your Kubernetes cloud and infrastructureascode knowledge to enhance Splunk Observability Cloud infrastructure while reducing its operational costs.

As such you will be providing oncall support & incident management for our customers. To ensure coverage you will work a 40 hour MonFri week and be available for production support on a rotating basis on either a Saturday or/& Sunday. The flexible rotating roster is intended to balance employee wellbeing and business requirements to ensure customer expectations are met.

Responsibilities:

Respond to monitoring alerts according to defined playbooks and procedures.
Enhance playbooks and procedures to reduce oncall toil.
Participate in Post Incident Reviews and discussions.
Ensure stability and performance of production environments.
Deploy software to production environments.
Build effective working relationships with crossfunctional team members
Make suggestions for process improvements and enhance operational efficiencies.
Implement various process improvements and operational efficiencies.

Qualifications:

5 years related experience in Cloud Operations.
You have experience with Cloud Computing Platforms such as AWS and GCP.
You have experience with Kubernetes and Docker.
You have experience with one or more scripting languages such as Python Bash etc.
You have 2 years in incident response and major incident management.
You enjoy problemsolving and analyzing globalscale distributed systems.
You are collaborative with strong interpersonal and communication skills both verbal and written.
You remain calm and collected in stressful situations such as a major service outage.
You demonstrate attention to detail followthrough and the ability to prioritize quickly.
You demonstrate good judgment on when to solve problems individually and when to involve others.
Experience in Infrastructureascode Terraform Helm YAML.

Nice to have:

Experience handling SaaS applications for a large customer base.
Experience with CI/CD frameworks and PipelineasCode such as Jenkins Gitlab Artifactory etc.
Familiarity with microservices fundamentals including Service Mesh using Istio service discovery deployment strategies monitoring scheduling and load balancing.

We are an equalopportunity employer and value diversity at our company. We do not discriminate on the basis of race religion color national origin gender sexual orientation age marital status veteran status or disability status.

We value diversity at our company. All qualified applicants will receive consideration for employment without regard to race color religion sex sexual orientation gender identity national origin or any other applicable legally protected characteristics in the location in which the candidate is applying.

Note:

Thank you for your interest in Splunk!

Employment Type

Full Time

Company Industry

Department / Functional Area

Engineering

Key Skills

Apply Now

About Company

Splunk

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Cloud Operations Engineer - Observability

Splunk

Job Description

Description

Employment Type

Company Industry

Department / Functional Area

Key Skills

About Company

Similar Jobs

Cloud Support Operations Engineer II

VMware Cloud Engineer

Senior Problem Manager - Cloud Operations

Senior Cloud Native Engineer

Cloud Engineer - Azure Admin

Cloud Engineer - Azure Admin

Senior Engineer, Cloud IAM PAM

[CMI] DevOps Cloud Engineer (GCP)