Senior Software Engineer (Kubernetes, Automation, Python)

NetApp

Posted on : 29-07-2025

1 Vacancy

Job Location

Bengaluru - India

Monthly Salary

Not Disclosed

Job Description

Job Summary

As a Cloud Infrastructure/Site Reliability Engineer, you will operate at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of cloud services, from design through deployment, operation, and refinement. You will be responsible for maintaining these services by measuring and monitoring their availability, latency, and overall system health.
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings, which are implemented on a microservices, container-based architecture (Kubernetes).
In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with both NetApp and cloud service provider teams, including those from Google, located across the globe in regions such as RTP, Reykjavík, Bangalore, Sunnyvale, Redmond, and more.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.

Job Requirements

Incident Response and Troubleshooting: Address and perform root cause analysis (RCA) of complex live production incidents and cross-platform issues involving OS, Networking, and Database in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
Analysis and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, Elasticsearch, Grafana, and SolarWinds. Develop strategies to enhance system and application performance and availability. In addition, maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
Issue Tracking and Resolution: Use Atlassian Jira, Google Buganizer, and Google IRM to track and resolve issues based on their priority.
Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and 3 support for NetApp's Cloud Data Services solutions.
Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.
Proficiency in Linux/Unix and CORE OS.
Demonstrated experience in scripting and infrastructure automation using tools such as Ansible, Python, Go, or Ruby.
Deep working knowledge of Containers, Kubernetes, and Serverless computing implementation.
DevOps development methodologies.
Experience with distributed systems design patterns using tools such as Kubernetes.
Experience with cloud platforms such as AWS, Azure, or Google Cloud.

Education

A minimum of 8-12 years of experience is required.
A Bachelor of Science degree in Computer Science, a master's degree, or equivalent experience is required.

Required Experience:

Senior IC

Employment Type

Full-Time
