Job Summary
NetApp is seeking a Senior Site Reliability Engineer to join its growing Instaclustr team in India. Instaclustr is NetApps open-source-as-a-service platform delivering highly reliable scalable solutions built on technologies such as Cassandra Kafka PostgreSQL Redis/Valkey OpenSearch ClickHouse and Cadence. The platform manages the full application lifecyclefrom infrastructure provisioning and application deployment to ensuring reliable production operations.
Since its founding in 2013 Instaclustr has grown to support over 300 customers and manage more than 22000 nodes globally. Site Reliability Engineers play a critical role in maintaining the security performance and availability of large-scale cloud-hosted open-source clusters while collaborating closely with global customers across industries such as gaming banking and logistics.
Job Requirements
Essential Skills
- 4-8 years of PostgreSQL administration experience in large-scale enterprise environments.
- Strong knowledge of PostgreSQL internals: configuration tuning WAL autovacuum MVCC indexing and partitioning.
- Hands-on experience with replication (streaming logical cascading) failover and disaster recovery.
- Expertise in pgBackRest including PITR and troubleshooting restore issues.
- Experience managing critical incidents database migrations and architecture reviews
Desired / Additional Skills
- Performance tuning using query logs system metrics diagnostic views and execution plans
- Experience with Patroni or similar HA solutions including DCS and failover management.
- Strong Linux command-line skills.
- Excellent written and verbal English communication
- ITIL-based support experience preferred
- Solid fundamentals in OS memory management and networking
- Team-oriented process-driven and proactive
- Programming in Python or Java; Git experience a plus
- Exposure to AWS Docker and Ansible is an advantage
Education
- 4 to 8 years of experience is preferred.
- A Bachelor of Science Degree in Electrical Engineering or Computer Science a Masters Degree or a PhD; or equivalent experience is required.
- Demonstrated ability to complete multiple moderately complex technical tasks.
Required Experience:
IC
Job Summary NetApp is seeking a Senior Site Reliability Engineer to join its growing Instaclustr team in India. Instaclustr is NetApps open-source-as-a-service platform delivering highly reliable scalable solutions built on technologies such as Cassandra Kafka PostgreSQL Redis/Valkey OpenSearch Clic...
Job Summary
NetApp is seeking a Senior Site Reliability Engineer to join its growing Instaclustr team in India. Instaclustr is NetApps open-source-as-a-service platform delivering highly reliable scalable solutions built on technologies such as Cassandra Kafka PostgreSQL Redis/Valkey OpenSearch ClickHouse and Cadence. The platform manages the full application lifecyclefrom infrastructure provisioning and application deployment to ensuring reliable production operations.
Since its founding in 2013 Instaclustr has grown to support over 300 customers and manage more than 22000 nodes globally. Site Reliability Engineers play a critical role in maintaining the security performance and availability of large-scale cloud-hosted open-source clusters while collaborating closely with global customers across industries such as gaming banking and logistics.
Job Requirements
Essential Skills
- 4-8 years of PostgreSQL administration experience in large-scale enterprise environments.
- Strong knowledge of PostgreSQL internals: configuration tuning WAL autovacuum MVCC indexing and partitioning.
- Hands-on experience with replication (streaming logical cascading) failover and disaster recovery.
- Expertise in pgBackRest including PITR and troubleshooting restore issues.
- Experience managing critical incidents database migrations and architecture reviews
Desired / Additional Skills
- Performance tuning using query logs system metrics diagnostic views and execution plans
- Experience with Patroni or similar HA solutions including DCS and failover management.
- Strong Linux command-line skills.
- Excellent written and verbal English communication
- ITIL-based support experience preferred
- Solid fundamentals in OS memory management and networking
- Team-oriented process-driven and proactive
- Programming in Python or Java; Git experience a plus
- Exposure to AWS Docker and Ansible is an advantage
Education
- 4 to 8 years of experience is preferred.
- A Bachelor of Science Degree in Electrical Engineering or Computer Science a Masters Degree or a PhD; or equivalent experience is required.
- Demonstrated ability to complete multiple moderately complex technical tasks.
Required Experience:
IC
View more
View less