About the Role
We are seeking an experienced Senior SaaS Operations Engineer to join our team and lead the reliability scalability and performance of our cloud-based SaaS platform. This role requires deep technical expertise in cloud infrastructure container orchestration and DevOps practices along with the ability to mentor team members and drive operational excellence.
Key Responsibilities
Design implement and maintain highly available SaaS infrastructure on AWS
Lead the development and optimization of CI/CD pipelines using Jenkins or GitLab CI
Architect and manage Kubernetes clusters with expertise in CNI plugins (Calico Cilium or similar) cert-manager for certificate automation and policy enforcement using Gatekeeper/OPA or Pod Security Standards
Develop and maintain complex Helm charts for application deployment and configuration management
Drive infrastructure-as-code initiatives using Ansible and other automation tools
Establish monitoring logging and alerting standards using monitoring and observability tools.
Lead incident response efforts and conduct post-mortem analyses to prevent recurrence
Optimize database performance for both relational and non-relational systems
Mentor mid-level and junior engineers providing technical guidance and best practices
Participate in capacity planning and architectural decision-making
Collaborate with development teams to ensure operational requirements are built into applications
Participate in 24/7 on-call rotation covering EMEA and US time zones
Required Qualifications
5 years of experience in SaaS operations DevOps or Site Reliability Engineering roles
Expert-level knowledge of AWS services (EC2 ECS EKS RDS S3 CloudWatch etc.)
Advanced proficiency with Kubernetes including cluster administration CNI configuration (Calico/Cilium) cert-manager implementation and policy management (Gatekeeper/OPA or PSP/PSS)
Knowledge of Kubernetes operators CRDs and admission controllers
Experience with Kubernetes ingress controllers (NGINX Traefik or similar)
Strong experience developing and managing Helm charts for complex applications
Proven expertise with CI/CD tools (Jenkins or GitLab CI)
Deep understanding of Ansible for configuration management and automation
Strong experience with relational databases (PostgreSQL MySQL etc.)
Advanced knowledge of monitoring and observability tools (ELK stack Grafana Prometheus)
Excellent troubleshooting and problem-solving skills
Strong communication skills and ability to work effectively with cross-functional teams
Experience with 24/7 on-call responsibilities
Preferred Qualifications
Experience with Azure cloud platform
Knowledge of Java applications and JVM tuning
Experience with non-relational databases (MongoDB Cassandra DynamoDB etc.)
Background in software development or scripting (Python Bash Go)
Experience with service mesh technologies (Istio Linkerd)
Familiarity with security best practices and compliance requirements
Terraform or CloudFormation experience
Relevant certifications (AWS Certified Solutions Architect CKA etc.)
Total Experience Expected: 08-11 years
Qualifications :
Required Qualifications
5 years of experience in SaaS operations DevOps or Site Reliability Engineering roles
Expert-level knowledge of AWS services (EC2 ECS EKS RDS S3 CloudWatch etc.)
Advanced proficiency with Kubernetes including cluster administration CNI configuration (Calico/Cilium) cert-manager implementation and policy management (Gatekeeper/OPA or PSP/PSS)
Knowledge of Kubernetes operators CRDs and admission controllers
Experience with Kubernetes ingress controllers (NGINX Traefik or similar)
Strong experience developing and managing Helm charts for complex applications
Proven expertise with CI/CD tools (Jenkins or GitLab CI)
Deep understanding of Ansible for configuration management and automation
Strong experience with relational databases (PostgreSQL MySQL etc.)
Advanced knowledge of monitoring and observability tools (ELK stack Grafana Prometheus)
Excellent troubleshooting and problem-solving skills
Strong communication skills and ability to work effectively with cross-functional teams
Experience with 24/7 on-call responsibilities
Preferred Qualifications
Experience with Azure cloud platform
Knowledge of Java applications and JVM tuning
Experience with non-relational databases (MongoDB Cassandra DynamoDB etc.)
Background in software development or scripting (Python Bash Go)
Experience with service mesh technologies (Istio Linkerd)
Familiarity with security best practices and compliance requirements
Terraform or CloudFormation experience
Relevant certifications (AWS Certified Solutions Architect CKA etc.)
Additional Information :
Noida (Hybrid)
At our organization we are committed to fighting against all forms of discrimination. We foster a work environment that is inclusive and respectful of all differences.
All of our positions are open to people with disabilities.
Remote Work :
No
Employment Type :
Full-time
Sopra Steria, a major Tech player in Europe with 52,000* employees in nearly 30 countries, is recognised for its consulting, digital services and solutions. It helps its clients drive their digital transformation and obtain tangible and sustainable benefits. The Group provides end-to- ... View more