DescriptionWere passionate about building software that solves problems. We count on our Cloud Engineers and Site Reliability Engineers (SREs) to empower our users with a rich feature set high availability and stellar performance level to pursue their missions. We are currently seeking a public cloud experienced engineer for planning designing and implementing next generation cloud infrastructure solutions. Cloud Engineer will be a part of the Engineering team and will require a strong knowledge of application monitoring infrastructure monitoring automation maintenance and Service Reliability Improvements.
Specifically we are searching for someone who brings fresh ideas demonstrates a unique and informed viewpoint and enjoys collaborating with a crossfunctional team to develop realworld solutions and positive user experiences at every interaction.
Responsibilities- Design automate and manage a highly available and scalable cloud deployment that allows development teams to deploy and run their services.
- Collaborating with engineering and Architects teams to evaluate and identify optimal cloud solutions also leveraging scalability highperformance and security.
- Modernise existing onprem solution and improving existing systems.
- Extensively automated deployments and managed applications in GCP.
- Developing and maintaining cloud solutions in accordance with best practices.
- Ensuring efficient functioning of data storage and processing functions in accordance with company security policies and best practices in cloud security.
- Collaborate with Engineering teams to identify optimization strategies help develop selfhealing capabilities
- Designing and architecting middleware solutions that align with the overall system architecture and meet business requirements. This involves selecting the appropriate middleware technologies and patterns for seamless integration.
- Writing code and configuring middleware components to enable communication and data flow between various systems. This includes developing APIs message queues and other middleware services.
- Integrating different applications and services using middleware technologies ensuring they can communicate effectively and exchange data in a standardized manner.
- Identifying and resolving issues related to middleware such as communication failures performance bottlenecks or data inconsistencies.
- Experience in developing a strong observability capabilities
- Identifying analysing and resolving infrastructure vulnerabilities and application deployment issues.
- Regularly reviewing existing systems and making recommendations for improvements.
Qualifications- Proven workexperience in designing deploying and operating mid to large scale public cloud environments.
- Proven work experience inprovisioning Infrastructure as Code (IaC) using Terraform Enterprise or community edition.
- Proven work experience in writing custom terraform providers/pluginswith Sentinel Policyas Code
- Proven work experience in containerisation via Docker
- Good to have strong working experience in Virtualisation via Kubernetes(image building k8s schedule)
- Experience in package config and deployment management via Helm Kustomize ArgoCD.
- Strong knowledge in Github DevOps (Tekton / GCP Cloud Build is an advantage)
- Should be proficient in scripting and coding that include traditional languages like Java Python GoLang JS and .
- Proven working experience inMessaging Middleware Apache Kafka RabbitMQ Apache ActiveMQ
- Proven working experience in API gateway Apigee is an advantage.
- Proven working experience in API development REST.
- Proven working experience in Sec and IAM SSL/TLS OAuth and JWT.
- Extensive knowledge and handson experience in Grafana and Prometheus micro libraries.
- Experience in self hosted private / public cloud setup.
- Exposure to Cloud Monitoring and logging.
- Experience with distributed storage technologies like NFS HDFS Ceph S3 as well as dynamic resource management frameworks(Mesos Kubernetes Yarn)
- Experience with automation toolsshould be a priority
- Professional Certification is an advantage
- Public Cloud >> GCP is a good to have.
IaC Terraform: 1) Muti Env Deployment 2) Secret management 3) Backward Compatibility 4) State management | Containers & Virtualisation: 1) Container Orchestration 2) Docker content Trust 3) Docker overlay 4) integrate with CICD | PowerShell: Error handling trouble shooting and expressions. | IaaS: 1) Troubleshoot Network connectivity problems between GCP resources 2) Load Balancer issues 3) IAM errors | DevOps: 1) Automated pipeline 2) Security in DevOps 3) Config Management 4) DevOps Metrics |
Preferred Qualifications
- Previous success in Cloud Engineering DevSecOps.
- Must have 5 experience DevSecOPs
- Must have minimum 3 experience in cloud Engineering