Salary Not Disclosed
1 Vacancy
Cloud operations encompass the management, maintenance, and continuous improvement of cloud-based infrastructure and services, ensuring they operate reliably, securely, and efficiently to support critical business functions.
We are seeking a Cloud Operations Engineer who thrives in fast-paced environments and understands the importance of operational excellence. The ideal candidate will possess strong technical acumen in AWS and Terraform, be passionate about automation, and have a solid grasp of ITIL-based service management. This role will play a critical part in maintaining the stability, performance, and availability of cloud infrastructure, with a commitment to 24/7 support coverage.
Responsibilities:
Supports the day-to-day operations of cloud infrastructure by proactively monitoring, maintaining, and improving system availability, performance, and reliability.
Owns and executes the full lifecycle of operational support following ITIL standards, including:
Incident Management: Rapid detection, investigation, and resolution of infrastructure incidents to minimize business impact.
Request Fulfillment: Handling and fulfilling infrastructure service requests within agreed service levels.
Problem Management: Identifying root causes of recurring issues and implementing permanent fixes to prevent recurrence.
Ensures infrastructure availability for business-critical services in a 24/7 support model, including participation in on-call rotations.
Builds and manages AWS cloud environments using Infrastructure as Code (Terraform).
Implements automation for operational tasks to reduce manual intervention and improve system consistency and reliability.
Collaborates with development and application teams to streamline deployments and operational processes.
Maintains detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
Ensures compliance with security, governance, and operational standards across all environments.
Participates in capacity planning, cost optimization, and performance tuning activities.
Supports change management processes, ensuring safe and auditable infrastructure changes.
Qualifications:
Proven experience managing AWS cloud infrastructure is essential.
Strong hands-on experience with Infrastructure as Code (Terraform).
Working knowledge of ITIL service management practices and experience in incident, request, and problem management is essential.
Experience supporting and operating production infrastructure in a 24/7 environment.
Strong scripting skills in at least one language (e.g., Bash, Python, PowerShell).
Familiarity with CI/CD pipelines, configuration management tools, and automation frameworks.
Understanding of networking, security groups, load balancers, IAM policies, and monitoring in cloud environments.
Exposure to observability tools such as Splunk, CloudWatch, Datadog, or similar is a plus.
Strong communication and collaboration skills, with the ability to interface effectively with technical and non-technical stakeholders.
AWS certifications (e.g., Solutions Architect, SysOps Administrator) and ITIL certifications are a plus.
Additional Information:
Discover some of the global benefits that empower our people to become the best version of themselves:
At Endava, we're committed to creating an open, inclusive, and respectful environment where everyone feels safe, valued, and empowered to be their best. We welcome applications from people of all backgrounds, experiences, and perspectives because we know that inclusive teams help us deliver smarter, more innovative solutions for our customers. Hiring decisions are based on merit, skills, qualifications, and potential. If you need adjustments or support during the recruitment process, please let us know.
Remote Work:
No
Employment Type:
Full-time