Job Purpose -
We are looking for a System Reliability Engineer (SRE) with a strong background in Azure to join our team. The ideal candidate will have at least 4 years of experience in managing cloud-based infrastructure, ensuring system reliability, and improving operational efficiency. You will play a critical role in designing, building, and maintaining systems that are robust, scalable, and secure.
KRA -
Monitor, manage, and ensure the reliability and performance of cloud-based systems on Azure.
Implement and maintain tools for monitoring, logging, and alerting using Azure Monitor, Application Insights, and related tools.
Automate routine operational tasks, including deployments, monitoring, and incident response.
Work closely with development and DevOps teams to implement best practices for reliability and availability.
Troubleshoot and resolve incidents, performing root cause analysis to prevent recurrence.
Optimize cloud infrastructure for cost-efficiency, scalability, and performance.
Design and maintain disaster recovery and backup strategies on Azure.
Define and enforce service-level objectives (SLOs) and indicators (SLIs) to measure system performance.
Support CI/CD pipelines and deployment processes, ensuring smooth operations in production environments.
Stay current with new Azure features, tools, and industry best practices.
Required Experience:
Manager
Full Time