Senior Site Reliability Engineer

IFS

Not Interested
Bookmark
Report This Job

profile Job Location:

Colombo - Sri Lanka

profile Monthly Salary: Not Disclosed
Posted on: 14 hours ago
Vacancies: 1 Vacancy

Job Summary

As a Senior Site Reliability Engineer (SRE) within the Web Center of Excellence (Web COE) you will be responsible for ensuring the reliability security scalability and performance of enterprise web platforms. You will support and optimize web applications built on Sitecore WordPress and IIS-based solutions while actively driving proactive monitoring anomaly detection and vulnerability remediation.

This role blends hands-on engineering operational excellence and forward-looking innovation including participation in AI-driven observability and automation initiatives. You will work closely with developers QA solution architects and business stakeholders to ensure highly available secure and resilient web services.

 

Key Responsibilities

Reliability & Operations

  • Own the availability performance and stability of web applications hosted on Azure including PaaS and IaaS workloads.
  • Proactively monitor systems to detect anomalies performance degradation and reliability risks and take preventive actions before customer impact occurs.
  • Lead incident response root cause analysis (RCA) and post-incident reviews ensuring long-term corrective actions are implemented.

Azure & Microsoft Ecosystem

  • Design operate and optimize solutions using Azure services such as App Services Azure VMs Azure Monitor Log Analytics Application Insights Azure Front Door and Azure Networking.
  • Automate operational tasks using PowerShell and Azure-native automation capabilities.
  • Ensure adherence to Microsoft security and compliance best practices.

Web Platform Support

  • Support hosting deployment and operational health of Sitecore WordPress and legacy IIS-based applications.
  • Collaborate with development teams to ensure applications are production-ready scalable and operationally sound.
  • Guide teams on web hosting architecture DNS governance SSL/TLS and traffic management.

Security & Vulnerability Management

  • Proactively identify security vulnerabilities misconfigurations and exposure risks across infrastructure and applications.
  • Partner with security teams to implement remediation plans patching strategies and hardening standards.
  • Ensure secure-by-design principles are embedded into web hosting and operational processes.

Observability Monitoring & AI Initiatives

  • Build and enhance monitoring alerting and observability across the web ecosystem.
  • Leverage data logs and metrics to identify trends and systemic risks.
  • Contribute to AI-driven initiatives such as intelligent alerting anomaly detection predictive reliability and automated remediation.
  • Continuously improve operational maturity through tooling dashboards and insights.

Collaboration & Leadership

  • Work closely with Software Engineers QA and Architects to deliver reliable web services.
  • Provide technical mentorship to junior SREs and engineers setting operational best practices.

Qualifications :

Required

  • Bachelors degree in Computer Science Information Technology or equivalent professional experience.
  • Strong experience as a Site Reliability Engineer Systems Engineer or Cloud Engineer supporting production web systems.
  • Hands-on expertise with Microsoft Azure and the broader Microsoft ecosystem.
  • Strong scripting and automation skills using PowerShell.
  • Solid understanding of web technologies IIS HTTP/S DNS SSL and load balancing.
  • Experience with monitoring alerting and incident management in production environments.
  • Proven ability to identify anomalies and proactively prevent failures and vulnerabilities.

Preferred / Added Advantage

  • Experience supporting Sitecore and WordPress in enterprise environments.
  • Exposure to AI/ML-based observability AIOps or automation initiatives.
  • Familiarity with ITIL practices especially Incident Problem and Continual Service Improvement.
  • Experience working in agile cross-functional teams.

Additional Information :

We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles while also valuing inclusive workplace experiences. By fostering a sense of community we drive innovation strengthen connections and nurture belonging. Our commitment ensures you can work in a way that suits you best while also engaging with colleagues to share ideas and build meaningful relationships.


Remote Work :

No


Employment Type :

Full-time

As a Senior Site Reliability Engineer (SRE) within the Web Center of Excellence (Web COE) you will be responsible for ensuring the reliability security scalability and performance of enterprise web platforms. You will support and optimize web applications built on Sitecore WordPress and IIS-based so...
View more view more

About Company

Company Logo

We are growing! At IFS we are constantly growing to deliver award-winning solutions to hundreds of partners and thousands of customers worldwide! We help companies who want to be their best when it matters most – at their #momentofservice. Visit https://ifs.link/IzM0px to find out mo ... View more

View Profile View Profile