IT Incident and Problem Manager

Not Interested
Bookmark
Report This Job

profile Job Location:

Chennai - India

profile Monthly Salary: Not Disclosed
Posted on: 9 hours ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

What youll do 
    The Incident Manager is responsible for orchestrating the response to high-impact incidents ensuring timely resolution minimizing service disruption and maintaining communication across stakeholders. The role demands exceptional crisis management skills technical acumen and a strong ability to coordinate cross-functional teams under pressure. 
    Lead end-to-end incident resolution for major incident events through orchestrating combined technical efforts of responding teams. 
    Assume ownership of major incidents and drive coordinating efforts to ensure quick resolution of impacting events. 
    Identify and remove blockers escalate appropriately and continuous momentum of troubleshooting efforts. 
    Ensure adherence to established incident management processes and protocols. 
    Contribute to the improvement of incident response runbooks and documentation. 
    Own internal and external communications during major incidents. 
    Translate technical details into business-impact language (scope severity risk ETA confidence level). 
    Maintain clear and continuous communication with stakeholders during incidents providing timely updates. 
    Ensure safe execution of mitigations rollbacks feature flags and failovers 
    Lead post incident review meetings with stakeholders to confirm event details and assign problem investigators. 
    Track and report on incident metrics identifying patterns and areas for systemic improvement. 

    Augment Change Managers and / or Problem Managers as required in the performance of those responsibilities. 

What youve done 
    Bachelors or masters Degree and/or equivalent experience relevant to functional area. 
    10 years of experience in incident management IT operations or a similar role. 
    Experience managing critical incidents in a 24/7 production environment. 
    Experience with ServiceNow ITSM and on call incident coordination via PagerDuty / Zen duty (or comparable ITSM/on call tools). 

Knowledge Skills Abilities & Behaviours 
    Understand a wide breadth of technical concepts across a CI/CD environment. 
    Background in cloud-based systems and DevOps practices preferred. 
    Ability to use AI tools to synthesize communication reports and troubleshooting leads. 
    Certification in ITIL incident management or related frameworks preferred. 
    Experience in SaaS or technology product companies preferred. 
    Strong leadership and decision-making skills under pressure. 
    Excellent verbal and written communication skills for both technical and non-technical audiences. 
    Deep understanding of IT service management principles and practices. 
    Ability to manage multiple priorities and deadlines in high-stakes situations. 
    Strong analytical skills to drive root cause analysis and trend identification. 
    Familiarity with modern monitoring and incident management tools. 
    Demonstrated ability to build consensus across diverse teams. 
    Effective at maintaining calm and focus during critical situations. 
    Knowledge of cloud infrastructure (e.g. AWS Azure) and application architecture. 
    Proven track record of improving incident management processes. 
    Attention to detail in documentation and follow-through. 
    Adept at facilitating collaboration across remote and global teams. 
    Proactive in identifying operational risks and implementing preventive measures. 
    Committed to continuous learning and process improvement. 
    Ethical dependable and resilient in challenging scenarios. 

 

Perks

  • Day off on the 3rd Friday of every month (one long weekend each month)
  • Monthly Wellness Reimbursement Program to promote health well-being
  • Monthly Office Commutation Reimbursement Program
  • Paid paternity and maternity leaves

Remote Work :

Yes


Employment Type :

Full-time

What youll do     The Incident Manager is responsible for orchestrating the response to high-impact incidents ensuring timely resolution minimizing service disruption and maintaining communication across stakeholders. The role demands exceptional crisis management skills technical acumen and a stron...
View more view more

Key Skills

  • Computer Hardware
  • IT Experience
  • IT Management
  • Network Installation
  • Vendor Management
  • Computer Networking
  • Microsoft Windows Server
  • Windows
  • IT Service Management
  • ITIL
  • SAN
  • Network Architecture

About Company

Company Logo

Forbes Advisor is a new initiative for consumers under the Forbes Marketplace umbrella that provides journalist- and expert-written insights, news and reviews on all things personal finance, health, business, and everyday life decisions. We do this by providing consumers with the kno ... View more

View Profile View Profile