Job Title: Datacenter operation Lead
Location: Framingham MA
Role Type: Full-time
Work Model: Onsite
Job description:
1. Operational Ownership (BAU)
- Own end to end infrastructure operations across assigned towers or environments.
- Ensure availability performance and stability of infrastructure services.
- Enforce runbooks SOPs SLAs and OLAs across L1/L2/L3 teams.
- Monitor service health dashboards and proactively address risks.
2. Incident Problem & Major Incident Management
- Act as primary escalation point for critical (P1/P2) incidents.
- Lead Major Incident Management (MIM) calls and coordination.
- Ensure timely root cause analysis (RCA) and preventive actions.
- Drive reduction of repeat incidents through problem management.
3. Change & Release Management
- Review and approve infrastructure changes patches upgrades and maintenance.
- Ensure risk assessment rollback planning and blackout compliance.
- Coordinate with application teams and business stakeholders for releases.
- Validate post change stability and sign off.
4. Vendor & Stakeholder Management
- Coordinate with OEMs cloud providers and third party vendors.
- Manage vendor performance escalations and service reviews.
- Act as the single operational interface for customer stakeholders.
- Present service health risks and improvement plans.
5. Governance Compliance & Security
- Ensure adherence to enterprise IT policies security standards and audits.
- Support compliance requirements (e.g. SOX DR BCP access controls).
- Drive DR drills backup validation and resiliency planning.
- Ensure audit readiness and closure of observations.
Note:
- Interested candidates are requested to share their updated resume along with their LinkedIn profile for further discussion.
- Also please confirm if you are open to relocation.
Job Title: Datacenter operation Lead Location: Framingham MA Role Type: Full-time Work Model: Onsite Job description: 1. Operational Ownership (BAU) Own end to end infrastructure operations across assigned towers or environments. Ensure availability performance and stability of infrast...
Job Title: Datacenter operation Lead
Location: Framingham MA
Role Type: Full-time
Work Model: Onsite
Job description:
1. Operational Ownership (BAU)
- Own end to end infrastructure operations across assigned towers or environments.
- Ensure availability performance and stability of infrastructure services.
- Enforce runbooks SOPs SLAs and OLAs across L1/L2/L3 teams.
- Monitor service health dashboards and proactively address risks.
2. Incident Problem & Major Incident Management
- Act as primary escalation point for critical (P1/P2) incidents.
- Lead Major Incident Management (MIM) calls and coordination.
- Ensure timely root cause analysis (RCA) and preventive actions.
- Drive reduction of repeat incidents through problem management.
3. Change & Release Management
- Review and approve infrastructure changes patches upgrades and maintenance.
- Ensure risk assessment rollback planning and blackout compliance.
- Coordinate with application teams and business stakeholders for releases.
- Validate post change stability and sign off.
4. Vendor & Stakeholder Management
- Coordinate with OEMs cloud providers and third party vendors.
- Manage vendor performance escalations and service reviews.
- Act as the single operational interface for customer stakeholders.
- Present service health risks and improvement plans.
5. Governance Compliance & Security
- Ensure adherence to enterprise IT policies security standards and audits.
- Support compliance requirements (e.g. SOX DR BCP access controls).
- Drive DR drills backup validation and resiliency planning.
- Ensure audit readiness and closure of observations.
Note:
- Interested candidates are requested to share their updated resume along with their LinkedIn profile for further discussion.
- Also please confirm if you are open to relocation.
View more
View less