Facilities Specialist 4
Job Summary
The Data Center Facilities Engineer (DCFE) is responsible for the planning, coordination, and successful execution of all critical maintenance activities across the data center electrical and mechanical infrastructure. The role requires close collaboration with OCI colocation partners, ensuring all maintenance work is completed safely, on schedule, and in full compliance with manufacturer recommendations, industry best practices, and operational standards.
The DCFE will review, risk-assess, and manage all maintenance activities through established change management processes, ensuring site resiliency and redundancy are maintained at all times. By fostering strong stakeholder relationships and maintaining rigorous operational oversight, the DCFE plays a key role in protecting the integrity, reliability, and availability of critical infrastructure.
In addition, the DCFE is responsible for incident management and operational response during infrastructure-related events. This includes leading incident investigations, coordinating cross-functional response teams, managing escalation processes, conducting root cause analyses, and driving corrective and preventive actions to minimize risk and prevent recurrence. The DCFE will ensure timely communication to stakeholders throughout the incident lifecycle and support continuous improvement initiatives based on lessons learned.
Because it supports mission-critical server environments, the underlying infrastructure is expected to deliver 100% uptime. As such, the DCFE is accountable for ensuring all maintenance and incident management activities are executed with meticulous attention to operational risk, service continuity, safety, and reliability.
Daily Duties:
Lead regular meetings with colocation partners to review annual maintenance calendars.
Review MOPs and SOPs for accuracy and safety, and assess the potential for business impact.
Ensure critical systems redundancy is available and equipment is maintained in accordance with published schedules.
Create Oracle change management requests to risk-assess critical infrastructure activities.
Act as the Oracle lead on the investigation and resolution of engineering incidents within assigned data centers.
Undertake power and cooling capacity audits, ensuring data centers are operating within design capacities.
Monitor data center environmental data, and respond to and investigate power/cooling alarms.
Support infrared scanning of electrical infrastructure.
Support local data center operations teams, providing SME guidance on matters of critical infrastructure.
Provide on-call out-of-hours coverage in accordance with schedule.
Requirements:
Knowledge of data center critical power and cooling systems and how to troubleshoot them; experience operating and maintaining electrical and/or power electronic systems.
Change management experience, including creating and implementing maintenance tickets.
Ability to read and utilize electrical and mechanical schematics.
Ability to read and understand construction drawings.
Ability to establish, manage, and maintain effective colocation relationships.
Excellent time management skills.
Diligent with excellent organizational skills.
Good interpersonal and relationship-building skills.
Strong curiosity about learning new DC concepts.
Conscientious and trustworthy.
Process-oriented.
Must be able to lift 75 lbs.
Strong presentation and written communication skills.
Experience taking on-call shifts and the ability to respond to critical events as needed.
Responsibilities
Represent Oracle with partner organizations and hold them accountable for meeting their commitments.
Ensure partner organizations adhere to Oracle change management policies.
Ensure data center utilization is maintained within capacity limits.
Ensure environmental and capacity tools are reporting accurately and are up to date.
Ensure the facility documentation repository is complete and up to date.
Ensure data hall cooling containment meets design standards.
Ensure installation of IT equipment adheres to room layout and power striping design.
Assist with developing dashboards and automation to support scalable solutions.
Be accessible via cell phone and willing to work onsite outside of business hours as emergencies require.
Participate in and complete training that aligns with corporate objectives to bridge skill gaps and learn relevant new technologies.
Act within Service Level Agreements (SLAs) on the tickets assigned to you in your space.
Qualifications
Career Level - IC4
Required Experience: IC
About Company
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry leaders in almost every sector, and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when eve ...