As a Data Center Technician II youll independently manage and prioritize host repair efforts for multiple datacenters perform initial troubleshooting for ambiguous hardware and network issues and own well-defined projects with guidance from senior engineers while helping us scale our Core/Edge Data Centers and hardware infrastructure at a time of incredible growth for our business.
You will:
- Manage and prioritize your ticket queue according to defined priorities performing initial troubleshooting for server and network issues and escalating clearly when problems fall outside standard procedures.
- Maintain the Core Data Center and hardware infrastructure to meet the large scale and real-time requirements of our Imagination Platform to ensure our community has an awesome experience anywhere in the world. This includes all aspects of the server network infrastructure power and environmental life cycles.
- Collaborate across regions to track and mitigate systemic issues preventing hosts from returning to service.
- Identify and solve recurring operational problems through root cause analysis and propose improvements to runbooks SOPs and MOPs to prevent re-occurrence.
- Contribute data feedback and requirements to partners building automation ensuring that automation reflects real-world operational workflows
- Coordinate with peers to establish and uphold best practices related to breakfix install decom and all other aspects of datacenter operations.
- Influence and improve the development platform infrastructure standards (Runbooks SOPs MOPs) and methods to ensure the goal of scalability and high availability can be achieved.
- Leverage partnerships across teams to ensure prompt expansion and recovery of hardware capacity.
- Actively participate in continuous improvement and ongoing learning within the engineering team
- Assist in coordinating vendors and ensuring quality of outsourced projects
- Participate in the on-call rotation for our critical infrastructure.
- Travel: International and Domestic travel may be required 25%
You have:
- At minimum 3 years of experience working in large-scale Data Center Infrastructure environments and experience planning executing and documenting repairs in the server and networking domains.
- Extensive experience installing monitoring and maintaining server and network equipment. This includes brand new server and network provisioning.
- In-depth knowledge of data center environments servers and network equipment.
- Proven experience executing on multiple tasks simultaneously.
- Proficiency with server outofband management tools to perform initial troubleshooting on servers including when the operating system is not fully available.
- Proficiency with Linux/Unix or Windows command-line tools to collect logs run diagnostics and perform initial troubleshooting on servers and network devices
- You have installed various equipment that commonly resides in the data center environment and are able to lift 75 pounds occasionally.
You are:
- Someone who is ready for action wielding a wealth of server and network hardware troubleshooting knowledge to support Robloxs systems.
- Excited about getting in front of complex problems and can effectively organize your work to overcome emergent high-impact issues.
- Someone who enjoys building processes and procedures for the day to execute our workload and for developing new capabilities as a team.
- Someone who asks the right questions to solve issues within your expertise and you use data to test your theories. You are able to formulate and identify problems generate and evaluate a variety of solutions (some of which are novel) and implement the best one(s).
- Someone who is committed to demonstrating professionalism in all interactions with partners both inside and outside Roblox to ensure continued success in cross-functional initiatives. You are able to foster trust and uphold the reputation of the team and company.
Required Experience:
IC
As a Data Center Technician II youll independently manage and prioritize host repair efforts for multiple datacenters perform initial troubleshooting for ambiguous hardware and network issues and own well-defined projects with guidance from senior engineers while helping us scale our Core/Edge Data ...
As a Data Center Technician II youll independently manage and prioritize host repair efforts for multiple datacenters perform initial troubleshooting for ambiguous hardware and network issues and own well-defined projects with guidance from senior engineers while helping us scale our Core/Edge Data Centers and hardware infrastructure at a time of incredible growth for our business.
You will:
- Manage and prioritize your ticket queue according to defined priorities performing initial troubleshooting for server and network issues and escalating clearly when problems fall outside standard procedures.
- Maintain the Core Data Center and hardware infrastructure to meet the large scale and real-time requirements of our Imagination Platform to ensure our community has an awesome experience anywhere in the world. This includes all aspects of the server network infrastructure power and environmental life cycles.
- Collaborate across regions to track and mitigate systemic issues preventing hosts from returning to service.
- Identify and solve recurring operational problems through root cause analysis and propose improvements to runbooks SOPs and MOPs to prevent re-occurrence.
- Contribute data feedback and requirements to partners building automation ensuring that automation reflects real-world operational workflows
- Coordinate with peers to establish and uphold best practices related to breakfix install decom and all other aspects of datacenter operations.
- Influence and improve the development platform infrastructure standards (Runbooks SOPs MOPs) and methods to ensure the goal of scalability and high availability can be achieved.
- Leverage partnerships across teams to ensure prompt expansion and recovery of hardware capacity.
- Actively participate in continuous improvement and ongoing learning within the engineering team
- Assist in coordinating vendors and ensuring quality of outsourced projects
- Participate in the on-call rotation for our critical infrastructure.
- Travel: International and Domestic travel may be required 25%
You have:
- At minimum 3 years of experience working in large-scale Data Center Infrastructure environments and experience planning executing and documenting repairs in the server and networking domains.
- Extensive experience installing monitoring and maintaining server and network equipment. This includes brand new server and network provisioning.
- In-depth knowledge of data center environments servers and network equipment.
- Proven experience executing on multiple tasks simultaneously.
- Proficiency with server outofband management tools to perform initial troubleshooting on servers including when the operating system is not fully available.
- Proficiency with Linux/Unix or Windows command-line tools to collect logs run diagnostics and perform initial troubleshooting on servers and network devices
- You have installed various equipment that commonly resides in the data center environment and are able to lift 75 pounds occasionally.
You are:
- Someone who is ready for action wielding a wealth of server and network hardware troubleshooting knowledge to support Robloxs systems.
- Excited about getting in front of complex problems and can effectively organize your work to overcome emergent high-impact issues.
- Someone who enjoys building processes and procedures for the day to execute our workload and for developing new capabilities as a team.
- Someone who asks the right questions to solve issues within your expertise and you use data to test your theories. You are able to formulate and identify problems generate and evaluate a variety of solutions (some of which are novel) and implement the best one(s).
- Someone who is committed to demonstrating professionalism in all interactions with partners both inside and outside Roblox to ensure continued success in cross-functional initiatives. You are able to foster trust and uphold the reputation of the team and company.
Required Experience:
IC
View more
View less