drjobs HPC Data Center Operator- Owl Shift

HPC Data Center Operator- Owl Shift

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Livermore, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Do you love High Performance Computing (HPC)  Would you like to work with four of the fastest HPC systems in the world

We are looking for an HPC Data Center Operator to monitor diagnose and repair system faults on a large number of highperformance computer (HPC) systems storage systems and networks. You will interact with other Livermore Computing (LC) staff to remediate problems and provide advanced technical support in a complicated HPC computing and networking environment working Owl Shift (12:00am8:00am). This position is in the Livermore Computing Operations Group in the LC Division within the Computing Directorate.

This position will be filled at either the 525.2 or 525.3 level depending on your qualifications. Additional responsibilities (outlined below) will be assigned if you are selected at the higher level.

You will 

  • Provide broad technical support and monitoring capabilities for the HPC systems file systems and storage systems under minimal supervision.
  • Apply Unix system knowledge along with using a variety of inhouse and vendor supplied diagnostic tools to monitor and effect basic system repairs.
  • Troubleshoot moderately complex software hardware & networks. Document issues apply corrective action and repairs to the problem or notify the appropriate oncall personnel.
  • Receive document and accommodate all customer calls particularly during offhours and resolve customer issues if possible or escalate to the appropriate level.
  • Perform data center facilities monitoring problem remediation and emergency event response during normal daily operation and offhours.
  • Participate in the decommission process of older HPC systems & system relocation activities.
  • Promote the use of interdepartmental resources for tools metrics and common solutions to team members via email and presentations.
  • Perform a variety of technical tasks including installation diagnosis repair and maintenance of clustered computer systems and related file systems and networks. 
  • Perform other duties as assigned.

Additional job responsibilities at the 525.3

  • Act as escalation resource for advanced technical issues
  • Be a change agent to improve current processes and procedures
  • Act as subject matter expert on specific hardware or procedures

Qualifications :

  • Ability to obtain and maintain a U.S. DOE (L/Q)level security clearance which requires U.S. Citizenship.
  • Associates degree in a computerrelated field or equivalent combination of technical training and/or data center experience.
  • Mechanical ability to disassemble troubleshoot and replace electronic computer components.
  • Ability to document and maintain clear and accurate records with a high level of attention to detail.
  • Ability to learn new concepts multitask and prioritize workload in a rapidly changing environment.  
  • Proficient verbal and written communication skills necessary to interact with customers and team members with the ability to work as a member of a team.
  • Ability to work the assigned shift including a rotating weekend and holiday schedule.
  • Basic knowledge and experience working with Linux system administration commands and utilities.
  • Experience working with a team to prioritize participate and share workload efficiently. 
  • Ability to work all scheduled shifts including weekends and holidays.                                                            

Additional qualifications at the 525.3

  • Previous experience in Data Center environment troubleshooting hardware independently  prioritizing calls and acting as escalation resource
  • Advanced experience with file systems raid technology and general scripting methods
  • Leadership experience driving escalations directing daily tasks or managing projects


Additional Information :

#LIOnsite

Position Information

This is a Career Indefinite position open to Lab employees and external candidates.

Why Lawrence Livermore National Laboratory

Employment Type

Full-time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.