HPC SRE Engineer

Employer Active

1 Vacancy

Job Location

Dearborn, MI - USA

Monthly Salary

Not Disclosed

Job Description

We are seeking a highly skilled and motivated HPC (High Performance Computing) SRE Engineer to join our growing team. You will be responsible for monitoring and collecting metrics on our HPC infrastructure, triaging infrastructure issues from health monitoring alerts, and responding to customer incidents and tickets, so that our user community receives a high quality of service and our SLAs are met. This role will also focus on deploying and maintaining the metrics, logs, and monitoring stack we use to verify the health of our systems, and on automating responses to system issues.



Responsibilities

What you'll do

  • Implement monitoring solutions to ensure the health and availability of critical infrastructure and applications.
  • Collect metrics on system performance, service availability, and user experience.
  • Respond to infrastructure alerts and user community tickets to resolve issues that may impact business continuity or cause us to miss our SLA targets.
  • Build automation to restore health to hardware systems that have had failures.
  • Develop and maintain documentation for software and procedures.
  • Stay up-to-date on the latest advancements in HPC technologies and best practices.


Qualifications

You'll have...

  • Associate's degree in Computer Science, Engineering, or equivalent work experience
  • 5 years of experience in Systems or Software engineering
  • Strong understanding of Linux operating systems, preferably in an HPC environment
  • Experience with metrics collection tools such as Prometheus or Elasticsearch
  • Experience building visualizations and alerts in tools like Grafana or Kibana
  • Proficiency programming in one or more languages, preferably Go, Python, or Bash scripting
  • A self-motivated attitude and the ability to respond to alerts and fix issues autonomously
  • Strong communication and collaboration skills
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: Will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder, or all of the above? No matter what you choose, we offer a work life that works for you, including:
  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care, and more
  • Vehicle discount program for employees and family members, and management leases
  • Tuition assistance
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year's Day
  • Paid time off and the option to purchase additional vacation time
For a detailed look at our benefits, see the Benefit Summary.


Employment Type

Full-Time
