drjobs Lead Systems Engineer, High-Performance Computing

Lead Systems Engineer, High-Performance Computing

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Ashburn, IL - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

IaaS Systems and Storage & Engineering (ISSE) team is part of the Operations & Infrastructure technology organization. Distributed Compute engineering (DCE) is part of ISSE and High-performance compute platform engineering is part of DCE. Our vision mission and purpose are summarized as following:

Vision: To become a leading technical engineering professional pioneering in the design and automation of server infrastructure. We envision creating highly secure and efficient operations environments that drive business success and technological advancement.

Mission: Our mission is to deliver high-quality server infrastructure design and automated implementation. We are committed to operating in complex highly secure and highly available environments while maintaining rigorous operations security and procedural models.

Purpose: The purpose of this role is to utilize strong hands-on technical engineering skills to design and automate the implementation of server infrastructure based on business requirements. This role will interact with technology domain experts to maintain high security and availability in complex operational environments thereby driving business efficiency and security.

Essential Functions:

  • GPU as a Service and High-Performance Compute Platform Support: Expertise in deploying managing and optimizing GPU as a Service (GaaS) and high-performance compute platforms to support advanced computational workloads.
  • Extensive Datacenter Experience: Proficient in managing complex geographically distributed IT infrastructures to ensure high availability and performance.
  • Advanced Technical Knowledge: Profound understanding of high-performance highly available and secure computing systems utilizing x86 technologies and protocols (NVME GPU PCI-E).
  • Enterprise Server and Component Expertise: In-depth knowledge of server components (storage/network controllers HBA SSDs) and their functionalities essential for maintaining high-performance compute environments.
  • Processor and GPU Systems Proficiency: Strong grasp of Intel/AMD architectures GPU systems memory hierarchy and hardware-level security to enhance system performance and reliability.
  • Out-of-Band UEFI and BIOS Expertise: Comprehensive understanding of out-of-band management UEFI BIOS settings and their impact on system performance and security in high-performance computing environments.
  • Hardware Lifecycle Management: Experienced in hardware lifecycle management including firmware and OS driver certifications to ensure the longevity and reliability of compute resources.
  • Infrastructure Management and Automation: Proficient in installing configuring supporting and maintaining compute infrastructure management tools with skills in Ansible for automation to streamline deployment and operational tasks.
  • Performance Benchmarking and Tech Evaluation: Capable of running performance benchmarks and evaluating new technologies for various platforms (Linux Windows containerized and virtualized) to ensure optimal performance.
  • Scripting Proficiency: Advanced skills in scripting languages such as PowerShell and Python to automate and optimize infrastructure tasks.
  • Team and Independent Work: Highly motivated excellent team player capable of working independently with strong analytical and troubleshooting abilities to resolve complex issues and mentor junior staff.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site) with a general guidepost of being in the office 50% or more of the time based on business needs.


Qualifications :

Basic Qualifications:
10 years of relevant work experience with a Bachelors Degree or at least 7 years of work experience with an Advanced degree (e.g. Masters MBA JD MD) or 4 years of work experience with a PhD OR 13 years of relevant work experience.

Preferred Qualifications:
12 or more years of work experience with a Bachelors Degree or 8-10 years of experience with an Advanced Degree (e.g. Masters MBA JD MD) or 6 years of work experience with a PhD
Bachelors degree or higher in Computer Science Information Systems Computer Engineering Electrical or other relevant engineering field.
Broad knowledge in hardware software network and applications deployments thru automation
Hardware and infrastructure automation experience in at least one of the following server product lines - HP ProLiant Dell PowerEdge.
Strong technical analytical and troubleshooting skills and possess an ability to explain technical concepts and provide guidance to junior staff.
Experience in system monitoring with tools supporting unattended operations.
Engineering Knowledge to troubleshoot and solve storage issues (Hosts SAN switches and Storage Devices).
Engineering knowledge in TCPIP networking link aggregation redundancy switches routing and load-balancing.
Ability to write technical designs documentation and presentations for Compute Infrastructure.
Ability to provide level 3 support and guide level 2 administrators on problem resolution.


Additional Information :

Work Hours: Varies upon the needs of the department.

Travel Requirements: This position requires travel 5-10% of the time.

Mental/Physical Requirements: This position will be performed in an office setting.  The position will require the incumbent to sit and stand at a desk communicate in person and by telephone frequently operate standard office equipment such as telephones and computers.

Visa is an EEO Employer.  Qualified applicants will receive consideration for employment without regard to race color religion sex national origin sexual orientation gender identity disability or protected veteran status.  Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law including the requirements of Article 49 of the San Francisco Police Code.

U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is 160600.00 to 232900.00 USD per year which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge skills experience and location. In addition this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical Dental Vision 401 (k) FSA/HSA Life Insurance Paid Time Off and Wellness Program.


Remote Work :

No


Employment Type :

Full-time

Employment Type

Full-time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.