Title: Senior Solution Architect High Performance Computing (HPC)
Location: Eglin AFB FL (Onsite)
Duration: 12 Months
Position Overview
We are seeking a Senior Solution Architect with hands-on experience in High Performance Computing (HPC) administration and infrastructure operations. The consultant will assist in managing and maintaining a large HPC cluster environment ensuring optimal system performance reliability and alignment with operational service levels.
The role includes supporting daily cluster operations assisting in solution documentation and transferring knowledge of best practices for HPC environments. The ideal candidate will be proactive collaborative and capable of working in a dynamic mission-critical setting.
Key Responsibilities
Administer and maintain a large-scale HPC cluster supporting compute GPU and storage nodes.
Share operational responsibility for the solutions infrastructure configuration provisioning maintenance monitoring and service-level alignment.
Assist in documenting operational procedures system policies and best practices in coordination with key stakeholders.
Provide knowledge transfer and training on best practices for daily HPC operations.
Support troubleshooting and performance tuning to ensure system stability and efficiency.
Collaborate closely with project management and technical leads when addressing complex issues.
Technical Environment
The HPC environment includes:
Up to 300 compute nodes (various configurations)
GPU nodes
Head and login nodes
Parallel file systems (e.g. BeeGFS Lustre or GPFS)
Dell and Mellanox hardware
Infiniband and GbE network switches
HPC software and applications (e.g. MPI Schedulers simulation tools)
Required Skills
Strong experience with Linux administration (RHEL or CentOS preferred)
Hands-on experience in HPC cluster management and operations
Familiarity with system monitoring configuration and maintenance
Excellent problem-solving and communication skills
Ability to collaborate effectively in a team environment and escalate issues appropriately
Preferred Qualifications
Experience working in federal or government environments
Active or obtainable security clearance
Familiarity with parallel file systems (BeeGFS Lustre GPFS)
Experience with MPI Schedulers and cluster management tools (e.g. Bright Cluster Manager)
Knowledge of Dell and Mellanox hardware solutions
Experience supporting GPU-based applications and engineering workloads
For more details reach at
Required Experience:
Senior IC
Title: Senior Solution Architect High Performance Computing (HPC) Location: Eglin AFB FL (Onsite) Duration: 12 MonthsPosition OverviewWe are seeking a Senior Solution Architect with hands-on experience in High Performance Computing (HPC) administration and infrastructure operations. The consultant ...
Title: Senior Solution Architect High Performance Computing (HPC)
Location: Eglin AFB FL (Onsite)
Duration: 12 Months
Position Overview
We are seeking a Senior Solution Architect with hands-on experience in High Performance Computing (HPC) administration and infrastructure operations. The consultant will assist in managing and maintaining a large HPC cluster environment ensuring optimal system performance reliability and alignment with operational service levels.
The role includes supporting daily cluster operations assisting in solution documentation and transferring knowledge of best practices for HPC environments. The ideal candidate will be proactive collaborative and capable of working in a dynamic mission-critical setting.
Key Responsibilities
Administer and maintain a large-scale HPC cluster supporting compute GPU and storage nodes.
Share operational responsibility for the solutions infrastructure configuration provisioning maintenance monitoring and service-level alignment.
Assist in documenting operational procedures system policies and best practices in coordination with key stakeholders.
Provide knowledge transfer and training on best practices for daily HPC operations.
Support troubleshooting and performance tuning to ensure system stability and efficiency.
Collaborate closely with project management and technical leads when addressing complex issues.
Technical Environment
The HPC environment includes:
Up to 300 compute nodes (various configurations)
GPU nodes
Head and login nodes
Parallel file systems (e.g. BeeGFS Lustre or GPFS)
Dell and Mellanox hardware
Infiniband and GbE network switches
HPC software and applications (e.g. MPI Schedulers simulation tools)
Required Skills
Strong experience with Linux administration (RHEL or CentOS preferred)
Hands-on experience in HPC cluster management and operations
Familiarity with system monitoring configuration and maintenance
Excellent problem-solving and communication skills
Ability to collaborate effectively in a team environment and escalate issues appropriately
Preferred Qualifications
Experience working in federal or government environments
Active or obtainable security clearance
Familiarity with parallel file systems (BeeGFS Lustre GPFS)
Experience with MPI Schedulers and cluster management tools (e.g. Bright Cluster Manager)
Knowledge of Dell and Mellanox hardware solutions
Experience supporting GPU-based applications and engineering workloads
For more details reach at
Required Experience:
Senior IC
View more
View less