Solutions Architect – HPC and Container Platform
Job Description:
We are seeking an experienced Enterprise Solutions Architect with
expertise in High Performance Computing (HPC) and container platforms in
Linux-based environments to join our dynamic team. The ideal candidate
will be responsible for architecting solutions, including but not limited
to the installation, configuration, integration, and implementation of
HPC technologies and enterprise-grade container orchestration platforms.
We are looking for a person with very strong technical skills and a
proven track record of designing and working with container platforms
such as Red Hat OpenShift, SUSE Rancher, and CNCF Kubernetes; HPC
schedulers such as Slurm or PBS Pro; and HPC/AI cluster management
technologies such as HPCM or BCM. The candidate must also be able to
work in a dynamic, customer-focused team that requires excellent
interpersonal skills.
Key Responsibilities:
The candidate must have strong proficiency as a Solutions Architect in
at least three of the following areas and proficiency in the others.
1. Leadership and Strategy:
- Develop HPC and container platform roadmaps and strategies for
growth based on business needs.
- Engage in and enhance the complete service lifecycle, from
conceptualization and design to implementation and operation.
- Identify the growth path and scalability options of a solution
and include these in design activities.
2. Solution Planning and Design:
- Gather requirements, assess technical feasibility, and design
integrated HPC and container solutions that align with business
objectives.
- Architect and optimize the technical solutions to meet the
requirements of the customer.
- Identify the potential challenges and constraints that impact
the solution and project plan.
3. Opportunity Assessment:
- Respond to the technical sections of RFIs/RFPs and lead
proof-of-concept engagements to a successful conclusion.
- Utilize an effective consultative approach to advance
opportunities.
4. Innovation and Research:
- Stay abreast of emerging technologies, trends, and industry
developments related to HPC, Kubernetes, containers, cloud
computing, and security.
- Develop best practices, accelerators, and Show & Tell sessions for
HPC and container platform solutions and integrations.
5. Customer-Centric Mindset:
- Maintain a strong focus on understanding customer business
requirements and solving complex cloud technology issues.
- Be the trusted advisor, delight customers, and deliver
exceptional customer experiences to drive customer success.
- Communicate complex technical concepts and findings to
non-technical stakeholders.
6. Team Collaboration:
- Collaborate with cross-functional teams, including system
administrators, developers, data scientists, and project managers,
to ensure successful project delivery.
- Understand the roles of other teams and resources within the
company and engage them effectively.
- Mentor and train new team members and lead the way in
participation in tech talks, forums, and innovation.
Required Skills:
- Knowledge of and experience with Linux system administration, package
management, scheduling, boot procedures/troubleshooting, performance
optimization, and networking concepts.
- In-depth knowledge of and hands-on experience with HPC technologies:
workload schedulers (Slurm, Altair PBS Pro) and cluster managers
(HPCM, Bright Cluster Manager).
- In-depth knowledge of and hands-on experience with containerization
technologies like Docker, Singularity, or Podman.
- In-depth knowledge of and hands-on experience with at least two
container orchestration technologies, such as CNCF Kubernetes, Red Hat
OpenShift, SUSE Rancher RKE/K3s, Canonical Charmed Kubernetes, or HPE
Ezmeral Runtime. Good to have
- Good knowledge and hands-on experience with OpenStack cloud
solutions.
- Good knowledge and hands-on experience with virtualization
technologies like KVM and OpenShift Virtualization.
- Good knowledge and hands-on experience with at least two
Linux distributions, such as RHEL, SLES, Ubuntu, or Debian.
- Good knowledge of GPU technologies such as the NVIDIA GPU Operator and
NVIDIA vGPU.
- Good knowledge of the HPC networking stack (high-speed networking),
such as Mellanox InfiniBand.
- Good experience in performance optimization and health assessment of
HPC components such as operating systems, storage servers, parallel
file systems, schedulers, and container orchestration platforms.
- Must be proficient in at least one programming or scripting language,
such as Python or Bash.
- Good understanding of DNS, TCP/IP, routing, and load balancing.
- Knowledge of network protocols like TCP/IP, S3, FTP, NFS, or
SMB/CIFS.
- Familiarity with microservices architecture, CI/CD pipelines, and
DevOps tools.
- Excellent problem-solving abilities, analytical thinking, and
communication skills to interact with technical and non-technical
stakeholders.
- Ability to lead technical projects by gathering the requirements,
preparing the architecture/design, and executing it end to end.
Must be able to bring clarity to and drive complex projects involving
multiple stakeholders.
- Solid business acumen and ability to converse with clients on issues
and challenges.
- Demonstrates a solid knowledge of the company's breadth of
solutions.
- Demonstrates a high-level understanding of all related vendor
product roadmaps.
Qualifications:
- Bachelor's/Master's degree in Computer Science, Information
Technology, or a related field.
- Proven experience as a Solutions Architect, HPC and Container
Platform Specialist, or similar role, with expertise in designing
and implementing complex solutions.
- Red Hat Certified Specialist in Containers and Kubernetes (RHCSA,
RHCE) or a CNCF certification (CKA, CKAD, CKS) is preferred.
- Typically 6-8 years of experience in delivering complex HPC and
container platform projects.
Required Skills:
CONSULTATIVE APPROACH, FTP, LOAD BALANCING, CLOUD COMPUTING, LINUX SYSTEM ADMINISTRATION, CI/CD, DEVOPS TOOLS, RHCSA, DESIGN ACTIVITIES, ANALYTICAL THINKING, SCALABILITY, KUBERNETES, CONCEPTUALIZATION, SLES, OPENSTACK, TCP/IP, DOCKER