Job Title: HPC on AWS Lead / Specialist / SME - Remote
- Need is for a resource with US citizenship and an active Secret Service Clearance (SCI)
- Skill Set: DevOps/HPC tooling resource with experience in infrastructure landing zones, etc., for a Secret Deployment/AWS GovCloud project
Overview:
The AWS HPC Lead & SME is responsible for designing, implementing, and optimizing high-performance computing solutions on the AWS Cloud platform. This role combines deep technical expertise in distributed computing, data-intensive workflows, and AWS HPC services with the ability to lead architecture design sessions, define best practices, and ensure scalability, performance, and cost efficiency across enterprise or research workloads.
Key Responsibilities:
Lead the Design & Build: Develop scalable, high-performance architectures leveraging AWS HPC services such as AWS ParallelCluster, FSx for Lustre, EFA (Elastic Fabric Adapter), AWS Batch, and EC2 HPC instances.
Solution Implementation: Deploy, automate, and optimize HPC clusters and data pipelines for compute- and memory-intensive workloads, including modeling, simulation, genomics, CFD, AI/ML training, and financial risk analysis.
Performance Optimization: Benchmark, tune, and monitor system performance for compute, storage, and networking components to achieve optimal throughput and cost efficiency.
Infrastructure as Code (IaC): Implement reproducible environments using Terraform, AWS CDK, or CloudFormation to streamline provisioning, CI/CD, and configuration management.
Data and Storage Management: Design high-throughput parallel storage solutions using S3, FSx for Lustre, EBS, and EFS; integrate with hybrid and on-prem HPC environments.
Security and Compliance: Apply the AWS Well-Architected Framework and HPC security best practices to ensure compliance with enterprise, academic, or government standards.
Collaboration and Leadership: Partner with application scientists, DevOps teams, and business stakeholders to translate workload requirements into optimized HPC architectures. Provide mentoring and technical leadership across multidisciplinary teams.
Documentation and Knowledge Sharing: Develop architecture diagrams, reference implementations, and technical playbooks to support ongoing HPC adoption and operations.
Required Skills & Experience:
8-10 years of experience in high-performance computing, distributed systems, or cloud architecture.
Proven expertise in AWS HPC services (EC2 HPC, ParallelCluster, Batch, FSx for Lustre, EFA).
Strong knowledge of Linux systems administration, networking (InfiniBand, EFA, MPI), and job schedulers (Slurm, Torque, PBS Pro).
Hands-on experience with automation and IaC (Terraform, Ansible, CloudFormation).
Scripting and development proficiency (Python, Bash, or similar).
Experience with monitoring tools (CloudWatch, Grafana, Prometheus) and cost-optimization strategies.
AWS Certified Solutions Architect Professional or AWS Certified Advanced Networking preferred.
Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
Preferred Attributes:
Experience with GPU workloads, containerized HPC (ECS/EKS with ParallelCluster), or hybrid/on-prem-to-cloud HPC migrations.
Strong communication and presentation skills for executive and technical audiences.
Demonstrated thought leadership in HPC strategy, performance benchmarking, and AWS innovation.
Required Skills:
AWS SME to lead the design, build, and optimization of scalable HPC and EDA workloads on AWS.
The Senior AWS EDA (Electronic Design Automation) Architect & Subject Matter Expert will lead the architecture, design, and optimization of scalable high-performance compute (HPC) and EDA workloads on AWS. This role bridges semiconductor design engineering and cloud infrastructure, helping clients modernize their chip design workflows, reduce simulation runtimes, and optimize cost-performance at scale. The architect will serve as a strategic advisor to both engineering and IT leadership, ensuring secure, automated, and efficient cloud deployments tailored to EDA workloads such as Synopsys, Cadence, Siemens, and Ansys.
Key Responsibilities:
Architect and implement end-to-end AWS HPC/EDA environments, including compute, storage, networking, and licensing infrastructure optimized for large-scale design, simulation, and verification.
Lead cloud migration initiatives for on-premises EDA workloads to AWS using services such as AWS ParallelCluster, FSx for Lustre, EFA-enabled EC2, and AWS Batch.
Design cost-efficient architectures leveraging Spot Instances, Auto Scaling, and elastic file systems for dynamic simulation environments.
Integrate automation and DevOps pipelines using CloudFormation, Terraform, AWS CDK, and CI/CD tools for repeatable and compliant deployments.
Collaborate with semiconductor design teams to tune and benchmark workloads (timing analysis, simulation, emulation, verification, etc.) for optimal performance on AWS.
Advise on data management strategies for high-throughput EDA workflows, including S3 lifecycle policies, hybrid storage (FSx, EBS, EFS), and data lakes for results analytics.
Implement and enforce AWS security, identity, and compliance best practices (IAM, VPC isolation, PrivateLink, encryption, data residency).
Work closely with vendors and partners (e.g., Synopsys, Cadence, Siemens) to validate tool performance and ensure license server integration on AWS.
Provide subject matter expertise in HPC job schedulers (Slurm, PBS, LSF), cluster orchestration, and cloud-native workload management.
Mentor engineering and DevOps teams, establish best practices, and contribute to reusable architectures and playbooks across global projects.
Required Qualifications:
10 years of experience in HPC or EDA architecture, systems engineering, or cloud infrastructure roles.
5 years of hands-on experience designing AWS-based HPC or semiconductor design environments.
Deep expertise with EDA tools (Synopsys, Cadence, Siemens, Ansys) and understanding of parallel and distributed compute optimization.
Proficiency with AWS core services: EC2, S3, EFS, FSx for Lustre, Batch, Lambda, CloudFormation, CloudWatch, IAM, and VPC.
Strong scripting and automation experience (Python, Bash, Terraform, or CDK).
Solid understanding of networking, storage performance tuning, and job scheduling systems (Slurm, Grid Engine, PBS Pro, LSF).
Experience implementing secure, compliant, multi-account architectures with Control Tower, Organizations, and Service Catalog.
Proven ability to interface with engineering teams and executives, translating complex technical challenges into business solutions.
Preferred Qualifications:
AWS Certified Solutions Architect Professional or Advanced Networking Specialty.
Experience integrating EDA workloads with data analytics or ML pipelines on AWS (e.g., SageMaker, EMR, Glue).
Familiarity with chip design lifecycle workflows (RTL, synthesis, verification, sign-off, P&R, DFT).
Experience supporting hybrid cloud architectures and HPC burst-to-cloud strategies.
Background in Semiconductor, Aerospace, or High-Tech industries a plus.
Required Education:
Master's preferred