Sr. Cloud DevOps Engineer
Clearance: US Citizenship is required / Ability to obtain a Public Trust
Job Location: Remote, US
Overview:
Varada Consulting proudly supports NASA's High Performance Computing Services program in Mountain View, CA at the Ames Research Center and in Greenbelt, MD at Goddard Space Flight Center. Make a DIFFERENCE on a program that supports 4 On-site Supercomputers 18,000+ nodes, 17+ combined petaflop supercomputer systems.
We have an immediate position for a Sr. Cloud DevOps Engineer to join our newly formed HPC Cloud team in Mountain View, CA (AMES). Remote work is possible with occasional travel to AMES.
The successful candidate will be an active member of the HPC Cloud Team charged with working with our NASA customers requirements to architect an HPC cloud solution for the Supercomputing Division.
This position will report directly to the Manager of the Application Performance and Productivity (APP) group and will work to design and deliver hybrid cloud solutions with a focus on high performance computing (HPC) and scientific data processing. The position responsibilities include partnering with engineering and development teams and will co-lead, with a government counterpart, in the design of hybrid cloud solutions that enable rapid adoption of new cloud services, intelligent workload distribution while leveraging existing on-prem HPC infrastructure
Responsibilities:
- Develop and sustain a cloud service supporting the HPC program for both hybrid and non-hybrid cloud solutions using standard industry modeling and
- diagramming and best practices.
- Plan and achieve project objectives; technically guide projects through completion and ensure all project objectives are met within target time frames.
- Partner with external development and operations teams to develop automation solutions for deployment, monitoring and securing of cloud infrastructure.
- Develop tooling for DevOps and provide practical guidance to improve efficiency and consistency for any solution development and deployment.
Requirements:
- Degree (or equivalent experience), in computer science, engineering, or related field
- 5+ years of overall experience in addition to education requirements, working with full-life cycle in software/cloud architecture and IT and/or software
- development
- In-depth hands-on work experience with large-scale cloud solutions
- Strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
- Experience with continuous integration & continuous deployment tools, processes and basic agile methodologies
- Knowledge of networking, firewalls, etc.
- Experience with how to navigate complex government security and compliance controls within a large organizational setting
- Strong analytical skills with the ability to learn new information quickly
- Good organization skills to balance and prioritize work, and ability to multitask
- Ability to work in a hybrid remote/onsite team environment
- Excellent communication and people skills, time management, and organizational skills
-
Technical Skills:
- Expertise in AWS, GCP, or Azure platforms, including storage, networking, security, container hosting/orchestration, serverless, and user/permissions management (AWS strongly preferred)
- Expertise in Linux server administration, including networking, firewalls, security, user identity/permissions, filesystems, and packaging
- Proficiency with container runtimes such as Docker, Singularity, Podman, or CharlieCloud
- Proficiency with cluster schedulers such as PBS or Slurm (PBS preferred)
- Proficiency with Jira, Confluence or similar collaborative tools
- Proficiency with GitHub or GitLab, preferably self-hosted
- Proficiency with Continuous Integration/Continuous Deployment (CI/CD) processes and tools Proficiency with configuration management tools such as Ansible, Chef, or Puppet
- Proficiency with infrastructure as code such as Terraform, Pulumi, CloudFormation, or AWS C
- DK Proficiency in one or more programming languages (Python preferred)
- Familiarity with scientific and parallel computing, machine learning
- Familiarity with object store (S3) and POSIX file systems such as Lustre, and any potential integration of the two, e.g., S3 backed Luster (AWS FSx)
Join an Award Winning Team! Voted as Most Innovative and Fastest Growing Company, Varada Consulting offers highly customized IT capabilities in the federal civilian and DoD market space in support of the mission objectives of the federal government. Varada provides competitive compensation and benefits packages including 100% employer paid healthcare premium.
Varada Consulting, LLC is an Equal Employment Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.