Beacon Systems Inc., a subsidiary of Radiant Digital Solutions, delivers Program Management, Science, Engineering, and Technology Solutions to Federal, Commercial, State, and Local Agencies. Our support extends across leading organizations such as the DoD, NASA, FDA, Voice of America, and several U.S. state governments, including Florida, Rhode Island, Mississippi, North Dakota, Virginia, and West Virginia.
We are currently seeking a DevOps Engineer, Infrastructure Automation, for a contract opportunity in Dallas, TX. If you're interested, please ensure your most recent resume highlights all the required skills and experience listed below.
We are looking for a highly skilled and motivated Senior DevOps Engineer to join the Storage and Compute Platform Management team. This contractor role focuses on designing and automating scalable infrastructure systems that support our high-performance computing (HPC) and large-scale storage environments.
You will be a key contributor in building tools and processes that ensure reliability, observability, and performance for multi-megawatt-scale CPU and GPU compute farms used in quantitative research and machine learning workloads.
Design and implement infrastructure automation frameworks for provisioning HPC and storage platforms.
Apply infrastructure-as-code and configuration management best practices to ensure system consistency and repeatability.
Collaborate with platform and DevOps teams to enhance system scalability, reliability, and observability.
Monitor and troubleshoot infrastructure performance and reliability issues across compute and storage components.
Drive continuous improvement initiatives through performance tuning, automation, and capacity planning.
Support deployment and operation of distributed systems across the enterprise.
Extensive experience with infrastructure engineering, especially in compute and storage systems at scale.
Strong background in Python programming for automation, scripting, and integration tasks.
Expertise in CI/CD pipelines and tools such as Jenkins, GitLab CI, or ArgoCD.
Proficient with Infrastructure-as-Code and configuration management tools like Terraform, Ansible, and Puppet.
Experience with observability and monitoring tools (e.g., Prometheus, Grafana, ELK Stack).
Solid understanding of Linux system administration and networking principles.
Hands-on experience with containerization and orchestration platforms (Docker, Kubernetes).
Familiarity with public cloud platforms (AWS, Azure, GCP) and hybrid environments.
Prior exposure to HPC environments or large-scale storage infrastructure is highly desirable.
Strong communication skills, attention to detail, and a proactive, collaborative work style.
Experience working in fast-paced, high-availability environments.
Ability to work independently and manage complex technical projects from start to finish.
Passion for automation, scalability, and performance optimization.
Contract