Infrastructure Cloud Engineer
HIGHLIGHTS
Location: Wilmington, NC or McLean, VA
Position Type: Direct Hire
Hourly / Salary: Based on experience
Job Summary
Our client is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our team. The SRE will play a critical role in designing, implementing, and maintaining the reliability, scalability, and performance of our client's cloud-based systems hosted in AWS. You will collaborate closely with software engineers, operations teams, and other stakeholders to enhance system reliability and developer productivity through automation, monitoring, and incident response.
Responsibilities
Design and implement scalable, reliable, and secure cloud infrastructure in AWS.
Develop and maintain monitoring, alerting, and dashboarding solutions to ensure system health and uptime.
Automate infrastructure provisioning and configuration management using tools such as Terraform (Terragrunt), CDK, and CloudFormation.
Implement CI/CD pipelines to streamline deployments and improve development workflows.
Respond to incidents, perform root cause analysis, and implement permanent fixes to prevent recurring issues.
Optimize system performance, reliability, and cost-effectiveness in collaboration with engineering teams.
Drive infrastructure improvements and advocate for best practices in system design and operations.
Establish and manage disaster recovery plans, ensuring system availability during unexpected events.
Minimum Education
Preferred Education
Minimum Experience
Preferred
Minimum Skills
Argo CD and Argo Workflows
IaC: Terraform and Terragrunt
Kubernetes and Linkerd
AWS (EKS, Fargate, Aurora)
Security and Compliance
Containerization (Docker)
Logging and Monitoring Tools
Programming or Scripting Language
DB Management
Version Control (Git)
Incident Management
Preferred Skills
Experience with Datadog
Experience with Cloudflare
Mortgage Domain Knowledge
Advanced Security Practices (GuardDuty, Security Hub)
Disaster Recovery Planning
Experience with workflow automation tools such as Camunda
Minimum Certification(s)
Preferred Certification(s)
We are GTN, The Go To Network