Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailNot Disclosed
Salary Not Disclosed
1 Vacancy
We are looking for a Reliability Engineer who is based out of Reston VA.
These roles are Hybrid Role with 3 Days a week to Reston Office
Contract
6 Months extendable
Role : Reliability Engineer
Description:
We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) . The ideal candidate will have a strong background in cloud platforms DevOps practices and modern software development frameworks. The SRE will play a critical role in designing building and maintaining highly scalable fault-tolerant and secure cloud infrastructure while ensuring operational excellence high availability and reliability.
Key Resposibilities
Required Qualifications
1. Well versed in AWS (ECS EC2 RDS Redshift EMR Lambda Route 53 Step Functions). Must have hands on experience
2. DevOps - Infrastructure as Code CICD - Jenkins GitLab Terraform
3. Well versed in SRE concepts SLO Error Budget Alarms Monitoring etc. Must have implemented these concepts hands on
4. Programming using Python/Java
5. Experience in APM & observability using Splunk Dynatrace
Nice to have
1. Release Engineer
2. Production Support
3. Performance Testing
Preferred Qualifications:
Experience with AI/ML libraries (e.g. NLTK Transformers Spacy SciPy) Amazon SageMaker and GenAI tools.
Familiarity with project management tools like JIRA Confluence and ServiceNow.
Knowledge of utilities like AWS CLI POSTMAN and curl.
Required Skills
Expertise in cloud platforms (AWS Azure or GCP) and container orchestration.
Proficiency in programming/scripting languages such as Python Java Bash and PowerShell.
Strong knowledge of database technologies (e.g. PostgreSQL MongoDB DynamoDB Oracle Redshift).
Experience with DevOps tools (Jenkins Docker Nexus/Artifactory) and build tools (Maven Gradle).
Familiarity with AI/ML integrations event-driven architectures and distributed systems.
Expertise in observability logging and monitoring tools (AWS CloudWatch Splunk Dynatrace OpenTelemetry).
Strong understanding of security practices including IAM RBAC and vulnerability management.
Experience with chaos engineering resiliency assessments and disaster recovery planning.
Proficiency in performance testing tools (JMeter LoadRunner) and capacity planning.
Excellent verbal and written communication skills with the ability to collaborate across teams.
8 years of related experience in their specific area with experience leading teams on projects with similar scope and complexity.
Bachelor s or master s degree in computer science or equivalent.
Certifications: AWS Solutions Architect Agile Certified Practitioner (ACP) or relevant cloud certifications.
Full-time