Site Reliability Engineer
Norfolk, MA - USA
Job Summary
Arctiq is a global intelligence-driven technology services company delivering professional and managed services across Hybrid Cloud Infrastructure Networking & Connected Experiences Cybersecurity Data & AI Autonomous Operations & Intelligence and Enterprise Service Management. We help organizations operate secure and modernize complex environments by unifying infrastructure networking data security automation and observability under a single integrated operating model. Our work focuses on helping customers reduce operational friction improve resilience and make better faster decisions as their environments evolve. Arctiq builds on decades of industry expertise and a customer-centric ethos to deliver exceptional value to clients across diverse industries.
The Site Reliability Engineer will focus on the execution and maintenance of reliability engineering practices for mission-critical government systems. Following the SRE Implementation Plan you will bridge the gap between development and operations by applying a software engineering mindset to system administration. You will be responsible for building automation maintaining CI/CD pipelines and ensuring system health through robust monitoring.
This is a remote contract opportunity for a project Arctiq is delivering for a client. Candidates must have or be able to obtain a Secret Clearance.
Key Responsibilities
- Monitoring & Observability: Implement and maintain dashboards and alerting rules using Prometheus Grafana or ELK Stack. Support the identification of Service Level Indicators (SLIs).
- Automation: Develop and maintain Infrastructure as Code (IaC) scripts using Terraform and Ansible to ensure repeatable error-free deployments.
- CI/CD Management: Maintain automated deployment pipelines ensuring security scans and automated tests are integrated into the workflow.
- Incident Response: Participate in on-call rotations and assist in troubleshooting system outages. Contribute to blameless post-mortem reports to drive continuous improvement.
- Toil Reduction: Identify repetitive manual tasks and develop automation to reduce toil allowing the team to focus on high-value engineering.
Required Qualifications
- 35 years of experience in SRE DevOps or Systems Engineering roles.
- Proficiency in scripting languages (Python Go or Bash).
- Hands-on experience with containerization (Docker Kubernetes) and cloud platforms (AWS Azure or GCP).
- Familiarity with NIST SP 800-53 security controls.
- Education: Bachelors degree in Computer Science or a related technical field.
Required Experience:
Manager
About Company
As a systems integrator and managed service provider, Arctiq provides Hybrid Cloud Infrastructure, Networking, Cybersecurity, Data and AI, Autonomous Operations, and ESM to deliver measurable outcomes.