Site Reliability Engineer (Cleared)

Not Interested
Bookmark
Report This Job

profile Job Location:

Denver, CO - USA

profile Monthly Salary: $ 100000 - 120000
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

Site Reliability Engineer (Cleared)

$100000 to $120000 USD (with up to 10% bonus potential) Paid Relocation

Denver Metro Area Colorado

Security Clearance: Active TS/SCI Clearance is REQUIRED.

Hybrid Remote (2-3 days on site) OR 9/80 work week availableRole Summary and Position Objectives

This critical role applies software engineering principles to operations to build and run large-scale fault-tolerant systems. You will be responsible for the continuous availability scalability and performance of mission-critical platforms used to support national security.

This position involves working with highly sensitive and classified information requiring an Active Top Secret/SCI Security Clearance. A relocation package is available for this position.

Core Responsibilities

As a Senior SRE you will drive the architecture and implementation of reliable and efficient infrastructure by:

  • System Reliability & Resiliency: Ensuring the survivability and $24/7$ uptime of mission-critical systems through robust design proactive monitoring and disaster recovery planning.

  • Automation and Toil Reduction: Designing developing and deploying automation tools and scripts to eliminate repetitive manual tasks (toil) across system administration deployment and configuration.

  • Infrastructure as Code (IaC): Developing and maintaining infrastructure using declarative tools (e.g. Terraform Ansible) to ensure consistency repeatability and version control across all environments.

  • Configuration Management: Implementing and enforcing best practices for configuration using Policy as Code and Configuration as Code methodologies across large Linux environments.

  • Monitoring and Observability: Implementing advanced monitoring logging and alerting solutions to detect and resolve system issues based on symptoms not just outages and define key Service Level Indicators (SLIs).

  • Incident Management: Serving as a technical leader during production incidents conducting root cause analysis (RCA) and implementing preventative measures to drive continuous improvement.

  • Collaboration: Working closely with Software Development Cyber Security and Mission Operations teams across the entire Software Development Lifecycle (SDLC) to ensure services are designed for scalability and reliability.

What Sets You Apart

  • Clearance & Experience: An Active TS/SCI Clearance combined with $5$ years of experience in a mission-critical SRE DevOps or highly-available Systems Engineering role.

  • Technical Depth: Expert-level administration and troubleshooting of Linux systems and strong proficiency in scripting languages (e.g. Python Bash).

  • Leadership: Demonstrated success providing technical leadership mentoring junior team members and championing new ideas and SRE/DevOps best practices.

  • Communication: Strong presentation documentation and communication skills with proven experience in negotiating technical solutions to meet challenging customer requirements.

  • Proactive Mindset: A commitment to ongoing learning and applying technology trends to solve operational challenges always seeking win-win solutions.

Our Commitment to You

  • Work/Life Balance: Flexible schedules including the option for a 9/80 work schedule (every other Friday off).

  • Career Growth: An exciting career path with continuous learning development and advanced training opportunities.

  • Benefits: Competitive benefits including $401k$ matching flex time off paid parental leave comprehensive healthcare health & wellness programs and more.

Site Reliability Engineer (Cleared)$100000 to $120000 USD (with up to 10% bonus potential) Paid RelocationDenver Metro Area ColoradoSecurity Clearance: Active TS/SCI Clearance is REQUIRED.Hybrid Remote (2-3 days on site) OR 9/80 work week availableRole Summary and Position ObjectivesThis critical r...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

True North Consultants: Expert Technology Recruiters specializing in recruitment solutions in software, IT, and emerging tech sectors.

View Profile View Profile