Site Reliability Engineer
Job Summary
Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team, work real-time, hands-on fielding challenges, and develop reusable solutions that support our customers in any environment. You will have the opportunity to contribute to the design, requirements, and implementation of improvements that accelerate the secure delivery, implementation, and sustainment of software in the field. You will automate the buildout of infrastructure in cloud and on-premises environments to operate Kubernetes clusters and microservices. In this role, you will join dynamic Agile software teams that are singularly focused on providing world-class solutions to our customers in an exciting, collaborative, and inclusive atmosphere. You will be intellectually challenged and provided with a tremendous opportunity for growth in a fast-paced and fun environment that includes time both in the field and with the software teams.
You'll learn, master, and improve the fielding and diagnostic features, processes, and tools we use to deploy and sustain our cloud-based and on-premises solutions in multiple hosting environments such as AWS, Azure, VMware, and others. You'll learn new technologies and tools and apply what you've learned to overcome technological challenges with innovative solutions. You'll collaborate with other software engineers and SREs to share your knowledge with the team and the organization to make us all better at what we do. You'll perform technical spikes and develop concepts for improved fielding and remote diagnostics to achieve more efficient and sustainable fielding events.
Primary Responsibilities:
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding of a microservice enterprise system (cloud and on-premises)
- Partner with development teams to improve services, diagnostics, and deployment tools through gap identification, concept development, and rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through service automation
- Design, develop, troubleshoot, and debug mission-critical infrastructure, on-prem and remote
- Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site activities.
- Participate in the concept design of reusable infrastructure components for scalable, highly available, secure architectures for cloud-native applications.
- Enable the continuous integration and continuous delivery of our diverse suite of software products by applying best practices for infrastructure provisioning, configuration, and automated software deployments.
- Continually evaluate fielded system deployments and apply best practices to facilitate continuous improvement that can be applied across teams.
- Work closely with other engineers to develop the best technical design and approach for new product installation and field service activities (software patches, cyber updates, etc.)
- Develop solutions to complex technical issues and problems that impact multiple areas or disciplines.
- Communicate with internal team members across multiple areas and coordinate completion of key deliverables across teams.
- Liaise with external and internal customer stakeholders on technical design decisions and trade-offs, and ensure the software solution will meet required functional, performance, and SLA thresholds, especially with customer network interfaces.
- Mentor other SREs in the art of building, deploying, and maintaining production mission-critical microservice enterprise systems.
- Resolve roadblocks for the field service team, working collaboratively with the product, engineering, technical leadership, and others.
Basic Qualifications:
- Typically requires a Bachelor's degree in computer science or computer engineering with 8 years of experience. Additional experience may be considered in lieu of a degree.
- Must be able to pass an in-depth background check (CBP Public Trust BI).
- Experience delivering entire projects or processes spanning multiple technical areas.
- Experience serving as a technical lead managing large projects or processes.
- Working knowledge of Agile Development and continuous integration and continuous delivery methodologies and tools.
- Expertise with Linux and Windows operating systems, network administration, and networking protocols/functions (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS).
- Expertise provisioning and managing resources within IaaS/cloud infrastructures (e.g., Azure, AWS, Google Cloud Platform, etc.).
- Experience with Terraform, Ansible, Helm, Bash scripting, CloudFormation, Chef, Puppet, or similar technologies.
- Experience troubleshooting PLC end-device software and service-level computers on the edge that feed our on-prem and cloud-based server systems.
- Expertise with container technologies such as Docker and container orchestration tools like Kubernetes.
- Expertise with the Kubernetes kubectl command-line tool.
- Expertise with a version control system (e.g., Git).
- Strong, self-motivated desire to learn new tools, frameworks, and techniques.
- Ability to complete tasking independently with minimal direct supervision.
- Ability to work and collaborate effectively within a multi-disciplined engineering team.
- Ability to travel up to 70% of the time to remote locations, mostly in the US along the border, to troubleshoot network and software bugs during initial deployment and sustainment.
- U.S. Citizenship is required.
Preferred Qualifications:
- Experience with enterprise event broker technologies (Kafka, NATS)
- Experience with monitoring and alerting tools such as Grafana and Prometheus
- Experience with API gateways such as Istio
- Experience with GitOps tools such as Argo CD, Flux CD, Fleet, or similar
- Professional cybersecurity certification such as Security+ or similar.
- Knowledge of Agile Development methodologies.
- Familiarity with at least one Relational Database Management System (Oracle, MySQL, PostgreSQL, SQL Server, etc.).
If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 and moving faster than anyone else dares.
Original Posting:
June 4, 2026
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days, with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
$107,900.00 - $195,050.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Required Experience:
IC
About Company
Leidos is an innovation company rapidly addressing the world's most vexing challenges in national security and health. Our 47,000 employees collaborate to create smarter technology solutions for customers in these critical markets.