drjobs Sr Site Reliability Engineer SRE

Sr Site Reliability Engineer SRE

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Mclean - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Overview

Design. Disrupt. Repeat.
Be an agent of change on a team committed to achieving clientfocused missiondriven excellence. Steampunk is looking for an experienced Site Reliability Engineer with an appetite for taking on new challenges.

Who We Are
Steampunk is the explosive collision of humancentered design and traditional government contracting. An employeeowned company with a startup mindset and timetested approaches tailored for the federal government were passionate about creating solutions that are impactful practical scalable and most importantly that meet our clients everchanging needs.
At Steampunk we believe in disrupting the status quo and setting the pace in the ecosystem of government contractors while repurposing triedandtrue methodologies. We believe in empowering our people to find creative solutions to intractable problems. We believe the best environment in which to grow and thrive is outside our comfort zone. While good design makes for a good product we believe humancentered design makes for an excellent one.



Why Steampunk
Our people are the very core of what we do; their expertise and hunger for new and exciting challenges fuel our relentless pursuit of mission success. As part of our team of Punks youll test the status quo explore new boundaries and set the bar high for how government clients expect to engage with contractors.

Because we value our employees work/life balance (and believe those who work hard deserve to play hard) we offer a very competitive benefits package including telework/flex scheduling health/dental with orthodontics/vision insurance upon hire paid time off with a sellback benefit and carryover option 11 Federal Holidays 100 paid military leave 100 401(k) plan match upon hire professional development/education reimbursement all flexible spending accounts and more.

Contributions

As a Sr. Steampunk Site Reliability Engineer (SRE) you will be responsible for working with program development teams infrastructure and platform services teams and traditional operations and maintenance teams to embrace and embody a shared responsibility for the reliability of an organizations applications and infrastructure. As an SRE your primary responsibility is to combine aspects of software engineering with traditional operations to maintain and improve the reliability availability and performance of cloud infrastructure and largescale software systems and services while minimizing downtime and mitigating potential failures.


There are a wide variety of responsibilities you will be delivering in this role:

  • Infrastructure Optimization: Conduct indepth analyses of infrastructure identifying areas for improvement in terms of performance scalability and resource utilization. Collaborate with development and operations teams to implement enhancements utilizing software engineering and/or infrastructureascode principles to streamline deployment processes and ensure consistency across environments.
  • Reliability Metrics and Reporting: Define and implement key reliability metrics servicelevel objectives (SLOs) and servicelevel indicators (SLIs) to measure and report on the health of our systems. Establish monitoring and alerting mechanisms to proactively identify potential issues before they impact users.
  • Automation and Tooling: Design and implement automation tools to reduce manual toil streamline repetitive tasks and enhance overall operational efficiency. Leverage software development techniques to create robust scalable tooling that supports our reliability goals and collaborate with development teams to integrate reliability features into the development lifecycle.
  • Performance Optimization using Software Development Techniques: Collaborate with software development teams to optimize the performance and resilience of services through code improvements architectural enhancements and performance tuning. Integrate automated testing and profiling into the development pipeline to identify and address performance bottlenecks early in the development lifecycle.
  • Capacity Planning and Scaling: Collaborate with infrastructure teams to forecast capacity requirements ensuring our systems can seamlessly scale to meet growing user demands. Implement strategies for autoscaling and load balancing to optimize resource utilization and enhance overall system stability.
  • Collaboration and Training: Work closely with development teams to embed reliability best practices into the software development process. Provide mentorship and training to crossfunctional teams on SRE principles encouraging a shared responsibility for the reliability of our services.
  • Incident Management: Lead the development and implementation of incident response procedures ensuring timely and effective resolution of issues to minimize impact on users. Foster a culture of continuous improvement by conducting thorough postincident reviews identifying root causes and implementing preventative measures.
  • Infrastructure and Systems Monitoring: Observe and monitor systems to make sure you have the insight into system performance health availability and what is happening internally in the system. Understand what to monitor based on the system(s) you are managing where to store the monitoring data who can access historical monitoring data and how to look at the data to make determinations about future actions.

Qualifications

Required:

  • Masters degree and 8 years of experience; OR
    • Bachelors degree and 10 years of IT experience
  • Eligibile to obtain and maintain a government security clearance
  • Knowledge and experience with Agile and DevSecOps methodologies
  • Experience in system Engineering in one or more areas including telecommunications concepts computer languages operating systems database/Data Base Management System (DBMS) and middleware
  • Experience with the following software/tools:
  • Source code and binary repository products and techniques (GitHub GitLab BitBucket Artifactory Nexus etc.
  • Infrastructure and Cloud Management tools such as AWS CloudWatch
  • Log Management and Analysis tools such as Splunk
  • Automation and Configuration Management tools such as Terraform or Puppet

Preferred:

  • Knowledge and experience with NewRelic and/or other AIOps platforms
  • Have programming skills Javascript Ruby and/or Go
  • Experience with Nginx HAProxy Docker Kubernetes or similar technologies
  • Experience with messaging systems collaboration software applicationbased firewall and proxy server(s) and operating systems
  • Experience with Linux and Windows operating systems along with scripting tools and techniques such as Bash CSH KSH ZSH etc. and/or Powershell.
  • Experience with Monitoring and Alerting tools such as Prometheus Grafana and Datadog

About steampunk

Identity Statement

As part of the application process you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.

Steampunk is a Change Agent in the Federal contracting industry bringing new thinking to clients in the Homeland Federal Civilian Health and DoD sectors. Through our HumanCentered delivery methodology we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges. As an employee owned company we focus on investing in our employees to enable them to do the greatest work of their careers and rewarding them for outstanding contributions to our growth. If you want to learn more about our story visit .

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race color religion sex national origin disability status protected veteran status or any other characteristic protected by participates in the EVerify program.


Required Experience:

Senior IC

Employment Type

Unclear

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.