Job Description
The Role:
The Senior Site Reliability Engineer/Developer is responsible for ensuring the reliability, scalability, and performance of software systems. Their job profile includes:
System Monitoring and Troubleshooting: Monitoring the performance and availability of software systems, identifying and resolving issues, and implementing proactive measures to prevent future incidents.
Automation and Infrastructure: Developing and maintaining automation tools and infrastructure to streamline software deployment, configuration management, and system monitoring.
Performance Optimization: Analyzing system performance, identifying bottlenecks, and implementing optimizations to improve the efficiency and scalability of software systems.
Incident Response and Root Cause Analysis: Responding to incidents, conducting root cause analysis, and implementing corrective actions to prevent similar incidents in the future.
Collaboration with Development Teams: Collaborating with software development teams to ensure that reliability and scalability considerations are incorporated into the software design and implementation.
Continuous Improvement: Identifying opportunities for process improvement, implementing best practices, and driving initiatives to enhance the reliability and performance of software systems.
What You'll Do:
Implement and evolve secure, highly available, and globally distributed systems powering GM's vehicle security platforms.
Own reliability roadmaps, establishing frameworks and strategies for system hardening, high availability, disaster recovery, and operational scalability.
Develop automation-first solutions to eliminate operational toil, with advanced use of languages such as Python, Go, and Java.
Lead incident response, driving systematic elimination of failure modes through blameless postmortems, PRRs, and cross-team preventative initiatives.
Drive observability strategies with best-in-class practices for metrics, logging, and distributed tracing using Prometheus, Datadog, or similar stacks.
Partner with engineering, platform, and security teams to design for reliability from inception, influencing architecture reviews and CI/CD best practices.
Lead optimization, capacity planning, and performance-tuning strategies for large-scale, security-critical platforms.
Introduce modern SRE practices such as chaos engineering, resilience testing, and progressive delivery to validate support teams and evolve system safety, along with SLOs, SLIs, and SLAs.
Mentor engineers across disciplines on SRE, platform resilience, secure operational practices, and architectural trade-offs.
Evaluate and adopt technologies (open-source, enterprise, homegrown) for security and reliability at scale.
Influence product strategy in partnership with engineering leads, ensuring operational reliability is prioritized alongside customer and business outcomes.
Your Skills & Abilities (Required Qualifications):
5 years of experience in Site Reliability Engineering, DevOps, or infrastructure/platform roles supporting secure, scalable systems.
Strong, proven expertise in designing and scaling cloud infrastructure (Azure) and container orchestration systems (Kubernetes, Docker).
Demonstrated mastery of infrastructure-as-code frameworks (Terraform, Helm, CloudFormation, etc.).
Proficiency in Python and one JVM language (Java or Kotlin) and working knowledge of Go.
Deep architectural understanding of distributed systems, networking, system design, and large-scale security practices.
Track record of architecting and running zero-downtime systems in production.
Experience with modern monitoring and reliability tooling and frameworks (Prometheus, Datadog, OpenTelemetry, etc.).
Experience leading incident response, uptime SLO/SLA management, and operational excellence initiatives across multiple teams.
Capable of influencing architecture and product strategy while maintaining a hands-on approach to systems reliability.
Exceptional communication skills, able to present complex trade-offs and foster alignment across executive, product, and engineering stakeholders.
What Will Give You A Competitive Edge (Preferred Qualifications)
BS/MS/PhD in Computer Science, Engineering, or equivalent industry experience.
Deep understanding of encryption technologies, secure data handling practices, and identity management.
Experience designing and operating IoT or automotive-focused architectures with rigorous availability and safety requirements.
Direct experience in chaos engineering, game-day testing, disaster recovery orchestration, and production load testing.
Ability to grow and mentor engineers into leaders in their domain, building SRE teams that can operate independently at scale.
Demonstrated success in defining and executing reliability strategies with measurable business impact.
Strong product mindset with the ability to balance engineering excellence with speed and business priorities.
About GM
Our vision is a world with Zero Crashes, Zero Emissions, and Zero Congestion, and we embrace the responsibility to lead the change that will make our world better, safer, and more equitable for all.
Why Join Us
We believe we all must make a choice every day, individually and collectively, to drive meaningful change through our words, our deeds, and our culture. Every day, we want every employee to feel they belong to one General Motors team.
Benefits Overview
From day one, we're looking out for your well-being, at work and at home, so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources.
Non-Discrimination and Equal Employment Opportunities (U.S.)
General Motors is committed to being a workplace that is not only free of unlawful discrimination but one that genuinely fosters inclusion and belonging. We strongly believe that providing an inclusive workplace creates an environment in which our employees can thrive and develop better products for our customers.
All employment decisions are made on a non-discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state, and local laws.
We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire.
Accommodations
General Motors offers opportunities to all job seekers, including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us or call us. In your email, please include a description of the specific accommodation you are requesting, as well as the job title and requisition number of the position for which you are applying.
Required Experience:
Senior IC
GM is home to Chevrolet, Buick, GMC & Cadillac and has been leading the auto industry for over a century. See how we create a vehicle for every drive.