Senior DevOps Engineer Build Environment Manager
Arlington, TX - USA
Job Summary
We are seeking a Senior DevOps Lead Engineer to lead and evolve the DevOps function for the Be The People digital ecosystem a suite of highly visible public-facing applications. This is a senior individual contributor role with strong technical leadership expectations responsible for environments CI/CD pipelines release readiness and operational guardrails.
This role operates with a high degree of autonomy and judgment balancing delivery velocity with reliability security and long-term maintainability. You will collaborate with software engineers data engineers architects cloud engineers and product stakeholders to tackle some of societys most pressing challenges.
How You Will Contribute
DevOps & Platform Ownership
- Lead the DevOps function end-to-end for Be The People and contribute to how applications are assembled tested released and operated
- Lead creation and lifecycle management of environments including spinning up new environments and coordinating content and data seeding
- Contribute to the design and evolution of platform architecture with an emphasis on scalability resilience and cost efficiency
- Establish cloud standards and manage cloud operations in cooperation with Cloud Engineering.
CI/CD & Automation
- Design implement and maintain CI/CD pipelines using GitHub Actions as the primary orchestrator
- Establish reusable workflow templates and best practices without introducing unnecessary complexity
- Apply Infrastructure-as-Code discipline including versioning promotion across environments and drift prevention using tools such as Terraform Terragrunt CDK Ansible or CloudFormation
- Develop and debug Python Ansible Playbooks Terraform and other infrastructure-as-code tooling
Release Readiness & Operational Guardrails
- Own or steward the release management function acting as a go/no-go checkpoint to ensure required testing has occurred stakeholder sign-offs are complete and risks are clearly surfaced
- Participate in production incident response root-cause analysis and post-incident learning
- Construct a resilient ecosystem capable of quickly restoring services and participate in developing the disaster recovery playbook for Be The People
Observability Reliability & Security
- Implement monitoring alerting and observability for production systems using tools such as Prometheus Grafana Nagios or ELK Stack
- Participate in the development implementation and automation of security policies
- Apply baseline security best practices across infrastructure and pipelines
- Apply SRE-inspired practices such as defining SLIs/SLOs and designing systems operable by teams beyond DevOps
Cloud CDN & Infrastructure
- Deeply understand CDN configuration (e.g. Cloudflare) including caching layers and their impact on testing and production behavior
- Be competent in DNS concepts and troubleshooting even if DNS ownership resides elsewhere
- Optimize cloud infrastructure (AWS primary; familiarity with Azure and GCP a plus) for performance reliability and cost
- Provide architectural guidance and design recommendations for cloud assets resource consolidation and standardized practices
Testing & Quality Enablement
- Help shape test automation strategy recognizing AI-assisted testing and organizational QA maturity constraints
- Ensure testing happens and is validated rather than personally writing all tests
- Partner with the testing team on test automation test tracking and incorporate into the release process
- Embed quality checks as first-class controls in delivery pipelines
Technical Leadership & Collaboration
- Serve as a technical leader who leads through influence rather than authority
- Collaborate cross-functionally with application infrastructure and data teams
- Participate in technical screening and interview panels including short high-signal technical screens
- Coach and mentor engineers on cloud and DevOps best practices raising overall DevOps maturity
- Create and maintain durable documentation including environment definitions deployment processes and runbooks
What You Will Bring
Experience
- 10 years of relevant DevOps and Cloud Engineering experience this is a senior-level role not suitable for junior or mid-level candidates
- Proven experience operating production systems with real customer and business impact
- Minimum 5 years of professional Cloud Engineering background with deep AWS expertise
Technical Skills
- CI/CD: GitHub Actions (primary) plus experience with CircleCI Jenkins Bamboo or AWS CodePipeline
- Infrastructure as Code: Terraform AWS CDK Serverless Stack (SST) Ansible Playbooks or CloudFormation
- Containers & Orchestration: Docker Docker Swarm Kubernetes Rancher
- Cloud Platforms: AWS (EC2 RDS DynamoDB DocumentDB Lambda SQS SNS ECS ECR Elastic Load Balancers S3 Amplify CodeBuild); familiarity with Azure Google Cloud and Acquia
- CDN & Networking: Cloudflare; understanding of DNS Custom TCP SSH HTTPS UDP VPNs Load Balancing and Firewalls
- Monitoring & Observability: Prometheus Grafana Nagios ELK Stack or Graylog
- Databases: DocumentDB / MongoDB (operational context); SQL background advantageous
- Security: Secure SDLC concepts including secrets management scanning and least-privilege access
- Languages: Python PowerShell JavaScript or Java (for Lambda and scripting)
- Version Control: Git and GitHub
- Configuration Management: Ansible Chef or Puppet
- Project Management Sprint Planning & Release Planning: Jira and Confluence
Leadership & Soft Skills
- Strong judgment and ownership comfortable saying no or not yet when appropriate
- Formal people-management experience is not required; demonstrated technical leadership through influence is expected
- Ability to communicate clearly with both technical and non-technical stakeholders around risk and tradeoffs
- Comfortable with command-line tools and the Linux/Unix environment
- Enthusiasm to contribute to Stand Togethers vision and principled approach to solving problems and a commitment to stewarding our culture which champions values including transformation and innovation entrepreneurialism humility and respect.
What Success Looks Like
- Releases are predictable governed process driven and low-drama
- CI/CD pipelines are reliable understandable and trusted by engineering teams
- Systems are scalable observable and operable by the broader organization
- DevOps acts as an enabler with intentional guardrails not a bottleneck
- The platform and practices are positioned to scale with future organizational growth
Standout Candidates Will Bring
- AWS or Azure certifications (Certified Developer DevOps Engineer Solutions Architect Data Analytics or Database)
- Experience with regulated or compliance-sensitive environments
- Exposure to multi-product or platform organizations
- Experience modernizing legacy delivery practices
- Background in building and deploying web applications from source code
- DevSecOps experience including code analysis vulnerability management regulatory compliance security policy monitoring to help build a security-aware culture
What We Offer
- Competitive benefits: Enjoy a 6% 401(k) match with immediate vesting flexible time off comprehensive health and dental plans plus wellness and mental health support through Peloton and Talkspace.
- A meaningful career: Join a passionate community of over 1300 employees dedicated to improving lives and driving innovative solutions to complex social challenges.
- Commitment to growth: Thrive in a non-hierarchical environment that empowers employees to discover develop and apply their unique talents.
- Competitive compensation: Our approach rewards the value you create through competitive salaries and bonus opportunities allowing you to share in the success you help drive.
Required Experience:
Manager
About Company
Stand Together is a philanthropic community that helps America’s boldest changemakers tackle the root causes of our country’s biggest problems.