Sr. DevOps Engineer
Portland, TX - USA
Job Summary
Senior DevOps Engineer
We are looking for an experienced Senior DevOps Engineer to join our team. The ideal candidate will be able to work in a fast paced environment operate gracefully under stress effectively manage multiple assignments be self driven proactive and have great interpersonal and communication skills.
As a Senior DevOps Engineer you will play a key role in designing building and maintaining the infrastructure and processes that empower our development teams to deliver high-quality software quickly and reliably. You will be responsible for implementing and optimizing CI/CD pipelines managing cloud-based infrastructure and championing DevOps best practices throughout the organization. This role requires a strong technical background in DevOps practices cloud technologies automation tools and the ability to mentor and guide other team members.
Role & Responsibilities
CI/CD Pipeline Optimization: Design implement and continuously improve CI/CD pipelines to streamline the software delivery process ensuring rapid and reliable deployments.
Infrastructure Management: Manage and maintain our cloud-based infrastructure on Google Cloud Platform (GCP) ensuring high availability performance security and cost-effectiveness.
Automation Expertise: Automate repetitive tasks and processes to improve efficiency reduce manual errors and ensure consistency across environments.
Monitoring and Observability: Implement and maintain comprehensive monitoring and alerting systems to proactively identify and resolve issues ensuring the health and performance of our systems.
Mentorship and Collaboration: Share your expertise and mentor other team members on DevOps best practices tools and techniques. Collaborate with development QA and SRE teams to troubleshoot and resolve issues fostering a culture of collaboration and continuous learning.
Security and Compliance: Ensure our infrastructure and processes adhere to industry best practices and security standards protecting our systems and data from potential threats.
Incident Response: Participate in incident response and post-incident review processes to minimize downtime identify root causes and implement corrective actions.
Minimum qualifications
Experience:
5 years of experience in DevOps or a related field.
Proven track record of designing building and maintaining CI/CD pipelines infrastructure as code and cloud-based infrastructure.
Deep understanding of Google Cloud Platform (GCP) or other major cloud providers.
Hands-on experience with containerization (e.g. Docker) and orchestration (e.g. Kubernetes).
Skills:
Strong programming or scripting skills in Python Bash or other relevant languages.
Expertise in configuration management tools (e.g. Ansible Chef Puppet).
Proficiency in using version control systems (e.g. Git).
Solid understanding of networking and security concepts.
Excellent problem-solving and troubleshooting skills.
Strong communication and collaboration skills with the ability to mentor and guide others.
Education:
Bachelors degree in Computer Science Engineering or a related field.
Bonus Points:
Experience with cybersecurity tools and technologies.
Familiarity with monitoring and observability tools (e.g. Prometheus Grafana).
Knowledge of SRE principles and practices.
Contributions to open source projects related to DevOps.
Certifications in relevant cloud technologies or DevOps practices.
Required Technical Skills
Cloud Infrastructure
Expert: Google Cloud Platform (GCP)
Compute (Compute Engine Kubernetes Engine Cloud Functions App Engine)
Storage (Cloud Storage Persistent Disk Filestore BigTable)
Networking (VPC Load Balancing Cloud DNS Cloud CDN Cloud Interconnect)
Databases (Cloud SQL Cloud Spanner Firestore)
Security (Identity and Access Management Cloud Armor Security Command Center Secret Manager)
Monitoring (Cloud Monitoring Cloud Logging Cloud Trace)
Advanced:
Experience with infrastructure optimization and cost management on GCP
Knowledge of GCP best practices and architectural patterns
Bonus: Experience with other cloud providers (AWS Azure) or hybrid cloud environments.
Infrastructure as Code (IaC)
Expert: Terraform
Proficient: Ansible or similar configuration management tools (Chef Puppet)
Bonus: Experience with other IaC tools (e.g. CloudFormation Pulumi) or policy-as-code frameworks (e.g. Open Policy Agent)
CI/CD
Expert: Jenkins CircleCI GitLab CI/CD or similar tools
Advanced: Experience designing and implementing complex CI/CD pipelines with multiple stages environments and deployment strategies
Bonus: Experience with Tekton Argo CD or other Kubernetes-native CI/CD solutions
Containerization and Orchestration
Expert: Docker Kubernetes
Advanced: Experience with Kubernetes networking storage security and cluster management
Bonus: Experience with Helm (Kubernetes package manager) Istio (service mesh) or Knative (serverless platform)
Programming and Scripting
Proficient: Python Bash or other scripting languages (e.g. Ruby Perl)
Bonus: Experience with Go (Golang) or other compiled languages (e.g. Java C)
Monitoring and Observability
Proficient: Prometheus Grafana ELK Stack (Elasticsearch Logstash Kibana) or similar monitoring and logging tools
Advanced: Experience designing and implementing monitoring and alerting strategies for distributed systems
Bonus: Experience with distributed tracing (e.g. Jaeger Zipkin) or other observability tools (e.g. OpenTelemetry)
Security
Proficient: Security best practices for cloud infrastructure container security network security access control secrets management
Advanced: Experience implementing and maintaining security policies conducting security audits and reviews
Bonus: Experience with security scanning tools (e.g. Trivy Clair) penetration testing or security certifications (e.g. CISSP)
Additional Skills (Highly Desirable)
Experience with
Knowledge of chaos engineering principles and practices
Familiarity with GitOps workflows
Experience with serverless technologies (e.g. Cloud Functions AWS Lambda)
Understanding of cost optimization techniques for cloud infrastructure
About Eclypsium
Eclypsium is a supply chain security platform that builds trust in every device by identifying verifying and fortifying software firmware and hardware throughout enterprise infrastructure. Eclypsiums SaaS platform does this by integrating the bill of materials from suppliers and continuously monitoring to independently assess risk of each critical asset from chip to cloud throughout the life cycle and across enterprise ecosystems. Protecting Fortune 100 enterprises and federal agencies Eclypsium has been named a Gartner Cool Vendor in Security Operations and Threat Intelligence. A TAG Cyber Distinguished Vendor one of the Worlds 10 Most Innovative Security Companies by Fast Company a CNBC Upstart 100 a CB Insights Cyber Defender and an RSAC Innovation Sandbox finalist. For more information visit .
Benefits
Eclypsium headquarters are located in Portland OR with distributed remote employees and global teams in Argentina and Singapore. We offer competitive compensation and benefits packages and are committed to the well-being of our employees and their families.
Benefits & Perks include:
Competitive compensation & startup equity
Comprehensive medical dental and vision coverage
Life insurance short-term and long-term disability coverage
Flexible time off
Employee assistance program
- Employer sponsored 401k plan
Paid parental leave
Paid sabbatical
Home office support for remote employees
Regular events and celebrations
Equal Opportunity
Eclypsium is an equal opportunity employer. We believe in the importance of diverse teams and value candidates of all backgrounds. We do not discriminate on the basis of age ancestry citizenship color ethnicity family or medical care leave gender identity or expression genetic information marital status medical condition national origin physical or invisible disability status political affiliation veteran status race religion or sexual orientation.
Required Experience:
Senior IC
About Company
Eclypsium's platform enhances supply chain security by incorporating zero-trust in every device, fortifying hardware, firmware, and software.