CR260-Senior DevOps Operations Engineer – SRE

SoftSol, Inc.

Job Location:

Pleasanton, CA - USA

Monthly Salary: Not Disclosed
Posted on: 8 hours ago
Vacancies: 1 Vacancy

Job Summary

Senior DevOps Operations Engineer (SRE)

- Serve as a lead member of the DevOps/SRE team responsible for system administration, monitoring, installation, configuration, maintenance, operations, and architecture across AWS cloud and on-premises environments.
- Implement and maintain production and pre-production environments using automation and monitoring tools to ensure high availability (99.9% uptime) and reliability.
- Design, deploy, and manage AWS solutions and services (e.g., EC2, S3, ECS, EKS, Kafka, RDS, CloudWatch, Dynatrace) with a focus on scalability, high availability, and disaster recovery.
- Build and maintain Infrastructure as Code (IaC) solutions using Terraform or AWS CDK.
- Set up and manage monitoring, alerting, and notification systems in AWS using CloudWatch/Dynatrace.
- Automate system and application monitoring, provisioning, and configuration management (Ansible, Python scripting).
- Support and troubleshoot 24/7 production environments, providing root cause analysis and post-incident reviews.
- Administer Linux systems, ensuring security, performance, and system updates.
- Collaborate with developers, engineers, and operations teams to support CI/CD pipelines and application deployments (Jenkins, Azure Pipelines, Git, GitLab, SVN).
- Provide technical guidance, mentorship, and knowledge transfer to internal engineering teams.
- Maintain comprehensive documentation for environments, procedures, and incidents.
- Ensure security compliance and address vulnerabilities in cloud and application environments.
- Support server maintenance, updates, antivirus requirements, and web farm infrastructure across multiple data centers.
- Participate in infrastructure design discussions, including virtualization, clustering, disaster recovery, and geographic redundancy.
- Hold a BS in Computer Science (or equivalent experience), with AWS DevOps and/or Solutions Architect certification strongly preferred.
- Bring at least 6 years of IT experience, including 4 years managing AWS environments, with expertise in automation, monitoring, reliability engineering, and Linux system administration.

Must Have Skills:
- Experience setting up AWS alerts/alarms/notifications (CloudWatch, Dynatrace)
- Experience with AWS services (Kafka, ECS, EKS)
- Infrastructure as Code (CDK, Terraform)
- Strong background in automation, monitoring, CI/CD, and site reliability
- 24/7 support and troubleshooting skills

Key Focus Areas:
AWS expertise, automation, monitoring, CI/CD, Linux system administration, high availability, troubleshooting, technical leadership, and documentation.

Key Skills

  • Change Management
  • Software Deployment
  • Cloud Infrastructure
  • High Availability
  • IaaS
  • Firewall
  • Linux
  • Middleware
  • JBoss
  • Network Architecture
  • Scripting
  • Technical Support