Job Summary: Senior Dev Operations Engineer (SRE)
- Serve as a lead member of the DevOps/SRE team, responsible for system administration, monitoring, installation, configuration, maintenance, operations, and architecture across AWS cloud and on-premises environments.
- Implement and maintain production and pre-production environments using automation and monitoring tools to ensure high availability (99.9% uptime) and reliability.
- Design, deploy, and manage AWS solutions and services (e.g., EC2, S3, ECS, EKS, Kafka, RDS, CloudWatch, Dynatrace) with a focus on scalability, high availability, and disaster recovery.
- Build and maintain Infrastructure as Code (IaC) solutions using Terraform or AWS CDK.
- Set up and manage monitoring, alerting, and notification systems in AWS using CloudWatch/Dynatrace.
- Automate system and application monitoring, provisioning, and configuration management (Ansible, Python scripting).
- Support and troubleshoot 24/7 production environments, providing root cause analysis and post-incident reviews.
- Administer Linux systems, ensuring security, performance, and system updates.
- Collaborate with developers, engineers, and operations teams to support CI/CD pipelines and application deployments (Jenkins, Azure Pipelines, Git, GitLab, SVN).
- Provide technical guidance, mentorship, and knowledge transfer to internal engineering teams.
- Maintain comprehensive documentation for environments, procedures, and incidents.
- Ensure security compliance and address vulnerabilities in cloud and application environments.
- Support server maintenance, updates, antivirus requirements, and web farm infrastructure across multiple data centers.
- Participate in infrastructure design discussions, including virtualization, clustering, disaster recovery, and geographic redundancy.
- Hold a BS in Computer Science (or equivalent experience), with AWS DevOps and/or Solutions Architect certification strongly preferred.
- Bring at least 6 years of IT experience, including 4 years managing AWS environments, with expertise in automation, monitoring, reliability engineering, and Linux system administration.
Must Have Skills:
- Experience setting up AWS alerts/alarms/notifications (CloudWatch, Dynatrace)
- Experience with AWS services (Kafka, ECS, EKS)
- Infrastructure as Code (CDK, Terraform)
- Strong background in automation, monitoring, CI/CD, and site reliability
- 24/7 support and troubleshooting skills
Key Focus Areas:
AWS expertise, automation, monitoring, CI/CD, Linux system administration, high-availability troubleshooting, technical leadership, and documentation.