Job Location: Toronto
Required Skills: Digital : Amazon Web Service(AWS) Cloud ComputingDigital : Site Reliability Engineering (SRE)Dynatrace
Experience: 6-8 years
Job description: SRE Key Responsibilities
Design implement and maintain highly available and scalable systems on and manage CICD pipelines for automated deployments and and optimize Dynatrace monitoring for application performance and infrastructure health.
Implement observability practices (metrics logging tracing) to improve system reliability.
Collaborate with development and operations teams to automate processes and reduce manual interventions.
Perform incident management root cause analysis and drive continuous improvement.
Ensure security compliance and cost optimization in cloud environments
Required Skills Qualifications Strong experience with AWS services (EC2 S3 RDS Lambda VPC IAM CloudWatch).Hands-on expertise in Dynatrace for application and infrastructure monitoring.
Proficiency in CICD tools (Jenkins GitLab CI Azure DevOps or similar).
Knowledge of Infrastructure as Code (IaC) tools (Terraform AWS CloudFormation).
Experience with containerization and orchestration (Docker Kubernetes).
Familiarity with scripting languages (Python Bash).Solid understanding of SRE principles SLIs SLOs and error budgets.