Site Reliability Engineer (SRE) – AWS

Not Interested
Bookmark
Report This Job

profile Job Location:

Toronto - Canada

profile Monthly Salary: CAD 10 - 10
profile Experience Required: 5years
Posted on: 18 days ago
Vacancies: 1 Vacancy

Job Summary

Job Location: Toronto
Required Skills: Digital : Amazon Web Service(AWS) Cloud ComputingDigital : Site Reliability Engineering (SRE)Dynatrace
Experience: 6-8 years

Job description: SRE Key Responsibilities
Design implement and maintain highly available and scalable systems on and manage CICD pipelines for automated deployments and and optimize Dynatrace monitoring for application performance and infrastructure health.
Implement observability practices (metrics logging tracing) to improve system reliability.
Collaborate with development and operations teams to automate processes and reduce manual interventions.
Perform incident management root cause analysis and drive continuous improvement.
Ensure security compliance and cost optimization in cloud environments
Required Skills Qualifications Strong experience with AWS services (EC2 S3 RDS Lambda VPC IAM CloudWatch).Hands-on expertise in Dynatrace for application and infrastructure monitoring.
Proficiency in CICD tools (Jenkins GitLab CI Azure DevOps or similar).
Knowledge of Infrastructure as Code (IaC) tools (Terraform AWS CloudFormation).
Experience with containerization and orchestration (Docker Kubernetes).
Familiarity with scripting languages (Python Bash).Solid understanding of SRE principles SLIs SLOs and error budgets.



Required Skills:

Experience (Years): 8-10

Job Location: TorontoRequired Skills: Digital : Amazon Web Service(AWS) Cloud ComputingDigital : Site Reliability Engineering (SRE)DynatraceExperience: 6-8 yearsJob description: SRE Key Responsibilities Design implement and maintain highly available and scalable systems on and manage CICD pipeline...
View more view more

Company Industry

IT Services and IT Consulting

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting