Production Support and Site Reliability Engineer (SRE)

Not Interested
Bookmark
Report This Job

profile Job Location:

Toronto - Canada

profile Monthly Salary: CAD 10 - 10
profile Experience Required: 5years
Posted on: 9 hours ago
Vacancies: 1 Vacancy

Job Summary

Production Support Engineer / SRE


Work Mode: 4 days Onsite

Production Support & Operations

Manage daytoday production support activities for both web and mobile applications (Android and iOS).

Maintain overall health stability and safety of production systems and applications.

Identify operational risks and recommend mitigation strategies.

Improve application instrumentation logging alerting and monitoring capabilities.


Change & Release Management

Perform change management activities across test and production environments.

Execute code deployments while adhering to source code management release management and compliance policies.

Ensure proper governance and documentation of all deployment activities.


Collaboration & Stakeholder Engagement

Work closely with development teams and business partners to recommend solutions that combine internal development integration with other applications and vendor platforms.

Thrive in an agile environment by contributing to sprint activities and collaborative planning.

Communicate effectively with team members management infrastructure teams and other interface groups throughout the project lifecycle.


Knowledge Building & Leadership

Develop strong understanding of business processes and enterprise systems.

Provide coaching expertise and continuous feedback to help build the teams capability.

Share technical knowledge to support onboarding and skill growth.


Support & OnCall Availability

Participate in occasional weekend and afterhours support for critical issues or deployments.


Required Qualifications (MustHave)

Technical Troubleshooting & Monitoring


Handson experience troubleshooting application and database issues using:

Dynatrace

OpenShift (OCP)

Elastic / Kibana

MongoDB services running on Linux

IIS Web Servers on Windows

Kafka (basic to intermediate knowledge)


Strong proficiency with Following Database & Application Technologies:

Microsoft SQL Server

MongoDB

Microsoft C#


Solid ability to write read and troubleshoot SQL queries.

Knowledge of SQL database architecture performance monitoring and optimization.


Good understanding of Following Operating Systems:

Linux / Unix environments

Windows Server platforms


Automation & Infrastructure

Experience automating routine database or infrastructure operations.

Proficiency working with cloudhosted applications and services

Microsoft Azure

OpenShift (OCP)

AWS S3 bucket concepts


DevOps & SRE Practices

Experience with DevOps and Site Reliability Engineering tools such as: Helios UCD (UrbanCode Deploy) Jenkins Ansible

Knowledge of CI/CD pipelines release workflows and automation strategies.


Needs Experience with:

JavaScript


Java

Job scheduling using Stonebranch.

Monitoring tools such as Catchpoint and Aternity.


Productivity & Support Tools

Jira and Confluence for project/task management.

Firebase Google Play Console and Google Analytics for Android apps.

Apple App Store experience for iOS application operations.


Soft Skills & Frameworks

Strong analytical problemsolving and decisionmaking skills.

Solid understanding of ITIL service management practices.

Experience using ServiceNow for incident problem and change management.




Required Skills:

Production Support Engineer / SRE Work Mode: 4 days Onsite Production Support & Operations Manage day to day production support activities for both web and mobile applications (Android and iOS). Maintain overall health stability and safety of production systems and applications. Identify operational risks and recommend mitigation strategies. Improve application instrumentation logging alerting and monitoring capabilities. Change & Release Management Perform change management activities across test and production environments. Execute code deployments while adhering to source code management release management and compliance policies. Ensure proper governance and documentation of all deployment activities. Collaboration & Stakeholder Engagement Work closely with development teams and business partners to recommend solutions that combine internal development integration with other applications and vendor platforms. Thrive in an agile environment by contributing to sprint activities and collaborative planning. Communicate effectively with team members management infrastructure teams and other interface groups throughout the project lifecycle. Knowledge Building & Leadership Develop strong understanding of business processes and enterprise systems. Provide coaching expertise and continuous feedback to help build the teams capability. Share technical knowledge to support onboarding and skill growth. Support & On Call Availability Participate in occasional weekend and after hours support for critical issues or deployments. Required Qualifications (Must Have) Technical Troubleshooting & Monitoring Hands on experience troubleshooting application and database issues using: Dynatrace OpenShift (OCP) Elastic / Kibana MongoDB services running on Linux IIS Web Servers on Windows Kafka (basic to intermediate knowledge) Strong proficiency with Following Database & Application Technologies: Microsoft SQL Server MongoDB Microsoft C# Solid ability to write read and troubleshoot SQL queries. Knowledge of SQL database architecture performance monitoring and optimization. Good understanding of Following Operating Systems: Linux / Unix environments Windows Server platforms Automation & Infrastructure Experience automating routine database or infrastructure operations. Proficiency working with cloud hosted applications and services Microsoft Azure OpenShift (OCP) AWS S3 bucket concepts DevOps & SRE Practices Experience with DevOps and Site Reliability Engineering tools such as: Helios UCD (UrbanCode Deploy) Jenkins Ansible Knowledge of CI/CD pipelines release workflows and automation strategies. Needs Experience with: JavaScript Java Job scheduling using Stonebranch. Monitoring tools such as Catchpoint and Aternity. Productivity & Support Tools Jira and Confluence for project/task management. Firebase Google Play Console and Google Analytics for Android apps. Apple App Store experience for iOS application operations. Soft Skills & Frameworks Strong analytical problem solving and decision making skills. Solid understanding of ITIL service management practices. Experience using ServiceNow for incident problem and change management.

Production Support Engineer / SREWork Mode: 4 days OnsiteProduction Support & Operations Manage daytoday production support activities for both web and mobile applications (Android and iOS). Maintain overall health stability and safety of production systems and applications. ...
View more view more

Company Industry

IT Services and IT Consulting

Key Skills

  • History
  • Insurance Management
  • JDE
  • Administration Office
  • Catering Operations