Int. AIOps Site Reliability Engineer

PointClickCare

Posted on : 12-07-2025

Employer Active

1 Vacancy

Job Location

Mississauga - Canada

Monthly Salary

Not Disclosed

Job Description

At PointClickCare, our mission is simple: to help providers deliver exceptional care. And that starts with our people. As a leading health tech company that's founder-led and privately held, we empower our employees to push boundaries, innovate, and shape the future of healthcare.

With the largest long-term and post-acute care dataset and a Marketplace of 400 integrated partners, our platform serves over 30,000 provider organizations, making a real difference in millions of lives. We also reinvest a significant percentage of our revenue back into research and development, ensuring our employees have the resources to innovate and make a lasting impact. Recognized by Forbes as a top private cloud company and honored as one of Canada's Most Admired Corporate Cultures, we offer flexibility, growth opportunities, and meaningful work.

At PointClickCare, we empower our people to be the architects of a smarter healthcare future: one that is human-first and accelerated by AI to create meaningful and lasting change. Employees harness AI as a catalyst for creativity, productivity, and thoughtful decision-making. By integrating AI tools into our daily workflows, collaboration is enhanced, outcomes are improved, and every team member has the proficiency to maximize their impact. It all starts with our hiring practices, where we uncover AI expertise that complements our mission, and we continue to invest in training and development to nurture innovation throughout the employee journey.

Join us in redefining healthcare so it doesn't just survive, it thrives. To learn more about PointClickCare, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.

Int. AIOps Site Reliability Engineer

Role Summary:

We are seeking an innovative Intermediate Site Reliability Engineer to spearhead the transformation of our operational engineering landscape through AI-driven automation. This role will be pivotal in implementing AIOps capabilities, enabling proactive management of reliability, reducing toil, and accelerating incident resolution across our cloud-native application environment.

Key Responsibilities:

AI-Driven Observability & Monitoring:

Implement and optimize AI-based anomaly detection tools across critical applications to enhance system reliability.

Establish standardized tagging and metadata practices to improve data quality for enhanced AI observability and insights.

Automation & Self-Healing:

Design and implement automated runbooks and workflows triggered by AI insights to reduce manual intervention.

Develop self-healing mechanisms for common failure scenarios, including automated responses to AI-detected anomalies.

Incident Management & Root Cause Analysis:

Deploy AI/ML tools for automated root cause analysis and incident correlation to minimize downtime.

Leverage predictive analytics to reduce mean time to detect (MTTD) and mean time to resolution (MTTR).

Predictive Scaling & Resource Optimization:

Build and deploy AI models to forecast traffic and resource needs, facilitating proactive scaling and resource allocation.

Enhance cost efficiency through intelligent autoscaling and resource optimization.

Team Enablement & AI Maturity:

Conduct internal AIOps workshops and training sessions to elevate team capabilities.

Guide the team through an AIOps maturity model, identifying and closing capability gaps while tracking progress.

Troubleshooting and Problem Resolution:

Participate in an on-call rotation to respond to incidents, ensuring 24/7 system availability.

Lead incident response calls to troubleshoot complex system and application-level issues.

Engineer solutions to improve reliability and eliminate recurring incidents.

Required Skills & Experience:

Strong background in SRE practices, cloud-native architecture, and CI/CD pipelines.

Hands-on experience with observability platforms (e.g., Datadog, AppDynamics, Prometheus).

Proficiency in scripting and automation (Python, Bash, Terraform, etc.).

Familiarity with AI/ML concepts and their application in operational contexts.

Experience implementing or integrating AIOps platforms or frameworks.

Excellent problem-solving and troubleshooting skills and a proactive mindset.

Preferred Qualifications:

Bachelor's degree in Computer Science, Software Engineering, or a related discipline.

Minimum of 5 years of experience as a Site Reliability Engineer (SRE).

Prior relevant software development, architecture, or engineering experience (minimum 5 years).

Experience with Generative AI tools for incident response and documentation.

Exposure to predictive analytics and time-series forecasting.

Knowledge of Responsible AI principles and risk frameworks.

Involvement in AI-driven transformation initiatives or hackathons.

Strong experience in building and supporting cloud-based solutions, with Azure cloud infrastructure and services experience preferred.

Experience with virtualization and container solutions such as Docker and Kubernetes.

Familiarity with Databricks, Event Hub, Redis, Azure Service Bus, Azure Functions, and Tomcat.

Experience with Windows and Linux administration.

Experience with configuration management and deployment automation tools (e.g., Chef, Terraform, Puppet, Ansible, Jenkins, Spinnaker, ArgoCD, GitHub Actions).

Proficiency in programming languages such as Java, JavaScript, and Python.

Working knowledge of database technologies (e.g., SQL Server, MySQL, PostgreSQL).

Experience with monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack, AppDynamics, Datadog).

Strong debugging and optimization skills, with the ability to automate routine tasks.

Systematic problem-solving approach, with strong communication skills and a proactive mindset.

Knowledge of standard production practices, including change management and incident management (ITIL).

Experience building CI/CD pipelines and Blue/Green, Zero Downtime deployment strategies.

Troubleshooting experience with diverse hosting technologies, web servers, Java applications, operating systems, network components, and web browsers.

Nice to Have:

Proficiency in Linux, including experience compiling kernels, tracing syscalls, and understanding TCP.

Knowledge of open-source software and contributions to the open-source community.

Familiarity with Rhapsody and various healthcare messaging standards such as HL7 and FHIR.

Experience with AI-driven infrastructure management tools and platforms.

Participation in AI-focused conferences, workshops, or communities to stay abreast of emerging trends.

This role is an exciting opportunity for an Intermediate Site Reliability Engineer who is passionate about leveraging AI technologies to enhance the reliability and efficiency of cloud-native applications. If you are driven by innovation and thrive in a collaborative environment, we encourage you to apply and be part of our forward-thinking team.

$109,000 - $118,000 a year

PointClickCare Benefits & Perks:

Benefits starting from Day 1!

Retirement Plan Matching

Flexible Paid Time Off

Wellness Support Programs and Resources

Parental & Caregiver Leaves

Fertility & Adoption Support

Continuous Development Support Program

Employee Assistance Program

Allyship and Inclusion Communities

Employee Recognition and more!

It is the policy of PointClickCare to ensure equal employment opportunity without discrimination or harassment on the basis of race, religion, national origin, status, age, sex, sexual orientation, gender identity or expression, marital or domestic/civil partnership status, disability, veteran status, genetic information, or any other basis protected by law. PointClickCare welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates taking part in all aspects of the selection process. Please contact should you require any accommodations.

When you apply for a position, your information is processed and stored with Lever, in accordance with Lever's Privacy Policy. We use this information to evaluate your candidacy for the posted position. We also store this information and may use it in relation to future positions to which you apply, or which we believe may be relevant to you given your background. When we have no ongoing legitimate business need to process your information, we will either delete or anonymize it. If you have any questions about how PointClickCare uses or processes your information, or if you would like to ask to access, correct, or delete your information, please contact PointClickCare's human resources team:

PointClickCare is committed to Information Security. By applying to this position, if hired, you commit to following our information security policies and procedures and making every effort to secure confidential and/or sensitive information.

Employment Type

Full-Time
