At PointClickCare, our mission is simple: to help providers deliver exceptional
care. And that starts with our people. As a leading health tech company that’s
founder-led and privately held, we empower our employees to push boundaries,
innovate, and shape the future of healthcare.
With the largest long-term and post-acute care dataset and a Marketplace of 400+
integrated partners, our platform serves over 30,000 provider organizations,
making a real difference in millions of lives. We also reinvest a significant
percentage of our revenue back into research and development, ensuring our
employees have the resources to innovate and make a lasting impact. Recognized
by Forbes as a top private cloud company and honored as one of Canada’s Most
Admired Corporate Cultures, we offer flexibility, growth opportunities, and
meaningful work.
At PointClickCare, we empower our people to be the architects of a smarter
healthcare future, one that is human-first and accelerated by AI to create
meaningful and lasting change. Employees harness AI as a catalyst for
creativity, productivity, and thoughtful decision-making. By integrating AI
tools into our daily workflows, we enhance collaboration, improve outcomes, and
give every team member the proficiency to maximize their impact. It all starts
with our hiring practices, where we uncover AI expertise that complements our
mission, and we continue to invest in training and development to nurture
innovation throughout the employee journey.
Join us in redefining healthcare — so it doesn’t just survive, it thrives. To
learn more about PointClickCare, check out Life at PointClickCare
[https://pointclickcare.com/life-at-pointclickcare/] and connect with us on
Glassdoor
[https://www.glassdoor.ca/Overview/Working-at-PointClickCare-EI_IE452666.11,25.htm]
and LinkedIn [https://www.linkedin.com/company/pointclickcare/].
Travel to Office expectations
For Remote Roles: As this role is remote, there will be in-office events that
require travel to and from the Mississauga and/or Salt Lake City office. These
include, but are not limited to, onboarding, team events, and semi-annual and
annual team meetings.
For Hybrid Roles: As this role is hybrid, you are expected to reside within
commutable distance of the office/location specified in the job listing.
In-office expectations include, but are not limited to, weekly/bi-weekly/monthly
events in the office with your specific team. This is a requirement for this role.
Intermediate AIOps Site Reliability Engineer
Role Summary:
We are seeking an innovative Intermediate Site Reliability Engineer to spearhead
the transformation of our operational engineering landscape through AI-driven
automation. This role will be pivotal in implementing AIOps capabilities,
enabling proactive management of reliability, reducing toil, and accelerating
incident resolution across our cloud-native application environment.
Key Responsibilities:
AI-Driven Observability & Monitoring:
•Implement and optimize AI-based anomaly detection tools across critical
applications to enhance system reliability.
•Establish standardized tagging and metadata practices to improve data quality
for enhanced AI observability and insights.
Automation & Self-Healing:
•Design and implement automated runbooks and workflows triggered by AI insights
to reduce manual intervention.
•Develop self-healing mechanisms for common failure scenarios, including
automated responses to AI-detected anomalies.
Incident Management & Root Cause Analysis:
•Deploy AI/ML tools for automated root cause analysis and incident correlation
to minimize downtime.
•Leverage predictive analytics to reduce mean time to detect (MTTD) and mean
time to resolution (MTTR).
Predictive Scaling & Resource Optimization:
•Build and deploy AI models to forecast traffic and resource needs, facilitating
proactive scaling and resource allocation.
•Enhance cost efficiency through intelligent autoscaling and resource
optimization.
Team Enablement & AI Maturity:
•Conduct internal AIOps workshops and training sessions to elevate team
capabilities.
•Guide the team through an AIOps maturity model, identifying and closing
capability gaps while tracking progress.
Troubleshooting and Problem Resolution:
•Participate in an on-call rotation to respond to incidents, ensuring 24/7
system availability.
•Lead incident response calls to troubleshoot complex system and
application-level issues.
•Engineer solutions to improve reliability and eliminate recurring incidents.
Required Skills & Experience:
•Strong background in SRE practices, cloud-native architecture, and CI/CD
pipelines.
•Hands-on experience with observability platforms (e.g., Datadog, AppDynamics,
Prometheus).
•Proficiency in scripting and automation (Python, Bash, Terraform, etc.).
•Familiarity with AI/ML concepts and their application in operational contexts.
•Experience implementing or integrating AIOps platforms or frameworks.
•Excellent problem-solving and troubleshooting skills and a proactive mindset.
Preferred Qualifications:
•Bachelor’s degree in Computer Science, Software Engineering, or a related
discipline.
•Minimum of 5 years of experience as a Site Reliability Engineer (SRE).
•At least 5 years of prior relevant software development, architecture, or
engineering experience.
•Experience with Generative AI tools for incident response and documentation.
•Exposure to predictive analytics and time-series forecasting.
•Knowledge of Responsible AI principles and risk frameworks.
•Involvement in AI-driven transformation initiatives or hackathons.
•Strong experience in building and supporting cloud-based solutions, with Azure
cloud infrastructure and services experience preferred.
•Experience with virtualization and container solutions such as Docker and
Kubernetes.
•Familiarity with Databricks, Event Hub, Redis, Azure Service Bus, Azure
Functions, and Tomcat.
•Experience with Windows and Linux administration.
•Experience with configuration management and deployment automation tools (e.g.,
Chef, Terraform, Puppet, Ansible, Jenkins, Spinnaker, ArgoCD, GitHub Actions).
•Proficiency in programming languages such as Java, JavaScript, and Python.
•Working knowledge of database technologies (e.g., SQL Server, MySQL,
PostgreSQL).
•Experience with monitoring and logging solutions (e.g., Prometheus, Grafana,
ELK stack, AppDynamics, Datadog).
•Strong debugging and optimization skills, with the ability to automate routine
tasks.
•Systematic problem-solving approach with strong communication skills and a
proactive mindset.
•Knowledge of standard production practices, including change management and
incident management (ITIL).
•Experience building CI/CD pipelines and using blue/green and zero-downtime
deployment strategies.
•Troubleshooting experience with diverse hosting technologies, web servers, Java
applications, operating systems, network components, and web browsers.
Nice to Have:
•Proficiency in Linux, including experience compiling kernels, tracing syscalls,
and understanding TCP.
•Knowledge of open-source software and contributions to the open-source
community.
•Familiarity with Rhapsody and various healthcare messaging standards, such as
HL7 and FHIR.
•Experience with AI-driven infrastructure management tools and platforms.
•Participation in AI-focused conferences, workshops, or communities to stay
abreast of emerging trends.
This role is an exciting opportunity for an Intermediate Site Reliability
Engineer who is passionate about leveraging AI technologies to enhance the
reliability and efficiency of cloud-native applications. If you are driven by
innovation and thrive in a collaborative environment, we encourage you to apply
and be part of our forward-thinking team.
$109,000 - $118,000 a year
PointClickCare Benefits & Perks:
Benefits starting from Day 1!
Retirement Plan Matching
Flexible Paid Time Off
Wellness Support Programs and Resources
Parental & Caregiver Leaves
Fertility & Adoption Support
Continuous Development Support Program
Employee Assistance Program
Allyship and Inclusion Communities
Employee Recognition … and more!
It is the policy of PointClickCare to ensure equal employment opportunity
without discrimination or harassment on the basis of race, religion, national
origin, status, age, sex, sexual orientation, gender identity or expression,
marital or domestic/civil partnership status, disability, veteran status,
genetic information, or any other basis protected by law. PointClickCare
welcomes and encourages applications from people with disabilities.
Accommodations are available upon request for candidates taking part in all
aspects of the selection process. Please contact recruitment@pointclickcare.com
should you require any accommodations.
When you apply for a position, your information is processed and stored with
Lever, in accordance with Lever’s Privacy Policy
[https://www.lever.co/privacy-policy]. We use this information to evaluate your
candidacy for the posted position. We also store this information, and may use
it in relation to future positions to which you apply, or which we believe may
be relevant to you given your background. When we have no ongoing legitimate
business need to process your information, we will either delete or anonymize
it. If you have any questions about how PointClickCare uses or processes your
information, or if you would like to ask to access, correct, or delete your
information, please contact PointClickCare’s human resources team:
recruitment@pointclickcare.com.
PointClickCare is committed to Information Security. By applying to this
position, you commit, if hired, to following our information security policies
and procedures and making every effort to secure confidential and/or sensitive
information.