Job Title: Site Reliability Engineer - IAM
Location: Montreal Quebec (Onsite)
Type: Contract
- Were seeking someone to join our IAM Cybersecurity team as a Cybersecurity Site Reliability Engineering (SRE) Specialist in Cyber to be part of a global reliability engineering team supporting critical Workforce Identity Management tools and Directory Services built on ForgeRock and Ping Identity systems.
- In the Technology division we leverage innovation to build the connections and capabilities that power our Firm enabling our clients and colleagues to redefine markets and shape the future of our communities.
- This is a Infrastructure Production Management & Reliability Engineering position at Director level which is part of the job family responsible for maintaining the stability and reliability of the organizations infrastructure systems ensuring optimal performance and availability to support business operations.
- Client is an industry leader in financial services known for mobilizing capital to help governments corporations institutions and individuals around the world achieve their financial goals. Interested in joining a team thats eager to create innovate and make an impact on the world..
What youll do in the role:
- Maintain LDAP Directory Services products such as Ping Directory and PingData Sync.
- Provide first-line support for Identity Management during large-scale outages including post-mortem pre-mortems and problem management with data-driven strategies and a code-first approach to problem solving.
- Prepare and execute change management activities often automating and creating tools where necessary.
- Participate in on-call rotations (weekday and weekend cycles).
- Collaborate with partner enterprise technology teams and provide support to stakeholders and our Level 2 operations team.
- Improving stability of Identity Management platforms by identifying and implementing alert automation and self-healing functions where possible.
- Performance and scalability
- Ensure systems can scale seamlessly to handle increased load and monitor the performance of our applications and infrastructure using our service level objectives (SLO) and service level indicators (SLI).
- Share responsibilities and knowledge across the team engaging with our community and stakeholders to gather feedback to improve our systems.
- Address security and compliance issues ensuring we are meeting industry standards and have implemented best practices for security and data protection with our systems.
What youll bring to the role:
- 3-5 years of experience in Identity and Access Management (IAM) or previous experience in IT Operations Reliability Engineering or DevOps.
- Exposure to Agile / DevOps environments.
- Knowledge of Site Reliability Engineering (SRE) principles and methodology.
- In depth experience with LDAP protocol and preferably experience in Ping Identity products. Intermediate knowledge of operating system administration Linux platforms (Red Hat preferred).
- Foundational knowledge of authentication protocols in the broader IAM domain such as OpenID Connect SAML Kerberos and Radius and multifactor authentication solutions like RSA SecurID Cisco Duo Security FIDO etc.
- Experience working in a large enterprise environment and have a general understanding of enterprise infrastructure concepts and troubleshooting including network storage web infrastructure middleware etc.
- Proficiency in at least one scripting language such as PowerShell Python Shell (bash).
- Familiarity with the Software Development Life Cycle (SDLC) and development environment tooling (GitHub Jenkins Visual Studio Code etc.).
- Familiarity with visualization and plant and incident management tools such as Splunk Grafana ServiceNow Jira Bitbucket PagerDuty PowerBI.
- Knowledge of enterprise security standards and concepts.