Job Title: - Lead AI SRE/ AI Ops Engineer
Duration: - 12 Months
Location: - Fremont CA (4 days Onsite Need Local)
Experience
- Total 13 years of experience required and around 5 years of hands-on experience in IT operations cloud operations SRE platform support or production engineering
- Proven experience in production support incident handling automation and operational troubleshooting
- Experience working with monitoring observability scripting and release validation
- Exposure to AIOps AI-assisted operations or automation-led support models is strongly required.
Key Responsibilities
- Lead the adoption and operationalization of the SRE agent across support and reliability workflows
- Translate existing scripts runbooks SOPs and operational knowledge into agent-compatible workflows
- Work with teams to identify which use cases should be automated semi-automated or remain human-driven
- Validate agent outputs recommendations and remediation steps before operational use
- Support production releases release validation smoke testing and post-release health checks
- Drive troubleshooting during incidents and ensure proper root cause analysis and follow-through
- Improve alert handling event correlation and operational response patterns
- Coordinate with engineering operations and platform teams on onboarding and process changes
- Mentor junior engineers and guide them on workflow design validation and operational execution
- Maintain high-quality documentation runbooks and operational standards
Required Technical Skills
- Strong hands-on scripting experience in PowerShell Python Shell/Bash
- Experience with monitoring alerting logs dashboards and incident workflows
- Good understanding of production support processes release support and validation practices
- Experience with cloud platforms preferably Azure
- Familiarity with ITSM/ticketing tools such as ServiceNow Jira or similar
- Ability to understand existing operational scripts and modernize them into scalable workflows
- Experience with APIs integrations or automation pipelines is preferred
- Experience to Kubernetes / AKS/AI tools - ChatGPT copilot.
Thanks and regards
Rohit Raj
Momento USA Exceeding Customer Expectations
440 Benigno Blvd Unit#A-5 2nd Floor. Bellmawr NJ 08031
Interstate Business Park
Office: Ext 1027
Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race color religion sex pregnancy sexual orientation gender identity national origin age protected veteran status or disability status.
Job Title: - Lead AI SRE/ AI Ops Engineer Duration: - 12 Months Location: - Fremont CA (4 days Onsite Need Local) Experience Total 13 years of experience required and around 5 years of hands-on experience in IT operations cloud operations SRE platform support or production engineering Proven...
Job Title: - Lead AI SRE/ AI Ops Engineer
Duration: - 12 Months
Location: - Fremont CA (4 days Onsite Need Local)
Experience
- Total 13 years of experience required and around 5 years of hands-on experience in IT operations cloud operations SRE platform support or production engineering
- Proven experience in production support incident handling automation and operational troubleshooting
- Experience working with monitoring observability scripting and release validation
- Exposure to AIOps AI-assisted operations or automation-led support models is strongly required.
Key Responsibilities
- Lead the adoption and operationalization of the SRE agent across support and reliability workflows
- Translate existing scripts runbooks SOPs and operational knowledge into agent-compatible workflows
- Work with teams to identify which use cases should be automated semi-automated or remain human-driven
- Validate agent outputs recommendations and remediation steps before operational use
- Support production releases release validation smoke testing and post-release health checks
- Drive troubleshooting during incidents and ensure proper root cause analysis and follow-through
- Improve alert handling event correlation and operational response patterns
- Coordinate with engineering operations and platform teams on onboarding and process changes
- Mentor junior engineers and guide them on workflow design validation and operational execution
- Maintain high-quality documentation runbooks and operational standards
Required Technical Skills
- Strong hands-on scripting experience in PowerShell Python Shell/Bash
- Experience with monitoring alerting logs dashboards and incident workflows
- Good understanding of production support processes release support and validation practices
- Experience with cloud platforms preferably Azure
- Familiarity with ITSM/ticketing tools such as ServiceNow Jira or similar
- Ability to understand existing operational scripts and modernize them into scalable workflows
- Experience with APIs integrations or automation pipelines is preferred
- Experience to Kubernetes / AKS/AI tools - ChatGPT copilot.
Thanks and regards
Rohit Raj
Momento USA Exceeding Customer Expectations
440 Benigno Blvd Unit#A-5 2nd Floor. Bellmawr NJ 08031
Interstate Business Park
Office: Ext 1027
Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race color religion sex pregnancy sexual orientation gender identity national origin age protected veteran status or disability status.
View more
View less