Production Support Engineer / SRE
Manage daytoday production support activities for both web and mobile applications (Android and iOS).
Maintain overall health stability and safety of production systems and applications.
Identify operational risks and recommend mitigation strategies.
Improve application instrumentation logging alerting and monitoring capabilities.
Change & Release Management
Perform change management activities across test and production environments.
Execute code deployments while adhering to source code management release management and compliance policies.
Ensure proper governance and documentation of all deployment activities.
Collaboration & Stakeholder Engagement
Work closely with development teams and business partners to recommend solutions that combine internal development integration with other applications and vendor platforms.
Thrive in an agile environment by contributing to sprint activities and collaborative planning.
Communicate effectively with team members management infrastructure teams and other interface groups throughout the project lifecycle.
Knowledge Building & Leadership
Develop strong understanding of business processes and enterprise systems.
Provide coaching expertise and continuous feedback to help build the teams capability.
Share technical knowledge to support onboarding and skill growth.
Support & OnCall Availability
Participate in occasional weekend and afterhours support for critical issues or deployments.
Required Qualifications (MustHave)
Technical Troubleshooting & Monitoring
Handson experience troubleshooting application and database issues using:
Dynatrace
OpenShift (OCP)
Elastic / Kibana
MongoDB services running on Linux
IIS Web Servers on Windows
Kafka (basic to intermediate knowledge)
Strong proficiency with Following Database & Application Technologies:
Microsoft SQL Server
MongoDB
Microsoft C#
Solid ability to write read and troubleshoot SQL queries.
Knowledge of SQL database architecture performance monitoring and optimization.
Good understanding of Following Operating Systems:
Linux / Unix environments
Windows Server platforms
Automation & Infrastructure
Experience automating routine database or infrastructure operations.
Proficiency working with cloudhosted applications and services
Microsoft Azure
OpenShift (OCP)
AWS S3 bucket concepts
DevOps & SRE Practices
Experience with DevOps and Site Reliability Engineering tools such as: Helios UCD (UrbanCode Deploy) Jenkins Ansible
Knowledge of CI/CD pipelines release workflows and automation strategies.
Needs Experience with:
JavaScript
Java
Job scheduling using Stonebranch.
Monitoring tools such as Catchpoint and Aternity.
Productivity & Support Tools
Jira and Confluence for project/task management.
Firebase Google Play Console and Google Analytics for Android apps.
Apple App Store experience for iOS application operations.
Soft Skills & Frameworks
Strong analytical problemsolving and decisionmaking skills.
Solid understanding of ITIL service management practices.
Experience using ServiceNow for incident problem and change management.
Required Skills:
Production Support Engineer / SRE Work Mode: 4 days Onsite Production Support & Operations Manage day to day production support activities for both web and mobile applications (Android and iOS). Maintain overall health stability and safety of production systems and applications. Identify operational risks and recommend mitigation strategies. Improve application instrumentation logging alerting and monitoring capabilities. Change & Release Management Perform change management activities across test and production environments. Execute code deployments while adhering to source code management release management and compliance policies. Ensure proper governance and documentation of all deployment activities. Collaboration & Stakeholder Engagement Work closely with development teams and business partners to recommend solutions that combine internal development integration with other applications and vendor platforms. Thrive in an agile environment by contributing to sprint activities and collaborative planning. Communicate effectively with team members management infrastructure teams and other interface groups throughout the project lifecycle. Knowledge Building & Leadership Develop strong understanding of business processes and enterprise systems. Provide coaching expertise and continuous feedback to help build the teams capability. Share technical knowledge to support onboarding and skill growth. Support & On Call Availability Participate in occasional weekend and after hours support for critical issues or deployments. Required Qualifications (Must Have) Technical Troubleshooting & Monitoring Hands on experience troubleshooting application and database issues using: Dynatrace OpenShift (OCP) Elastic / Kibana MongoDB services running on Linux IIS Web Servers on Windows Kafka (basic to intermediate knowledge) Strong proficiency with Following Database & Application Technologies: Microsoft SQL Server MongoDB Microsoft C# Solid ability to write read and troubleshoot SQL queries. Knowledge of SQL database architecture performance monitoring and optimization. Good understanding of Following Operating Systems: Linux / Unix environments Windows Server platforms Automation & Infrastructure Experience automating routine database or infrastructure operations. Proficiency working with cloud hosted applications and services Microsoft Azure OpenShift (OCP) AWS S3 bucket concepts DevOps & SRE Practices Experience with DevOps and Site Reliability Engineering tools such as: Helios UCD (UrbanCode Deploy) Jenkins Ansible Knowledge of CI/CD pipelines release workflows and automation strategies. Needs Experience with: JavaScript Java Job scheduling using Stonebranch. Monitoring tools such as Catchpoint and Aternity. Productivity & Support Tools Jira and Confluence for project/task management. Firebase Google Play Console and Google Analytics for Android apps. Apple App Store experience for iOS application operations. Soft Skills & Frameworks Strong analytical problem solving and decision making skills. Solid understanding of ITIL service management practices. Experience using ServiceNow for incident problem and change management.
IT Services and IT Consulting