drjobs Site reliability engineering(SRE) Operations

Site reliability engineering(SRE) Operations

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

O'Fallon - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Job Description

Roles/responsibilities:

Incident Resolution - Review and resolve the Incidents arising from

o Operation Command Center Alerts

o Alerts from Enterprise Monitoring Operations (EM Operations).

o OMNIBUS and Splunk Alerts

Change Implementation - Deploying the application related artifacts to the production environments in the slotted approved release window

Reporting the issues with the deployments and coordinating with the Development Teams to fix any deployment issues

Work Orders - Resolve Work orders in form of Business/functional queries adhoc testing verification and validation etc from Regional product team and customer support teams.

Traffic Routing perform traffic routing in support of infrastructure maintenance

Perform Root Cause Analysis in detail for High severity Incidents and take action on fixing the underlying cause of the high severity issues. Take necessary preventive actions also.

Supporting the UAT testing by the Product team and Regional customer support team.

Configuring application/artifacts and supporting the new customer onboarding to the platform

Raise new change tickets and arrange for approvals including CAB approvals

Review and approve change tickets.

Work with customers on ad-hoc queries

Work with Development / Testing team for defect analysis (with Production simulated data)

Build automation scripts that reduce the number of Incidents and/or improves processes followed

Support customer to fill in the Post Incident Report (PIR) when any high impacting Incidents affecting customers occurred.

Participate / Initiate in War Room calls that impacts application availability or has a customer impact

Willing to work on shifts (Morning & Afternoon shifts) & Weekend support

Must have skills:

Unix Shell Scripting SQL

Troubleshooting using logs Splunk / Dynatrace

ITSM Incident Change and Problem Management

L2 Support experience is a must

Snowflake

Good to have skills:

PCF Cloud knowledge

CI/CD Jenkins Git & Maven

Tools Used:

Remedy Ticketing Tool

Rally (For Story and Bug Tracking)

Splunk and Dynatrace for Monitoring

WinSCP (file movement/ validation)

CyberArk/Putty

Toad Querying Tool for DB

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.