drjobs Lead Site Reliablity Engineer

Lead Site Reliablity Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Chennai - India

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Reporting to: Sr Manager Availability Management

Office Location: Chennai India

Flexible Working: Hybrid (Part Office/Part Home)

Cloud Site Reliability Engineer Responsibilities

  • Onboard internal customers to our 24x7 Applications Support and Enterprise Status Page services

  • Be involved with creating an SRE culture globally by defining monitoring strategies and best practices at the organization.

  • Monitor application performance and have the ability to provide recommendations on increasing the observability of applications and platforms.

  • Play an important role in the Continual Service Improvement process identifying and driving improvement

  • Be instrumental to developing standards guides to assist the business in maximizing their use of common tools .

  • Participate in code peer reviews and enforce quality gates to ensure best practices are followed.

  • Apply automation to tasks which would benefit from this. Automating repetitive tasks and deploying monitors via code are core examples.

  • Document knowledge gained from engagements in the forms of runbooks and other information critical to incident response.

  • Exploring and applying Artificial Intelligence to enhance operational processes/procedures


ShouldHaves Skills & Experience

  • Strong skills with modern monitoring tools and demonstrable knowledge of APM RUM and/or synthetic testing.

  • Experience working with observability tools such as Datadog NewRelic Splunk CloudWatch AzureMonitor

  • Experience with the OpenTelemetry (OTEL) Standard

  • Working knowledge of at least one programming language such as Python JavaScript (NodeJS etc) Golang or others.

  • Strong experience with IaC tools such as Terraform and Cloudformation.

  • Experience with cloud environments especially AWS and/or Azure.

  • Good customer interaction skills and able to understand their needs and expectations.

  • Strength in conviction able to encourage adoption to a wide audience but comfortable with mandating where necessary

  • Experience with code quality tools such as SonarQube.

  • Knowledge on code linters tools of various programming languages.

  • Experience with CI/CD tools. Such as Bamboo Jenkins Azure DevOps Github actions.

  • ITIL experience with basic understanding on incident management problem management and change management.

NicetoHaves Skills & Experience

  • Any cloud certification

  • ITIL certifications

  • Experience with ITSM tools

  • Experience using OnCall Management Tooling

No travel required


Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.