drjobs
Hiring for Azure Reliability engineer at Dallas TX
drjobs
Hiring for Azure Rel....
Siri InfoSolutions Inc
drjobs Hiring for Azure Reliability engineer at Dallas TX العربية

Hiring for Azure Reliability engineer at Dallas TX

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs

Job Location

drjobs

- USA

Monthly Salary

drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Req ID : 2610640
Hi
Hiring for Azure Reliability engineer at Dallas TX
Job Description
Must Have Technical/Functional Skills
We are seeking a Site Reliability Engineer (SRE) Support Manager who could manage a team responsible to build deploy operate sustain and grow the software systems that build scale monitor secure manage and automate Frontiers systems.
The SRE support manager will drive the stability and sustainability of these nextgeneration systems and discover innovative ways to scale and operate them reliably as we expand.
In this role you will work with Systems and SRE Engineers to create proactive engineering mechanisms that will enable your team to manage the health of a number of distributed systems and the software stacks that run on them.
Experience in Azure DevOps. The associate must have the knowledge on the monitoring tools like Dynatrace ELK Logstash Firebase
Strong communications skills both oral and written
Extensive experience driving production incident bridge calls involving multidisciplinary IT / Ops teams.
Developing synthetic monitoring and SRE dashboards
Automation
Azure DevOps tooling and infrastructure
This role requires on call support and willingness to support during weekends / after office hours based on shift requirements.
Experience Required API Monitoring
Roles & Responsibilities
Observability and Availability
o Identify the root causes of issues by working with Engineering team and leveraging existing APM tools (viz. Dynatrace ELK Logstash Firebase)
o Provide a single view of health of IT enterprise systems by implementing application and system health dashboards.
o Observability Probe Injection
o Implementation of alerts dashboards and instrumentation for timely identification of application and platform performance and reliability
Application KPI Monitoring
o Realtime monitoring of application performance including response time throughput and error rates using Logstash ELK Decibels Firebase Dynatrace.
o Provide alerts and notifications when anomalies are detected along with recommendations for how to resolve the issue.
o Provide alerts and notifications when potential issues with user experience are detected and recommendations for improving user experience.
o Provide Alerts and notifications when availability or reliability falls below agreedupon thresholds and recommendations to restore the system/service health.
o Automated detection alerting and integration of tools to reduce toil and enable seamless triage processes.
o Identify manual repetitive and errorprone tasks related to application management and prioritization for reliability enhancements.
Release Management activities
o Delivery execution of Change Management and Release Management activities that will include planning proper sequence of events defining backout strategy planning regression cycle.
o Measure KPIs before and after release to track any adverse impacts.
o Monitor specific dashboards based on what application changes are going live.
o Participate in stability calls and outage calls representing Digital.
DevOps Activities:
o Azure DevOps Platform Support
o Non Development and NonProduction Environment Support
o Build and deployment automation.
o CI/CD pipeline templatization and support
o Deploy updates and fixes using CI/CD pipelines.
o Infrastructure as Code (IaC) and Application DevOps pipelines creation
o Define high level CICD roll out plan.
o Continue identifying opportunities of automation for quick and error free deployment/releases.
o Perform Azure DevOps Admin task.
o Update existing CICD pipelines if required
o Onboarding new applications in ADO
Generic Managerial Skills
Customer Management
Offshore Team coordination
Thanks & regards
Dushyanth Sr. IT Recruiter Email:

Employment Type

Full Time

Company Industry

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting
Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.