drjobs
SRE(Site Reliability Engineer)
drjobs
SRE(Site Reliability....
Enterprise Solutions Inc
drjobs SRE(Site Reliability Engineer) العربية

SRE(Site Reliability Engineer)

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs

Job Location

drjobs

others - USA

Monthly Salary

drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Req ID : 1755831

Position SRE(Site Reliability Engineer)

Type fulltime

Location Irving, TX

Skills - VMWare Horizon, Middleware, AWS, containers, CI/CD

Responsibilities:

  • Supporting the Ops teams to diagnose and provide solutions for systemic issues in the Software Robotics & Citizen Development platforms
  • Solve Operational problems
  • Work on-call shifts responding to alerts
  • Providing oversight into mission critical projects
  • Conduct operations workshops and increase operational effectiveness within the organization
  • Design and implement operational processes, deployment guidelines, and feedback loops to ensure successful deployment/operations of Software Robotics & Citizen development technology platforms
  • Bring service enhancements to the Software Robotics & Citizen development Services by means on standardized & improved monitoring and reporting capabilities
  • Perform monthly analytics of historic Ops tickets to enable proactive issue identification and enabling of issue avoidance or self-healing capabilities
  • Identify, develop, test, debug and implement improvements using Digital Transformation techniques to optimize operational efficiencies for Level 3 Ops functions and improve platform reliability
  • Persuade and influence others through strong and comprehensive communication and diplomacy skills
  • Set and maintain acceptable performance and availability thresholds (Service Level Objectives) by working closely with Ops and system engineers to drive adoption of modern reliability practices like SLI, SLOs, error budget policies, actionable alerts, self-healing, proactive capacity management and change/release management practices.
  • Monitor and help stabilize services in production. Standardize & Improve monitoring and reporting capabilities, dashboard creation using tools like Splunk, AppDynamics, etc
  • Perform Compliance Reviews of Vulnerabilities/EOVS
  • Engage on Gating/Production Readiness Review(PRR) for new services/platforms/products
  • Participate on CoB/DR Drills
  • Standardize change and release management
  • Proactively perform demand forecasting, capacity planning and anomaly detection
  • Review and engage on release of new products/services from testing through go-live
  • Standardize change and release management for Platforms
  • Contribute towards architecture reviews, making sure product designs are technically sound with contingencies thought through.
  • Technical documentation focused towards driving post-mortems, technical/process guidance, automation, operational improvement, etc.
  • Review current platforms & application architecture and engage with server & product engineers for setting up SDI for the existing platform with focused drive to enable new platforms.

Employment Type

Full Time

Company Industry

About Company

100 employees
Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.