Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailDESCRIPTION:
Duties: Build automations using programming languages to reduce manual effort. Define and collect metrics from systems and applications using industrystandard applications or custombuilt processes. Design and develop visualizations of system health. Respond to incidents of system instability or unavailability diagnosing problems writing software to resolve issues and performing Root Cause Analyses to determine the reason for an outage. Perform system logging analysis to ensure application stability. Troubleshoot system and network issues to find potential areas for improvement.
QUALIFICATIONS:
Minimum education and experience required: Bachelors degree in Computer Engineering Computer Science Electronic Engineering Computer Information Systems or related field of study plus two 2 years of experience in the job offered or as Site Reliability Engineer Software Engineer Software Developer or related occupation.
Skills Required: This position requires experience with the following: Site reliability engineering including system monitoring log analysis incident management and blameless postmortems; implementing site reliability within an application or platform; at least one of the following: Python Java Spring Boot or .Net; observability using white and black box monitoring; observability using service level objective alerting; Grafana Dynatrace Prometheus Datadog and Splunk to perform telemetry collection for observability; continuous integration and continuous delivery tools including Jenkins and Terraform; container and container orchestration including ECS Kubernetes and Docker; troubleshooting common networking technologies and issues.
Job Location: 8181 Communications Pkwy Plano TX 75024.
Full-Time