NBCUniversal has an opening for a Site Reliability Engineer focused primarily on, but not limited to, supporting live channel distribution on the Video Streaming Engineering team within the NBCU Operations and Technology division. This position will be part of a dedicated 24x7 team tasked with supporting and maintaining distribution systems, including diagnosing and preventing on-air issues before they arise. Additionally, this position will support the operation of future solutions as new technologies continue to change the broadcast environment. Workflows include distribution of live linear channels to OTT partners and Peacock, and supporting the systems that contribute live sources to the cloud and to facilities across the country.
Responsibilities:
- Investigate issues within broadcast systems and their integration points to find the root cause of problems or systemic issues.
- As a Level 2 resource, drive and own investigations related to broadcast issues and report findings to leadership and operations in a timely manner.
- Follow up with team members and third-party vendors when issues cannot be resolved, and press vendors for root cause and solutions where possible.
- Create comprehensive documentation detailing each issue encountered, explaining its root cause and the steps for effective resolution.
- Assist in the deployment and testing of patches or fixes from vendors in both the Development and Production environments, through completion and to the satisfaction of the Operations team.
- Assist in the design, analysis, or evaluation of assigned projects using sound engineering principles and adhering to business standards, practices, procedures, and product/program requirements.
- Support and participate in on-air systems integration and on-air rollout.
- Provide 24x7 on-air systems support and daily operations support; some on-call support may be required from time to time during on-air rollout and special broadcast events.
- Attend daily maintenance and operations review calls to report to leadership and Operations on findings from new and open issues, their potential fixes, and planned deployments of those fixes.
Qualifications :
Basic Requirements:
- BS in Engineering/Computer Science or related field
- A passion for investigating issues, driving towards resolutions, and effective problem solving
- 5 years of DevOps/SRE experience in the technology sector delivering production-quality software or software-defined infrastructure in a high-traffic environment running on a cloud hosting platform (AWS preferred)
- 5 years of experience in a support/analysis role
- Experience with deployment automation within AWS-hosted services (CloudFormation, Terraform, Ansible)
- Familiarity with containerization and orchestration services such as Kubernetes and Docker
- Familiarity with CI/CD orchestration tools (e.g. GitHub Actions or Jenkins)
- Experience with CI/CD build and deployment practices
- 5 years of Linux System Administration
- 5 years of experience coding in Go, Python, Ruby, Java, or shell languages
- Experience in designing, analyzing, and building automation and tools for large-scale systems
- Professional experience using modern log/metric aggregation software (e.g. CloudWatch, Elasticsearch, Kibana, Splunk, Grafana)
- Experience and comfort with continuous delivery/frequent releases of code to production
- A methodical and logical approach to reasoning about problems and system interactivity
- Willingness and ability to prioritize business needs to meet short-term demands
- Working knowledge of the OSI model; comfortable troubleshooting networking issues
- An unwillingness to tolerate user-facing downtime
Desired Characteristics:
- 3 years of experience in the Media & Entertainment industry
- 3 years of experience in 24x7 production environments
- 3 years of experience supporting IT/Broadcast Systems
- 5 years of customer-facing experience
- Experience with live TV broadcasting, OTT streaming, codecs, and ARQ technologies a plus
Additional Requirements:
- Fully Remote: This position has been designated as fully remote, meaning that the position is expected to contribute from a non-NBCUniversal worksite, most commonly an employee's residence.
This position is eligible for company-sponsored benefits, including medical, dental, and vision insurance, 401(k), paid leave, tuition reimbursement, and a variety of other discounts and perks. Learn more about the benefits offered by NBCUniversal by visiting the Benefits page of the Careers website. Salary range: $110,000 - $145,000
We are accepting applications for this position on an ongoing basis.
Additional Information :
As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision. NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access as a result of your disability. You can request reasonable accommodations by emailing
For LA County and City Residents Only: NBCUniversal will consider for employment qualified applicants with criminal histories, or arrest or conviction records, in a manner consistent with relevant legal requirements, including the City of Los Angeles Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.
Remote Work :
Yes
Employment Type :
Full-time