P1395
Why Databricks
Were growing fast and attracting the best talent in the world. Bricksters as we call ourselves are a special mix of smart curious quick thinkers. If you ask a Brickster what they love about working here youll likely hear about our culture.
We are seeking an experienced NOC Engineer to join our team. The successful candidate will be responsible for monitoring critical Databricks infrastructure and developing monitoring tools and alerting dashboards. They will also work closely with stakeholders to investigate and resolve incidents perform root cause analysis and propose solutions to increase the reliability and stability of the Databricks unified analytics platform.
The impact you will have here:
- Monitor critical infrastructure triage alerts to proactively identify incidents and work with stakeholders to resolve incidents.
- Investigate incidents and propose solutions to improve platform reliability and stability.
- Perform root cause analysis for recurring incidents and provide proactive solutions.
- Develop toolings or automate processes to improve platform monitoring and alerting.
- Contribute to software development efforts to improve overall service reliability and stability.
- Communicate effectively with internal stakeholders including executive staff to provide incident analysis.
- Participate in war rooms and temporary communication channels during outages.
- Demonstrate crossfunctional leadership and establish ownership of incidents and outages.
- Multitask on several incidents and/or projects
What are we looking for
- Minimum of 5 years of experience as a NOC SRE or DevOps engineer
- Strong knowledge of cloud technologies such as Azure AWS and GCP
- Handson experience with monitoring logging and alerting tools such as ELK Prometheus Grafana Pager Duty etc.
- Experience with containers and orchestration technologies such as Docker and Kubernetes.
- Proficiency in automation and scripting
- Linux systems administration skills.
- Excellent communication skills.
- Willingness to learn Databricks products
- Bachelors degree in Computer Science or a related field
Required Experience:
Senior IC