As a hands-on SRE Manager youll lead by exampleactively driving operational excellence contributing to code and ensuring system reliability. You will be deeply involved in incident response across complex distributed data platforms designed to support data exploration analytics and reporting solutions. These platforms operate at the unique intersection of high data volume and hybrid infrastructure spanning both cloud and on-premise environments.
Hands-on experience supporting and maintaining applications in cloud or hybrid environments
Expertise in cloud-native services including ETL frameworks (Apache Spark Flink) and messaging systems (Kafka)
Proven ability to lead incident response perform root cause analysis and drive system reliability improvements
Bachelors degree or equivalent with 10 years of experience in the SRE domain and at least 2 years in a management role focused on leading hiring developing and building teams
Hands-on experience supporting enterprise data systems on distributed architectures
Exposure to data visualization tools such as Tableau Business Objects ThoughtSpot with experience supporting and troubleshooting issues related to dashboards and reports
Experience with modern & distributed databases such as Snowflake Cassandra SingleStore and SAP HANA
Experience using GenAI or automation tools for issue detection alerting or remediation
Solid understanding of system design data structures and incident management best practices
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.