Software Engineer - Lakeflow PhD Candidates

Employer Active

1 Vacancy

Job Location

San Francisco, CA - USA

Monthly Salary

Not Disclosed

Job Description

RDQ326R93

Databricks is radically simplifying the entire data lifecycle, from ingestion to generative AI and everything in between. We're doing it cross-cloud with a unified platform serving over 10k customers, processing exabytes of data per day on 15 million VMs, and growing exponentially.

The Lakeflow team is looking for recent PhD graduates. The Lakeflow team includes products like Apache Spark Structured Streaming, Delta Live Tables (DLT), and Materialized Views. Apache Spark Structured Streaming is one of the world's most popular streaming engines. DLT makes it easy to build and manage reliable batch and streaming data pipelines that deliver high-quality data on the Databricks Lakehouse Platform. DLT helps data engineering teams simplify ETL (extract-transform-load) development and management with declarative pipeline development, automatic data testing, and deep visibility for monitoring and recovery. DLT optimizes pipeline execution through logical optimization, such as query transformations, and physical optimization, such as instance type selection and vertical/horizontal autoscaling.

Moreover, as part of DLT, we have a new Catalyst optimization layer, Enzyme, designed specifically to speed up the ETL process and make declarative ETL computation possible by incrementally computing and materializing the intermediate results. Enzyme can create and keep up to date a materialization of the results of a given query, stored in a Delta table. Enzyme does this by using a cost model to choose between a variety of techniques that borrow from traditional literature on the maintenance of materialized views, delta-to-delta streaming, and manual ETL patterns commonly used by our customers.

As part of the Lakeflow DLT team, you will have opportunities to design and implement in many areas that leapfrog existing systems:

What We Look For:

© Databricks 2017. All rights reserved. Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache Software Foundation.

Employment Type

Unclear
