"Data Engineer"

Employer Active

1 Vacancy

Job Location

Columbus - USA

Monthly Salary

Not Disclosed

Job Description

Job Title: Data Engineer (Hadoop to Databricks Migration)
Location: Columbus, OH / Jersey City, NJ
Duration: Contract
Experience: 8 years (minimum 1 year in Databricks and PySpark)

We are seeking a highly skilled Data Engineer with hands-on experience in Hadoop to Databricks migration projects. The ideal candidate will have a strong background in AWS cloud platforms, Databricks, PySpark, and data observability/monitoring using Splunk.

You will play a critical role in modernizing big data platforms by transforming legacy Hadoop systems into scalable and efficient Databricks-based pipelines.

Key Responsibilities:
  • Lead or contribute to the migration of data pipelines and jobs from the Hadoop ecosystem to Databricks on AWS.
  • Develop and optimize PySpark jobs for data ingestion, transformation, and processing.
  • Build scalable and efficient data solutions on the Databricks platform.
  • Collaborate with data architects, analysts, and business teams to ensure data models and pipelines meet business requirements.
  • Monitor and troubleshoot production workflows using Splunk or other observability tools.
  • Ensure data quality, performance tuning, and governance standards are met during the migration process.
Required Skills:
  • Strong hands-on experience with Databricks and PySpark.
  • Proven experience with the Hadoop ecosystem and its components (Hive, HDFS, etc.).
  • Proficiency in AWS services (S3, EMR, Glue, Lambda, etc.).
  • Experience in creating and managing ETL/ELT pipelines in a distributed environment.
  • Knowledge of Splunk for log monitoring, alerting, and operational insights.
  • Strong understanding of big data architecture, performance optimization, and cost management.
Preferred Qualifications:
  • Experience in enterprise-scale data lake or lakehouse implementations.
  • Familiarity with CI/CD practices for data pipelines.
  • Strong analytical and problem-solving skills.
  • Excellent communication and collaboration abilities.

Employment Type

Full-time

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer. We always make certain that our clients do not endorse any request for money payments, so we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via the Contact Us page.