Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailLine of Service
AdvisoryIndustry/Sector
Not ApplicableSpecialism
AnalyticsManagement Level
ManagerJob Description & Summary
At PwC our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights enabling informed decisionmaking and driving business growth.Years of Experience: Candidates with 8 years of experience in architecting and delivering scalable big data pipelines using Apache Spark and Databricks on AWS.
Position Requirements:
Must Have:
Design build and maintain scalable data pipelines using Databricks and Apache Spark.
Good knowledge on Medallion Architecture in Databricks Lakehouse
Develop and optimize ETL/ELT processes for structured and unstructured data.
Implement Lakehouse architecture for efficient data storage processing and analytics.
Orchestrating ETL/ELT Pipelines: Design and manage data workflows using Databricks Workflows Jobs API.
Work with AWS Data Services S3 Lambda CloudWatch for seamless integration.
Performance Optimization: Optimize queries using pushdown capabilities and indexing strategies.
Implement data governance with Unity Catalog security policies and access controls.
Collaborate with data scientists analysts and engineers to enable advanced analytics.
Monitor troubleshoot and improve Databricks jobs and clusters.
Strong expertise in endtoend implementation of migration projects to AWS Cloud
Should be aware of Data Management concepts and Data Modelling
AWS & Python Expertise with handson cloud development.
Spark Performance Tuning: Core SQL and Streaming.
Orchestration: Airflow
Code Repositories: Git GitHub.
Strong in writing SQL
Cloud Data Migration: Deep understanding of processes.
Strong Analytical ProblemSolving & Communication Skills.
Good to have Knowledge / Skills:
Experience in Teradata DataStage SSIS Mainframe(Cobol JCL Zeke Scheduler)
Knowledge on Lakehouse Federation
Knowledge of Delta Lake.
Knowledge of Databricks Delta Live Table.
Streaming: Kafka Spark Streaming.
CICD : Jenkins
IaC & Automation: Terraform for Databricks deployment.
Knowlege on integrating 3party APIs to Databricks.
Knowledge of Transport & Mobility domain.
Professional and Educational Background:
BE / / MCA / / M.E / / MBA
Education (if blank degree and/or field of study not specified)
Degrees/Field of Study required:Degrees/Field of Study preferred:Certifications (if blank certifications not specified)
Required Skills
Optional Skills
Accepting Feedback Accepting Feedback Active Listening Agile Scalability Amazon Web Services (AWS) Analytical Thinking Apache Hadoop Azure Data Factory Coaching and Feedback Communication Creativity Data Anonymization Database Administration Database Management System (DBMS) Database Optimization Database Security Best Practices Data Engineering Data Engineering Platforms Data Infrastructure Data Integration Data Lake Data Modeling Data Pipeline Data Quality Data Transformation 23 moreDesired Languages (If blank desired languages not specified)
Travel Requirements
Available for Work Visa Sponsorship
Government Clearance Required
Job Posting End Date
Required Experience:
Manager
Full-Time