Knowledge of how to create and improve data sets big data data pipelines and infrastructures.
Being able to do root cause analysis on data and procedures both internally and outside to find possibilities for improvement and provide clarification.
Outstanding analytical abilities related to coping with unstructured datasets.
Capacity to develop procedures that support task management data structures dependence and metadata.
Knowledge of data models data mining and segmentation techniques.
Understanding of programming languages like Java and Python.
Familiarity with Hadoop HBase MapReduce and other suitable platforms.
Excellent understanding of operating systems like UNIX Linux and Windows.
Strong project management and organisational skills.
What You Know:
GCP Expertise: Proficient in GCP Dataflow BigQuery Cloud composer and google Storage services.
Databricks: Experience with Sparkbased data engineering workflows including Delta Lake.
Testing Tools & Frameworks: Strong experience with tools like PyTest dbt (data build tool) or similar testing frameworks for data pipelines.
SQL and Scripting: Advanced SQL skills for data validation with Python proficiency for automation.
Big data Knowledge: Knowledge of Big Data processing and distributed computing. Understand the importance data modeling concepts
CI/CD: Knowledge of CI/CD pipelines on or GitHub Actions focusing on data engineering workflows.
Data Governance: Familiarity with data governance tools and concepts such as metadata management lineage and data cataloging.
Communication: Excellent communications skills with the ability to synthesize simplify and explain complex problems to different types of audience including executives
Education:
BS or MS degree in Computer Science or a related technical field.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.