Data engineer

Purple Drive

Not Interested
Bookmark
Report This Job

profile Job Location:

Bellevue, WA - USA

profile Monthly Salary: Not Disclosed
Posted on: 21 days ago
Vacancies: 1 Vacancy

Job Summary

Role Type:Hybrid


Responsibilities and essential job functions include but are not limited to the following:
Demonstrate deep knowledgeofthe data engineering domain to build and support non-interactive (batch distributed) & real-time highly available data data pipeline and technology capabilities
Buildfault-tolerant self-healing adaptive and highly accurate data computational pipelines
Provide consultation and leadthe implementation of complex programs
Develop and maintain documentation relating to all assigned systems and projects
Tune queries running overbillions of rows of data running in a distributed query engine
Perform root cause analysis to identify permanent resolutions to software or business process issues
Bachelors degree in computer science management information systems or related discipline or equivalent work experience
Strong/expertSpark(PySpark) Using Jupyter Notebooks Colab or DataBricks (preferred)
Hands-on data pipeline development ingest patterns inAzure
Orchestration tools ADF or Airflow
SQL
DenormalizedData modeling for big data systems
Collaborative able to work remotely and still be an engaging team member.
Strong analytical and design skills.



Required Skills:

SPARK PYSPARK JUPYTER

Role Type:Hybrid Responsibilities and essential job functions include but are not limited to the following:Demonstrate deep knowledgeofthe data engineering domain to build and support non-interactive (batch distributed) & real-time highly available data data pipeline and technology capabilitiesBuild...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala