drjobs Big Data PySpark Tech Lead

Big Data PySpark Tech Lead

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Irving, TX - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Job Description
Skill: Big Data (PySpark) Tech Lead
  • 10 Years Overall Experience in Data Management Data Lake and Data Warehouse.
  • 6 Years Hadoop Hive Sqoop SQL Teradata.
  • 6 Years PySpark(Python and Spark) Unix.
  • Good to have Industry leading ETL experience.
  • Banking Domain experience.
Key Responsibilities:
  • Ability to design build and unit test applications on Spark framework on Python.
  • Build PySpark based applications for both batch and streaming requirements which will require indepth knowledge on majority of Hadoop and NoSQL databases as well.
  • Develop and execute data pipeline testing processes and validate business rules and policies.
  • Optimize performance of the built Spark applications in Hadoop using configurations around Spark Context SparkSQL Data Frame and Pair RDDs.
  • Optimize performance for data access requirements by choosing the appropriate native Hadoop file formats (Avro Parquet ORC etc) and compression codec respectively.
  • Ability to design & build realtime applications using Apache Kafka & Spark Streaming.
  • Build integrated solutions leveraging Unix shell scripting RDBMS Hive HDFS File System HDFS File Types HDFS compression codec.
  • Build data tokenization libraries and integrate with Hive & Spark for columnlevel obfuscation.
  • Experience in processing large amounts of structured and unstructured data including integrating data from multiple sources.
  • Create and maintain integration and regression testing framework on Jenkins integrated with BitBucket and/or GIT repositories.
  • Participate in the agile development process and document and communicate issues and bugs relative to data standards in scrum meetings.
  • Work collaboratively with onsite and offshore team.
  • Develop & review technical documentation for artifacts delivered.
  • Ability to solve complex datadriven scenarios and triage towards defects and production issues.
  • Ability to learnunlearnrelearn concepts with an open and analytical mindset.
  • Participate in code release and production deployment.
  • Challenge and inspire team members to achieve business results in a fast paced and quickly changing environment.
Salary Range $120000$140000 a year
#LICO1
Location
Irving TX

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.