Location: Philadelphia PA
Duration: Permanent and Full time
- Building data pipelines writing ETL logic in PySpark or Scala.
- Ingestion of data into AWS from trafficking systems on premise databases (Teradata MS SQL Server Client Vertica etc.) on premise Hadoop cluster etc.
- Spinning up appropriate EMR EC2 instances for the job
- Scheduling the jobs and automating the jobs
- Creating the data sets in the appropriate format i.e Parquet
- Ingesting the data into S3/Redshift or any data ware house for external team to consume into their application
Company Description:Looking for a great career Global Geek Force Recruiting can improve candidate sourcing interviewing and applicant tracking for a streamlined hiring process. Candidates: please email your resume to