DATA ENGINEER

TechniPros

Job Location:

Scottsdale, AZ - USA

Monthly Salary: Not Disclosed
Posted on: 7 hours ago
Vacancies: 1

Job Summary

Role: Data Engineer
Location: Scottsdale, AZ (Onsite)
Long-term contract
Looking for W2 candidates. No C2C.

Job Description:

We are looking for a skilled Data Engineer with strong PySpark experience to work on large-scale data processing and analytics initiatives. The ideal candidate will have hands-on experience with large datasets, complex joins, and performance optimization, along with the ability to apply basic analytical thinking and deliver clear, stakeholder-ready outputs.


Core Skill Sets (Must-Have):
Strong hands-on experience with PySpark
Extensive experience working with large datasets
Proven expertise in joining large databases efficiently
Ability to write high-performance, optimized code
Basic analytical skills to interpret and validate data
Reporting skills using Excel

Good-to-Have Skills:
Experience in model development or supporting analytics/modeling teams
SAS experience
Exposure to Cloudera or similar big data platforms
Understanding of data warehousing and analytics workflows

Key Responsibilities:
Data Engineering & Development:
Design, develop, and maintain scalable data pipelines using PySpark.
Write efficient and optimized PySpark code to process and transform large-scale datasets.
Handle joins across multiple large databases, ensuring performance, accuracy, and scalability.
Optimize Spark jobs to minimize runtime, memory usage, and compute cost.
Work with structured and semi-structured data from multiple sources.
Data Preparation & Analysis Support:
Build and curate training and analytical datasets by joining and transforming multiple data sources.
Apply basic analytical skills to understand data patterns, anomalies, and business relevance.
Perform data validation and quality checks, including:
Record counts and reconciliation
Duplicate detection
Null and outlier checks
Schema and data-type validation
Ensure datasets are analysis-ready and trustworthy.
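
For candidates unfamiliar with the terminology, the validation checks listed above could be sketched roughly as follows. This is a minimal plain-Python illustration on an in-memory list of records, not PySpark code and not the employer's actual process. The function name `validate_records`, the sample fields `id` and `amount`, and the 5x median-absolute-deviation outlier rule are all assumptions made for illustration.

```python
from collections import Counter
from statistics import median


def validate_records(rows, key, expected_types, numeric_field, expected_count=None):
    """Run basic quality checks on a list of dict records (hypothetical helper)."""
    report = {}

    # Record count and reconciliation against an expected total, if one is known.
    report["row_count"] = len(rows)
    report["count_matches"] = expected_count is None or len(rows) == expected_count

    # Duplicate detection on the key column.
    key_counts = Counter(row.get(key) for row in rows)
    report["duplicate_keys"] = [k for k, n in key_counts.items() if n > 1]

    # Null check: count missing values per expected column.
    report["null_counts"] = {
        col: sum(1 for row in rows if row.get(col) is None)
        for col in expected_types
    }

    # Schema and data-type validation: non-null values must match the expected type.
    report["type_errors"] = [
        (i, col)
        for i, row in enumerate(rows)
        for col, typ in expected_types.items()
        if row.get(col) is not None and not isinstance(row[col], typ)
    ]

    # Outlier check (assumed rule): flag values more than 5 median absolute
    # deviations away from the median of the numeric field.
    values = [
        row[numeric_field]
        for row in rows
        if isinstance(row.get(numeric_field), (int, float))
    ]
    report["outliers"] = []
    if values:
        mid = median(values)
        mad = median(abs(v - mid) for v in values)
        if mad > 0:
            report["outliers"] = [v for v in values if abs(v - mid) > 5 * mad]

    return report


# Hypothetical sample data with one duplicate key, one null, one bad type,
# and one extreme value.
rows = [
    {"id": 1, "amount": 10.0},
    {"id": 2, "amount": 12.0},
    {"id": 2, "amount": 11.0},
    {"id": 3, "amount": None},
    {"id": 4, "amount": 500.0},
    {"id": 5, "amount": "n/a"},
]
report = validate_records(
    rows,
    key="id",
    expected_types={"id": int, "amount": float},
    numeric_field="amount",
    expected_count=6,
)
print(report)
```

On a real Spark cluster the same checks would normally be expressed as DataFrame operations so that they run in a distributed fashion; the logic of each check stays the same.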

Stakeholder Interaction & Reporting:
Understand business objectives and translate them into data requirements.
Ask the right questions to determine:
Level of aggregation required
Metrics definitions
Data freshness and accuracy expectations
Preferred output and reporting formats
Present results and insights clearly to stakeholders.
Create reports and summaries using Excel for business users and leadership.

Best regards,

Tina
Phone: 1-
Email:

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala