Data Engineer with Scala

Cloudious LLC

Not Interested
Bookmark
Report This Job

profile Job Location:

Bentonville, AR - USA

profile Monthly Salary: Not Disclosed
Posted on: 3 days ago
Vacancies: 1 Vacancy

Job Summary

Only W2 Candidates required No glider needed for this role

Mandatory Areas
Must Have Skills Data Engineer with Scala
Skill 1 Scala Spark Python SQL Bigdata Hadoop
GCP data tools: BigQuery Dataproc Vertex AI Pub/Sub Cloud Functions
Skill 2 PySpark Python SparkSQL and data modeling

Data Engineer with Scala
Bentonville AR- 5 days Onsite
$65/hr on C2C

We are seeking a Data Engineer with Spark & SCALA ; Streaming skills builds real-time scalable data pipelines using tools like Spark Kafka and cloud services (GCP) to ingest transform and deliver data for analytics and ML.

Responsibilities:
As a Senior Data Engineer you will

Design develop and maintain ETL/ELT data pipelines for batch and real-time data ingestion transformation and loading using Spark (PySpark/Scala) and streaming technologies (Kafka Flink).
Build and optimize scalable data architectures including data lakes data warehouses (BigQuery) and streaming platforms.
Performance Tuning: Optimize Spark jobs SQL queries and data processing workflows for speed efficiency and cost-effectiveness
Data Quality: Implement data quality checks monitoring and alerting systems to ensure data accuracy and consistency.
Required Skills & Qualifications:
Programming: Strong proficiency in Python SQL and potentially Scala/Java.
Big Data: Expertise in Apache Spark (Spark SQL DataFrames Streaming).
Streaming: Experience with messaging queues like Apache Kafka or Pub/Sub.
Cloud: Familiarity with GCP Azure data services.
Databases: Knowledge of data warehousing (Snowflake Redshift) and NoSQL databases.
Tools: Experience with Airflow Databricks Docker Kubernetes is a plus.

Experience Level:
Total IT Experience Minimum 8 years
GCP - 4 years of recent GCP experience

Only W2 Candidates required No glider needed for this role Mandatory Areas Must Have Skills Data Engineer with Scala Skill 1 Scala Spark Python SQL Bigdata Hadoop GCP data tools: BigQuery Dataproc Vertex AI Pub/Sub Cloud Functions Skill 2 PySpark Python SparkSQL and data modeling Data...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala