Data Engineer (Generative AI)

VDart Inc

Job Location:

Atlanta, GA - USA

Monthly Salary: Not Disclosed
Posted on: 10 hours ago
Vacancies: 1 Vacancy

Job Summary

Job Title: Data Engineer (Generative AI)

Location: Charlotte, NC / Atlanta, GA (Onsite) or Remote

Job Type: Contract

Tax Terms: C2C, W2

Responsibilities:

  • Design, develop, and optimize data pipelines using Python and PySpark for batch and incremental processing.
  • Build and manage AWS-based data solutions leveraging services such as S3, Glue, and cloud-native processing frameworks.
  • Prepare, transform, and curate datasets to support AI/ML and GenAI model development.
  • Integrate data pipelines with AI/ML workflows, ensuring data quality, consistency, and traceability.
  • Implement data validation, profiling, and performance tuning to improve reliability and scalability.
  • Collaborate with data scientists, ML engineers, and platform teams to deliver end-to-end GenAI solutions.
  • Exposure to GenAI pipelines, model data preparation, or LLM-driven workflows.
  • Experience with CI/CD, data quality frameworks, or cloud cost optimization.
  • Familiarity with SQL-based analytics and metadata-driven data processing.

Skills:

  • The role focuses on Python and PySpark development on AWS, enabling AI/ML and GenAI use cases through reliable, high-quality data foundations.
  • Strong hands-on experience with Python for data engineering and automation.
  • Proven expertise in PySpark / Spark for large-scale data processing.
  • Experience working in AWS cloud environments for data engineering workloads.
  • Solid understanding of data engineering fundamentals, including ETL, data modeling, and performance optimization.
  • Experience supporting or working alongside AI/ML or GenAI initiatives.

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala