Job Title: Data Engineer (Generative AI)
Location: Charlotte NC Atlanta GA (Onsite) OR Remote
Job Type: Contract
Tax Terms: C2C W2
Responsibilities:
- Design develop and optimize data pipelines using Python and PySpark for batch and incremental processing.
- Build and manage AWS based data solutions leveraging services such as S3 Glue and cloud native processing frameworks.
- Prepare transform and curate datasets to support AI/ML and GenAI model development.
- Integrate data pipelines with AI/ML workflows ensuring data quality consistency and traceability.
- Implement data validation profiling and performance tuning to improve reliability and scalability.
- Collaborate with data scientists ML engineers and platform teams to deliver end to end GenAI solutions.
- Exposure to GenAI pipelines model data preparation or LLM driven workflows.
- Experience with CI/CD data quality frameworks or cloud cost optimization.
- Familiarity with SQL based analytics and metadata driven data processing
Skills:
- The role focuses on Python and PySpark development on AWS enabling AI/ML and GenAI use cases through reliable high quality data foundations
- Strong hands on experience with Python for data engineering and automation.
- Proven expertise in PySpark / Spark for large scale data processing.
- Experience working in AWS cloud environments for data engineering workloads.
- Solid understanding of data engineering fundamentals including ETL data modeling and performance optimization.
- Experience supporting or working alongside AI/ML or GenAI initiatives.
Job Title: Data Engineer (Generative AI) Location: Charlotte NC Atlanta GA (Onsite) OR Remote Job Type: Contract Tax Terms: C2C W2 Responsibilities: Design develop and optimize data pipelines using Python and PySpark for batch and incremental processing. Build and manage AWS based data soluti...
Job Title: Data Engineer (Generative AI)
Location: Charlotte NC Atlanta GA (Onsite) OR Remote
Job Type: Contract
Tax Terms: C2C W2
Responsibilities:
- Design develop and optimize data pipelines using Python and PySpark for batch and incremental processing.
- Build and manage AWS based data solutions leveraging services such as S3 Glue and cloud native processing frameworks.
- Prepare transform and curate datasets to support AI/ML and GenAI model development.
- Integrate data pipelines with AI/ML workflows ensuring data quality consistency and traceability.
- Implement data validation profiling and performance tuning to improve reliability and scalability.
- Collaborate with data scientists ML engineers and platform teams to deliver end to end GenAI solutions.
- Exposure to GenAI pipelines model data preparation or LLM driven workflows.
- Experience with CI/CD data quality frameworks or cloud cost optimization.
- Familiarity with SQL based analytics and metadata driven data processing
Skills:
- The role focuses on Python and PySpark development on AWS enabling AI/ML and GenAI use cases through reliable high quality data foundations
- Strong hands on experience with Python for data engineering and automation.
- Proven expertise in PySpark / Spark for large scale data processing.
- Experience working in AWS cloud environments for data engineering workloads.
- Solid understanding of data engineering fundamentals including ETL data modeling and performance optimization.
- Experience supporting or working alongside AI/ML or GenAI initiatives.
View more
View less