Job Description: Agentic Data Engineer to design develop and deploy data pipelines that leverage agentic AI that solve realworld problems
The client is seeking a highly skilled Agentic Data Engineer to design develop and deploy data pipelines that leverage agentic AI that solve realworld problems. The ideal candidate will have experience in designing data process to support agentic systems ensure data quality and facilitating interaction between agents and data.
Responsibilities:
Designing and developing data pipelines for agentic systems develop Robust data flows to handle complex interactions between AI agents and Data sources.
Ability to train and fine tune large language models
Design and build the data architecture including databases data lakes to support various data engineering tasks.
Develop and manage Extract Load transform (ELT) processes to ensure data is accurately and efficiently moved from source systems to analytical platforms used in data science.
Implement data pipelines that facilitate feedback loops allowing human input to improve system performance in humanintheloop systems.
Work with vector databases to store and retrieve embeddings efficiently.
Collaborate with data scientists and engineers to preprocess data train models and integrate AI into applications.
Optimize data storage and retrieval with high performance
Statistical analysis trends patterns to create data formats from multiple sources.
Qualifications:
Strong Data engineering fundamentals
Utilize Big data frameworks like Spark/Databricks
Training LLMs with structed and unstructured data sets.
Understanding of Graph DB
Experience with Azure Blob Storage Azure Data Lakes Azure Databricks
Experience implementing Azure Machine Learning Azure Computer Vision Azure Video Indexer Azure OpenAI models Azure Media Services Azure AI Search
Determine effective data partitioning criteria
Utilize data storage system spark to implement partition schemes
Understanding core machine learning concepts and algorithms
Familiarity with Cloud computing skills
Strong programming skills in Python and experience with AI/ML frameworks.
Proficiency in vector databases and embedding models for retrieval tasks.
Expertise in integrating with AI agent frameworks.
Experience with cloud AI services (Azure AI).
Experience with GIS spatial data to create markers on maps ( lat long nearest topology of road geolocate between datasets correlation etc.).
Experience with Department of Transportation Data Domains developing an AI Composite Agentic Solution designed to identify and analyzedata models connect & correlateinformation to validatehypotheses forecast predict and recommendpotential strategies and conduct Whatif analysis.
Bachelors or masters degree in computer science AI Data Science or a related field.
Skills:
Skill
Required / Desired
Amount
of Experience
Understanding the Big data Technologies
1
Years
Experience developing ETL and ELT pipelines
1
Years
Experience with Spark GraphDB Azure Databricks
1
Years
Expertise in Data Partitioning
1
Years
Experience with Data conflation
3
Years
Experience developing Python Scripts
3
Years
Experience training LLMs with structured and unstructured data sets
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.