Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailDesign and develop robust data pipelines for agentic AI systems enabling complex interactions between AI agents and data sources
Train and finetune large language models to support agentdriven applications
Architect and build scalable data infrastructure including databases and data lakes
Develop and manage ELT processes to ensure accurate and efficient data movement from source systems to analytical platforms
Implement feedbackdriven pipelines to support humanintheloop systems and continuous performance improvement
Work with vector databases to store and retrieve embeddings efficiently
Collaborate with data scientists and engineers on data preprocessing model training and AI integration
Optimize data storage and retrieval for high performance and scalability
Analyze statistical data trends and patterns to format and standardize inputs from multiple sources
1 year of understanding and working with Big Data technologies
1 year of experience developing ETL and ELT pipelines
1 year of experience using Spark GraphDB and Azure Databricks
1 year of expertise in data partitioning for performance and scalability
3 years of experience with data conflation techniques
3 years of experience developing Python scripts for data processing or automation
2 years of experience training large language models (LLMs) with structured and unstructured data sets
3 years of experience working with GIS spatial data
Required Experience:
Senior IC
Contract