AI Data Engineer
Job Summary
- Build and maintain scalable data pipelines using Spark Databricks and cloud platforms
- Design data models for analytics ML and AI applications
- Drive adoption of AI tools and agentic workflows within the data engineering team
- Identify and implement ways to improve engineering efficiency using AI
- Prototype and scale AI-assisted development practices
- Act as a go-to expert for AI experimentation and knowledge sharing
- Help establish best practices and contribute to an AI-focused community or guild
- Build pipelines supporting ML models LLM applications and AI workflows
- Ensure data quality observability and reliability
- Collaborate with Product Data Science ML/AI and DevOps teams
Qualifications :
- 3 years of commercial experience in data engineering
- Strong proficiency in SQL and Python (development and optimization)
- Hands-on experience with Spark/PySpark (Databricks is a plus)
- Experience with cloud data platforms (Azure preferred: ADF Synapse ADLS Event Hub)
- Solid understanding of ETL/ELT data modeling and data warehousing
- Experience with orchestration tools (Airflow ADF)
- Understanding of reliability performance and production-grade systems
- Hands-on experience using AI coding tools (Copilot Cursor Claude Code etc.) in real workflows
- Experience delivering at least one project with AI-assisted development
- Ability to structure tasks for AI tools and critically validate their output
- Upper Intermediate level of English for effective communication
WILL BE A PLUS
- Experience configuring AI development environments (agents integrations workflows)
- Familiarity with LLMs embeddings and RAG architectures
- Experience with vector databases (pgvector FAISS etc.)
- Familiarity with AI/agent frameworks (LangChain LlamaIndex etc.)
- Experience with dbt Kafka BI tools
- Data quality tooling (Great Expectations Soda etc.)
- Multi-cloud experience (AWS/GCP)
- Interest in advanced topics (evaluation reranking drift detection synthetic data)
- Contributions to AI/data tooling or open source
Additional Information :
PERSONAL PROFILE
- Proactive and ownership-driven
- Strong problem-solving and critical thinking skills
- Comfortable working in evolving environments with emerging tools
- Collaborative and able to influence others
- Genuinely interested in the intersection of data engineering and AI
Remote Work :
Yes
Employment Type :
Full-time
About Company
At Sigma Software, we are involved with the clients team to contribute to the design and development of a technical solution for their tokenized domain reservation platform. We started by assigning a software architect to design the smart contracts and integrate blockchain into the s ... View more