drjobs Data Engineer - LLM Pipeline Data Infrastructure

Data Engineer - LLM Pipeline Data Infrastructure

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Amsterdam - Netherlands

Yearly Salary drjobs

EUR 60000 - 80000

Vacancy

1 Vacancy

Job Description

Were building an AIpowered conversational system for drivethru automation. As our Data Engineer youll design and implement the infrastructure that powers our multistage LLM pipeline from data capture to processing model training and deployment.

Tasks

  • Build scalable realtime data pipelines for audio processing LLM interactions and model training
  • Design comprehensive data storage solutions across object storage NoSQL and analytical databases
  • Implement data quality management with filtering normalization and enrichment capabilities
  • Create automated processes for data preparation model evaluation and continuous improvement
  • Develop observability systems with monitoring alerting and performance dashboards
  • Establish data security and compliance protocols including privacy protection measures
  • Build resilient data systems with error recovery backup and integrity verification

Requirements

What Youll Need

  • Experience designing data pipelines for AI/ML applications
  • Expertise with Apache Airflow for workflow orchestration
  • Strong knowledge of Apache Spark for largescale data processing
  • Experience with Apache Kafka for realtime event streaming
  • Proficiency with object storage systems S3/MinIO and database technologies Cassandra/ScyllaDB ClickHouse
  • Understanding of monitoring tools OpenTelemetry and observability platforms
  • Experience implementing data security and compliance measures
  • Advanced Python programming skills

Preferred Experience

  • Audio data processing and conversational AI systems
  • LLM training and finetuning pipelines
  • Data quality frameworks (Great Expectations) and versioning tools (LakeFS DVC)
  • Kubernetes for container orchestration
  • Multiregion deployment and distributed systems

Benefits

  • Build cuttingedge conversational AI systems with realworld impact
  • Work with modern opensource technology stack
  • Help shape the future of automated customer service
  • Competitive compensation and flexible work arrangements

If youre passionate about building robust data systems for AI applications and excited by complex realtime data challenges wed love to talk.

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.