Applied Research Engineer - Synthetic Data

Job Location

London - UK

Salary

$200,000 - $350,000

Vacancy

1 Vacancy

Job Description

Shape the future of agentic AI through cutting-edge data strategy

Want to pioneer next-generation data techniques for advanced AI systems? This role combines frontier model research with practical implementation at one of Europe's most ambitious AI startups.

You'll join a rapidly growing AI Data team developing cutting-edge, data-centric approaches that enhance LLMs, VLMs, and Action Models. This isn't just about collecting data; it's about transforming how AI systems learn and operate through synthetic generation, model distillation, and preference alignment.

Founded with a clear mission to push the boundaries of superintelligent agentic AI, this well-funded startup ($200M raised) is assembling world-class talent focused on both advancing capabilities and ensuring responsible development. Their approach is comprehensive: building proprietary technology from data to models, focusing on language, multimodal, and vision systems with superior performance and cost-effectiveness.

As an Applied Engineer focusing on Data Research, you'll develop sophisticated data strategies that directly impact frontier AI systems:

  • Generate and augment synthetic multimodal datasets for VQA, agent behaviours, and virtual navigation
  • Apply model distillation techniques to optimise large-scale models for edge deployment
  • Design evaluation frameworks to measure improvements across multiple domains
  • Lead research into aligning data with human and AI preferences
  • Collaborate with cross-functional teams to integrate data-driven solutions

This role offers rare access to significant compute resources, with a massive GPU cluster that enables cutting-edge work. You'll be joining at a pivotal stage where your contributions will shape core technology and direction.

Requirements:

  • Strong Python programming skills covering parallel computing, system design, and large-scale deployments
  • Experience developing multimodal data pipelines
  • Background in training and deploying LLMs, VLMs, or PyTorch models
  • MSc or PhD in machine learning, computer vision, NLP, or a related field
  • Deep understanding of training and evaluation paradigms for multimodal models
  • Effectiveness in fast-changing environments

Nice to have:

  • Experience with agent-specific data pipelines
  • Background in multimodal human annotation platforms
  • Document understanding/OCR expertise
  • Synthetic data generation experience (particularly multimodal)

You'll have flexibility to work from New York, London, or remotely within European or US East Coast time zones. For those based in cities with offices, hybrid arrangements are available.

Your package includes a highly competitive salary ($200,000 to $350,000, depending on experience) plus significant equity with strong upside potential.

If you're passionate about advancing AI through innovative data approaches and want to make a lasting impact on agentic systems, we'd love to hear from you. All applicants will receive a response.

Employment Type

Full-Time
