Mazlan Models is Ideagen's programme to build domain-specific AI models for regulated industries, where the quality of data directly determines the quality of outcomes. As AI Data Engineering Lead, you will own the data foundation every model is built on and shape how trusted AI is delivered at scale. This is a leadership role combining strategy, architecture and governance with hands-on impact across sourcing, transforming, versioning and preparing high-value regulated data for training. You will lead a growing team of data engineers and work closely with AI engineering, legal and domain experts to ensure our models are accurate, compliant and ready for real-world use.
Responsibilities
Leading and developing a team of AI data engineers, setting clear technical standards, supporting career growth and scaling the function as the programme grows
Defining the technical direction for AI data engineering, including architecture decisions, tooling choices and delivery practices across the organisation
Designing and building the end-to-end AI data platform, from operational product data and regulatory sources, through cloud storage and transformation pipelines, to training-ready datasets
Owning dataset versioning and lineage so every training artefact is traceable, reproducible and auditable across the full model lifecycle
Building and maintaining large-scale regulatory and operational corpora in collaboration with domain experts, ensuring data quality and consistency
Architecting and operating AWS-based data infrastructure at production scale, with a focus on reliability, security and performance
Defining and enforcing data governance standards, including quality checks, labelling conventions and data handling frameworks
Leading GDPR compliance for AI training data in partnership with Legal, and ensuring best practice is embedded from the start
Skills and Experience
You are a senior data engineer or technical lead with prior experience leading teams and owning large data platforms end to end
You have deep production experience with Python and SQL, and write data transformation code that is robust, readable and reusable
You have designed and run AWS data stacks at scale, including services such as S3, Glue, Athena, Kinesis, Lambda and IAM
You understand ML training data pipelines and know how they differ from analytics workloads, including dataset formats, splits and quality constraints
You bring strong data governance instincts and design for versioning, lineage and auditability from day one
You are comfortable working with legal and compliance partners on sensitive data handling and regulatory requirements
You communicate clearly across disciplines and work effectively with AI engineers, product leaders and domain specialists
Experience with NLP or LLM training data, data version control tools or regulated-industry software is valuable but not essential
About Ideagen
Ideagen is the invisible force behind many things we rely on every day - from keeping airplanes soaring in the sky, to ensuring the food on our tables is safe, to helping doctors and nurses care for the sick. So when you think of Ideagen, think of it as the silent teammate that's always working behind the scenes to help those people who make our lives safer and better. Every day, millions of people are kept safe using Ideagen software. We have offices all over the world, including America, Australia, Malaysia and India, with people doing lots of different and exciting jobs.
We're building a future-ready team, and AI is part of how we work smarter. If you're curious, adaptable and open to using AI to improve how you work, you'll thrive at Ideagen!
What is next
If your application meets the requirements for this role, our Talent Acquisition team will be in touch to guide you through the next steps.
At Ideagen, we value the importance of work-life balance and welcome candidates seeking flexible arrangements. If this is something you are interested in, please let us know during the application process. Enhance your career and make the world a safer place!
Location - Ruddington, Nottinghamshire
Level - Professional
Department - Product R&D
Working Pattern - Remote unless based locally
Benefits - Benefits at Ideagen
Salary - To be discussed at next stage