AI Data Engineer (Librarian)

Bilue

Not Interested
Bookmark
Report This Job

profile Job Location:

Sydney - Australia

profile Monthly Salary: Not Disclosed
Posted on: 9 hours ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

The Role

AI systems dont fail because of bad models. They fail because of bad libraries outdated documents cited as current knowledge that exists but cant be found datasets that are technically present but practically untrustworthy.

The AI Data Engineer (Librarian) owns that problem. This is not a traditional data engineering role. It sits at the intersection of data governance knowledge management and AI delivery. You will design and maintain the catalogues metadata schemas quality frameworks and lineage structures that every AI system in The Foundry depends on and you will work directly alongside AI Engineers to connect retrieval systems to the clean context-aware knowledge stores youve built.

Think of it this way: the AI Engineer builds the system that searches the library. You build the library.

What Youll Do

  • Design and maintain data catalogues for AI projects using platforms such as DataHub OpenMetadata Apache Atlas Collibra or cloud-native equivalents (AWS Glue Data Catalog Azure Purview GCP Dataplex).

  • Define metadata schemas and taxonomy standards type version jurisdiction validity period confidence tier so retrieval systems know not just what a document is but when it applies and how much to trust it.

  • Assess data quality across client and internal assets using tools like Great Expectations dbt tests or Soda; flag stale superseded or ambiguous records before they reach the AI layer.

  • Build and maintain data lineage so every AI-generated output can be traced back to its source version and validity period making outputs auditable not just accurate.

  • Design automated ingestion workflows and nightly quality checks that keep catalogues current without constant manual intervention.

  • Partner with The Foundry engineers to connect RAG pipelines and agentic retrieval systems to catalogue APIs and with The Labs strategists to ma


Qualifications :

What Were Looking For

  • A background in data governance information management knowledge management or records management with genuine interest in how that work enables AI delivery.

  • Hands-on experience with at least one data catalogue platform (DataHub OpenMetadata Collibra Alation Apache Atlas or a major cloud equivalent) and familiarity with metadata standards such as JSON-LD Dublin Core or domain-specific ontologies.

  • Strong SQL skills; working Python for data profiling quality scripting and metadata automation; and comfort with dbt or OpenLineage for lineage tracking.

  • Understanding of vector databases and RAG architecture enough to know how metadata quality directly affects retrieval precision.

  • Experience in regulated or high-stakes data environments where provenance and auditability genuinely matter: financial services insurance government healthcare or similar.

  • A collaborative low-ego disposition. The Data Librarians work is structural and often invisible. The glory goes to the AI system. You are fine with that.


Bonus: A formal background in library or information science. Experience with knowledge graphs (Neo4j RDF SPARQL). Prior consulting or agency delivery experience.


Additional Information :

Life at Bilue

People-first focus: Were committed to delivering exceptional outcomes for our clients but we know it starts with our people. Youll join a values-led team thats collaborative curious and genuinely cares about doing great work together.

Connection that counts: From monthly anchor days and team lunches to our annual offsite we create intentional moments to connect collaborate and celebrate. These arent just fun perks theyre part of how we work and grow together.

Flexibility that works: We offer hybrid working with 12 days per week in the office. Its a balance that gives you the space to do your best work while still creating time to connect and build strong relationships in person.

Strong internal communities: We actively foster internal communities across tech design delivery and beyond giving you plenty of chances to connect share knowledge and learn from your peers.

Opportunities to grow: We invest in your development with unlimited access to Go1s learning library and support from our internal performance coach. Whether you want to deepen your technical skills or grow your leadership potential well back you.

Flat structure real impact: At Bilue everyones voice matters. Our leadership team is hands-on and approachable and we operate without unnecessary layers. We keep things open and transparent and your ideas will be heard no matter your title.

Bilue Big Blue Ocean. Are you ready to set sail Apply now!

NB. This is a full-time position based in Sydney NSW or Melbourne VIC. To be considered candidates must have unrestricted working rights in Australia.


Remote Work :

No


Employment Type :

Full-time

The RoleAI systems dont fail because of bad models. They fail because of bad libraries outdated documents cited as current knowledge that exists but cant be found datasets that are technically present but practically untrustworthy.The AI Data Engineer (Librarian) owns that problem. This is not a tra...
View more view more

About Company

Hello, we’re Bilue - a leading design and development agency specialising in mobile, cloud, web, and emerging technologies. Bilue was founded by Cameron Barrie, an app developer with a vision to help Australian companies design and deliver cutting-edge digital experiences. What began ... View more

View Profile View Profile