Machine Learning Engineer Data AI Research

Canva

Job Location:

Sydney - Australia

Monthly Salary: Not Disclosed
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

About the Role

As a Research MLE at Canva, you'll be responsible for high-performance data acquisition, processing, and annotation to enable the training of cutting-edge models. Your focus will be on sourcing data, automation, building performant infrastructure for filtering and analyzing, and dealing with petabyte-scale data. You'll be the crucial link that makes novel model development, training, and evaluation possible, accelerating Canva's cutting-edge research.

Key Focus Areas

  • Data Acquisition: Developing scalable tools and pipelines for acquiring diverse datasets from multiple sources
  • Curation: Engineering robust solutions for filtering, deduplication, quality assessment, and curating data that meets specific research requirements and model training criteria
  • Data Infrastructure: Developing high-throughput tools for interfacing with large-scale data pools, enabling efficient querying, sampling, and extracting valuable statistical insights and patterns

Primary Responsibilities

  • Work alongside research teams to ensure continuous flow of high-quality data toward active projects, understanding their specific dataset requirements and delivery timelines
  • Curate targeted subsets of data using ML techniques, including clustering, embedding-based similarity search, and automated quality scoring
  • Extract, visualize, and communicate actionable insights about dataset composition, distributions, biases, and statistical properties to inform research decisions
  • Build performant parallel algorithms for gathering and processing data at scale, optimizing for both throughput and cost-efficiency across distributed systems
  • Engineer intuitive interfaces and tooling to help researchers explore, sample, and interact with large datasets without requiring deep infrastructure knowledge
  • Work with paired multimodal data (text-image, audio-video, etc.), ensuring alignment quality, handling synchronization challenges, and maintaining multimodal correspondence
  • Leverage high-performance parallel computing frameworks (Ray, Spark, DeepSpeed, etc.) and cloud infrastructure for distributed data operations on petabyte-scale datasets

You're probably a match if you have:

  • A strong aesthetic sense with a background or demonstrated passion for visual design or human-computer interaction.

  • Strong proficiency in Python and ML frameworks (e.g. PyTorch, TensorFlow).

  • Extensive experience with designing and implementing large-scale data processing workflows using libraries like Pandas and data warehousing solutions such as Snowflake.

  • Solid understanding of statistical methods, including experimental design, A/B testing, and quality evaluation systems.

  • Experience with generative AI and synthetic data generation is highly desirable.

Nice to have:

  • Experience with cloud platforms (e.g. AWS, GCP, Azure) for data storage, processing, and MLOps related to dataset management.

  • Experience with MLOps practices and tools, specifically for data versioning, lineage, and pipeline automation.

  • Ability to develop data visualization or data collection interfaces (e.g. TypeScript, Python).


Additional Information:

Don't tick all the boxes? Don't worry about that - nobody does! We'd still love to hear from you! At Canva, we know that great engineers come from a variety of backgrounds, and we value passion, curiosity, and a willingness to learn just as much as specific experience. If you're excited about this role but don't tick every box, we encourage you to apply; you might be a great fit in ways you didn't expect!

What's in it for you

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity, and fun woven throughout life at Canva too. We also offer a stack of benefits to set you up for every success in and outside of work.

Here's a taste of what's on offer:

  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports all parents & carers
  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
  • Flexible leave options that empower you to be a force for good, take time to recharge, and support you personally

Check out    for more info.

Other stuff to know

We make hiring decisions based on your experience, skills, and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.


Remote Work:

Yes


Employment Type:

Full-time

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala

About Company

We're a global online visual communications platform on a mission to empower the world to design. Featuring a simple drag-and-drop user interface and a vast range of templates ranging from presentations, documents, websites, social media graphics, posters, apparel to videos, plus a hu ...