Senior Machine Learning Engineer AI & GPU Performance

London - UK

Monthly Salary: Not Disclosed

Posted on: 20 hours ago

Vacancies: 1 Vacancy

Job Summary

Who are we

From your everyday PowerPoint presentations to Hollywood movies AI will transform the way we create and consume content.

Today people want to watch and listen not read both at home and at work. If youre reading this and nodding check out our brand video.

Despite the clear preference for video communication and knowledge sharing in the business environment are still dominated by text largely because high-quality video production remains complex and challenging to scaleuntil now.

Meet Synthesia

Were on amission to make video easy for everyone.Born in an AI lab our AI video communications platform simplifies the entire video production process making it easy for everyone regardless of skill level to create collaborate and share high-quality videos. Whether its for delivering essential training to employees and customers or marketing products and services Synthesia enables large organizations to communicate and share knowledge through video quickly and efficiently. Were trusted by leading brands such as Heineken Zoom Xerox McDonalds and more. Readstories from happy customersand what 1200 people say on G2.

In February 2024 G2 named us as the fastest growing company in the world. Today were at a $2.1bn valuation and we recently raised our Series D. This brings our total funding to over $330M from top-tier investors including Accel Nvidia Kleiner Perkins Google and top founders and operators including Stripe Datadog Miro Webflow and Facebook.

About the role

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will join a team of 40 Researchers and Engineers within the R&D Department working on cutting edge challenges in the Generative AI space with a focus on creating highly realistic emotional and life-like Synthetic humans through text-to-video. Within the team youll have the opportunity to work on the applied side of our research efforts and directly impact our solutions that are used worldwide by over 60000 businesses.

This is an opportunity to work for a company that is impacting businesses at a rapid pace across the globe.

What will you be doing

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will own one or more projects for computationally optimizing large-scale model training and inference pipelines. By partnering with researchers and research teams youll identify high-impact initiatives and push the boundaries of model performance. You will work on re-implementing models in an efficient manner by using PyTorch and underlying technologies like CUDA/Triton Torch compilation etc.

This would include:

Evaluating profiling and optimising compute resource usage (e.g. Hopper & Blackwell GPUs) for cost and time efficiency at training and inference times
Developing customized efficient solutions for inference pipelines (CUDA/Triton kernels) as well as Introducing or enhancing tooling for achieving optimal computational performance (e.g. DL compilers ONNX TensorRT)
Driving the adoption of best practices for large-model training including checkpointing gradient accumulation and memory optimisation among others
Introducing or enhancing tooling for distributed training performance monitoring and logging (e.g. DeepSpeed PyTorch Distributed)
Designing and implement techniques for model parallelism data parallelism and mixed-precision training
Keeping updated on the latest research in model compression (e.g. quantization pruning) and advanced optimisation methods

Who are you

You are a ML engineer passionate about high performance computing
You have a background in Computer Science / Engineering and 3 years of industry experience. (PhD preferred)
You have worked on optimising large models for over 2 years
You have experience developing CUDA/Triton kernels and optimizing models with DL compilers ()
You have great coding skills in Python and C and you care about writing clean and efficient code
You have experience with optimising distributed systems and distributed tools like DDP Deepspeed Accelerate or similar
You have some experience in the video space (Diffusion models / GANs)
You are interested in doing research trying new things and pushing the boundaries going beyond whats already known

The good stuff...

Attractive compensation (salary stock options bonus)
Private Health Insurance in London
Hybrid work setting with an office in London
25 days of annual leave public holidays
Work in a great company culture with the option to join regular planning and socials at our hubs.
A generous referral scheme when you know people that are amazing for us
Strong opportunities for your career growth

You can see more about Who we are and How we work here: Experience:

Senior IC

Who are weFrom your everyday PowerPoint presentations to Hollywood movies AI will transform the way we create and consume content.Today people want to watch and listen not read both at home and at work. If youre reading this and nodding check out our brand video.Despite the clear preference for vid...

Who are we

From your everyday PowerPoint presentations to Hollywood movies AI will transform the way we create and consume content.

Today people want to watch and listen not read both at home and at work. If youre reading this and nodding check out our brand video.

Meet Synthesia

About the role

What will you be doing

This would include:

Evaluating profiling and optimising compute resource usage (e.g. Hopper & Blackwell GPUs) for cost and time efficiency at training and inference times
Developing customized efficient solutions for inference pipelines (CUDA/Triton kernels) as well as Introducing or enhancing tooling for achieving optimal computational performance (e.g. DL compilers ONNX TensorRT)
Driving the adoption of best practices for large-model training including checkpointing gradient accumulation and memory optimisation among others
Introducing or enhancing tooling for distributed training performance monitoring and logging (e.g. DeepSpeed PyTorch Distributed)
Designing and implement techniques for model parallelism data parallelism and mixed-precision training
Keeping updated on the latest research in model compression (e.g. quantization pruning) and advanced optimisation methods

Who are you

You are a ML engineer passionate about high performance computing
You have a background in Computer Science / Engineering and 3 years of industry experience. (PhD preferred)
You have worked on optimising large models for over 2 years
You have experience developing CUDA/Triton kernels and optimizing models with DL compilers ()
You have great coding skills in Python and C and you care about writing clean and efficient code
You have experience with optimising distributed systems and distributed tools like DDP Deepspeed Accelerate or similar
You have some experience in the video space (Diffusion models / GANs)
You are interested in doing research trying new things and pushing the boundaries going beyond whats already known

The good stuff...

Attractive compensation (salary stock options bonus)
Private Health Insurance in London
Hybrid work setting with an office in London
25 days of annual leave public holidays
Work in a great company culture with the option to join regular planning and socials at our hubs.
A generous referral scheme when you know people that are amazing for us
Strong opportunities for your career growth

You can see more about Who we are and How we work here: Experience:

Senior IC

Key Skills

JProfiler
Splunk
Performance Testing
Fiddler
Apache
HP Performance Center
LoadRunner
New Relic
Scalability
J2EE
Java
Scripting

Apply Now

About Company

Synthesia

Create AI videos from text with AI video generator. Get most advanced AI avatars and voiceovers in 130+ languages. Try free AI video generator today!

View Profile View Profile

AI AutoApply

Apply to 100+ jobs with one click

AI Resume Builder

Create an ATS-ready CV in minutes

AI Cover Letter

Write a personalized letter instantly

Senior Machine Learning Engineer AI & GPU Performance

London - UK

Job Summary

Who are we

Meet Synthesia

About the role

What will you be doing

Who are you

The good stuff...

Who are we

Meet Synthesia

About the role

What will you be doing

Who are you

The good stuff...

Key Skills

About Company

Related Jobs