Senior Engineer â AI Model Compression Research

Axelera AI

Posted on : 04-05-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

milan - Italy

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 04-05-2025

Job Description

Company Overview
Axelera is a European highgrowth Series B startup revolutionizing the AI landscape with our inmemory computing platform. We specialize in creating AI hardware and software optimized for highperformance inference catering to cuttingedge use cases across highend edge computing embodied AI and serverside AI deployments. We are looking for passionate innovative research engineers to join our team and help drive the future of AI.

Role Overview
We are looking for an AI Research Engineer with a strong focus on model compression to join our dynamic team. This role will be responsible for developing cuttingedge compression techniques that make Generative AI models more efficient for realtime inference across a variety of environments from highend edge systems to largescale serverside deployments. You will be key in ensuring that our models are optimized for memory usage computational efficiency and performance while maintaining or improving model accuracy.

This is an exciting opportunity to work at the intersection of advanced machine learning inmemory computing and highperformance AI inference on cuttingedge hardware architectures.

Responsibilities:

Model Compression: Design and implement advanced model compression techniques such as pruning quantization weight sharing and knowledge distillation to make models more memoryefficient and computationally optimized.
Performance Tuning: Optimize compressed models to achieve highthroughput and lowlatency inference specifically tailored to our inmemory computing platform.
Collaboration: Work closely with AI researchers software engineers and hardware engineers to integrate your model optimizations into our AI platform ensuring that models work effectively across edge and serverside deployments.
Innovation: Stay on top of the latest developments in the AI and model compression research space pushing the envelope on novel techniques for reducing model size without sacrificing performance.
Deployment & Testing: Implement best practices for model testing deployment and continuous improvement to ensure models scale effectively in production environments.

Requirements:

Experience: Proven experience (for all levels) working on model compression including techniques like pruning quantization lowrank factorization and knowledge distillation.
Technical Skills:
- Expertise in deep learning frameworks such as TensorFlow PyTorch or JAX.
- Experience optimizing models for resourceconstrained environments such as edge devices or embedded systems.
- Familiarity with distributed systems inmemory computing or highperformance computing environments.
- A strong understanding of deep learning algorithms neural networks and the tradeoffs involved in model compression.
Knowledge: A strong understanding of the latest advancements in AI/ML research particularly in compression and distillation of generative models (e.g. transformers and diffusion models).
Collaboration & Communication: Ability to work in a highly collaborative fastpaced startup environment and communicate complex technical concepts clearly.

Preferred Qualifications:

PhD or advanced degree in Computer Science Machine Learning AI or related fields.
5 years of postgraduation relevant work experience.
Research experience in model compression efficient inference or deploying AI models to resourceconstrained devices.
Familiarity with model deployment frameworks like TensorRT ONNX or similar.
A passion for solving realworld challenges with AI in dynamic highperformance environments.

Location

This position is based in Italy & we support relocation to Bologna Florence or Milan for talent based abroad and interested in this role.

Why Join Us

Impact: Work on groundbreaking technology that will power the next wave of AI applications from edge computing to embodied AI systems.
Culture: Join a diverse driven team that values innovation collaboration and continuous learning.
Growth: As a Series B startup youll have significant growth opportunities including the chance to shape the direction of the product and AI strategy.
Compensation: Competitive salary equity options and benefits package.

How to Apply
Please submit your resume and a brief cover letter explaining why youre excited about this opportunity and how your experience aligns with our model compression goals.

At Axelera AI we wholeheartedly embrace equal opportunity and hold diversity in the highest regard. Our steadfast commitment is to cultivate a warm and inclusive environment that empowers and celebrates every member of our team. We welcome applicants from all backgrounds to join us in shaping the future of AI.

Required Experience:

Senior IC

Employment Type

Full-Time

Company Industry

Key Skills

Apply Now

About Company

Axelera AI

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Senior Engineer â AI Model Compression Research

Axelera AI

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Senior Lead AI Engineer (Foundation Model Hosting)

Senior AI Engineer (AI Foundations)

Senior Lead AI Engineer (SDKs)

AI Engineer

Senior Lead AI Engineer (Gen AI Platform Services)

Model Writer

MCP AI Engineer

Lead Consultant â AI/ML

Senior Engineer â AI Model Compression Research

Axelera AI

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Senior Engineer â AI Model Compression Research