Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailCompany Overview
Axelera is a European highgrowth Series B startup revolutionizing the AI landscape with our inmemory computing platform. We specialize in creating AI hardware and software optimized for highperformance inference catering to cuttingedge use cases across highend edge computing embodied AI and serverside AI deployments. We are looking for passionate innovative research engineers to join our team and help drive the future of AI.
Role Overview
We are looking for an AI Research Engineer with a strong focus on model compression to join our dynamic team. This role will be responsible for developing cuttingedge compression techniques that make Generative AI models more efficient for realtime inference across a variety of environments from highend edge systems to largescale serverside deployments. You will be key in ensuring that our models are optimized for memory usage computational efficiency and performance while maintaining or improving model accuracy.
This is an exciting opportunity to work at the intersection of advanced machine learning inmemory computing and highperformance AI inference on cuttingedge hardware architectures.
Responsibilities:
Model Compression: Design and implement advanced model compression techniques such as pruning quantization weight sharing and knowledge distillation to make models more memoryefficient and computationally optimized.
Performance Tuning: Optimize compressed models to achieve highthroughput and lowlatency inference specifically tailored to our inmemory computing platform.
Collaboration: Work closely with AI researchers software engineers and hardware engineers to integrate your model optimizations into our AI platform ensuring that models work effectively across edge and serverside deployments.
Innovation: Stay on top of the latest developments in the AI and model compression research space pushing the envelope on novel techniques for reducing model size without sacrificing performance.
Deployment & Testing: Implement best practices for model testing deployment and continuous improvement to ensure models scale effectively in production environments.
Requirements:
Experience: Proven experience (for all levels) working on model compression including techniques like pruning quantization lowrank factorization and knowledge distillation.
Technical Skills:
Expertise in deep learning frameworks such as TensorFlow PyTorch or JAX.
Experience optimizing models for resourceconstrained environments such as edge devices or embedded systems.
Familiarity with distributed systems inmemory computing or highperformance computing environments.
A strong understanding of deep learning algorithms neural networks and the tradeoffs involved in model compression.
Knowledge: A strong understanding of the latest advancements in AI/ML research particularly in compression and distillation of generative models (e.g. transformers and diffusion models).
Collaboration & Communication: Ability to work in a highly collaborative fastpaced startup environment and communicate complex technical concepts clearly.
Preferred Qualifications:
PhD or advanced degree in Computer Science Machine Learning AI or related fields.
5 years of postgraduation relevant work experience.
Research experience in model compression efficient inference or deploying AI models to resourceconstrained devices.
Familiarity with model deployment frameworks like TensorRT ONNX or similar.
A passion for solving realworld challenges with AI in dynamic highperformance environments.
Location
This position is based in Italy & we support relocation to Bologna Florence or Milan for talent based abroad and interested in this role.
Why Join Us
Impact: Work on groundbreaking technology that will power the next wave of AI applications from edge computing to embodied AI systems.
Culture: Join a diverse driven team that values innovation collaboration and continuous learning.
Growth: As a Series B startup youll have significant growth opportunities including the chance to shape the direction of the product and AI strategy.
Compensation: Competitive salary equity options and benefits package.
How to Apply
Please submit your resume and a brief cover letter explaining why youre excited about this opportunity and how your experience aligns with our model compression goals.
At Axelera AI we wholeheartedly embrace equal opportunity and hold diversity in the highest regard. Our steadfast commitment is to cultivate a warm and inclusive environment that empowers and celebrates every member of our team. We welcome applicants from all backgrounds to join us in shaping the future of AI.
Required Experience:
Senior IC
Full-Time