Principal Software Engineer AIML (Ireland)
Job Summary
Principal Software Engineer to work on cutting-edge AI/ML applications and agent systems leveraging modern inference platforms to build production-ready prototypes. You will contribute to upstream communities like vLLM TGI PyTorch and OpenVINO while building innovative applications that demonstrate the capabilities of next-generation AI/ML systems. The ideal candidate is energized by working concurrently on a wide variety of projects independently as well as within a team environment.
What you will do:
- Build high-quality high-performing AI/ML applications and agent systems using modern inference platforms for multi-modal and distributed model serving
- Apply and optimize inference techniques including KV cache management model quantization and distributed serving to production workloads
- Contribute to upstream inference runtime communities such as vLLM TGI PyTorch OpenVINO and related projects
- Build multi-modal AI applications integrating vision language and other modalities
- Provide technical leadership and coordination across multiple stakeholders and engineering teams
- Apply a growth mindset by staying current with rapid advancements in AI/ML inference technologies
- Benchmark and analyze inference performance at scale driving data-driven optimization decisions
- Publicize innovations through blogs presentations conferences and other technical venues
What you will bring:
- Bachelors degree in Computer Science Engineering or equivalent experience
- 5 years of experience in AI/ML engineering with focus on production inference systems
- Deep expertise in PyTorch and modern deep learning frameworks
- Hands-on experience with inference runtime optimization (model serving batching KV cache management)
- Advanced programming skills in Python and C
- Proven ability to contribute to and lead open source projects
- Strong self-motivation and organizational skills
- Ability to work concurrently on multiple projects independently and within a team environment
- Excellent English written and verbal communication skills
- Collaborative attitude and willingness to share ideas openly
The following are considered a plus:
- Experience with vLLM TGI (Text Generation Inference) or similar inference runtimes
- Contributions to PyTorch OpenVINO or other inference frameworks
- Experience with distributed model serving and GPU optimization
- Familiarity with Kubernetes and cloud-native AI/ML deployments
- Knowledge of model quantization techniques (GPTQ AWQ FP8 etc.)
- Experience with CUDA Triton or other GPU programming frameworks
- Experience with diffusion models and diffusion transformers
- Experience building AI agents and agentic systems
About Red Hat
Red Hat is the worlds leading provider of enterprise open source software solutions using a community-powered approach to deliver high-performing Linux cloud container and Kubernetes technologies. Spread across 40 countries our associates work flexibly across work environments from in-office to office-flex to fully remote depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas no matter their title or tenure. Were a leader in open source because of our open and inclusive environment. We hire creative passionate people ready to contribute their ideas help solve complex problems and make an impact.
Inclusion at Red Hat
Red Hats culture is built on the open source principles of transparency collaboration and inclusion where the best ideas can come from anywhere and anyone. When this is realized it empowers people from different backgrounds perspectives and experiences to come together to share ideas challenge the status quo and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access and that all voices are not only heard but also celebrated. We hope you will join our celebration and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race color religion sex sexual orientation gender identity national origin ancestry citizenship age veteran status genetic information physical or mental disability medical condition marital status or any other basis prohibited by law.
Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for and will not pay any fees commissions or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.
Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application email . General inquiries such as those regarding the status of a job application will not receive a reply.
Required Experience:
Staff IC
Key Skills
About Company
We revolutionized the operating system with Red Hat® Enterprise Linux®. Now, we have a broad portfolio, including hybrid cloud infrastructure, middleware, agile integration, cloud-native application development, and management and automation solutions. With Red Hat technologies, compa ... View more