Job Summary:
We are looking to add a Large Language Model (LLM) Algorithm Engineer in Changsha, China, to our EM Labs team.
It is a great opportunity to work in a tech-driven company full of interesting and challenging projects, in a relaxed and friendly environment. Our headquarters are in the heart of the city, at Runhe Financial Center.
Company Intro:
EveryMatrix is a leading B2B SaaS provider delivering iGaming software, content and services. We provide casino, sports betting, platform and payments, and affiliate management to 200 customers worldwide. The company is profitable, has over EUR 100m in annual revenues, and has 1200 employees in offices across ten countries in Europe, Asia and the US. EveryMatrix was founded in 2008 and remains a founder-owned private company.
What You'll get to do:
- Design distributed deployment solutions based on NVIDIA hardware architecture (NVLink/NVSwitch). Lead framework selection and performance tuning for vLLM, TensorRT-LLM, SGLang, etc. to achieve high-throughput inference services.
- Build a multimodal GPU cluster management system to optimize KV cache storage and loading strategies, improving service efficiency for long-context scenarios.
- Model optimization and engineering deployment.
- Design prompt engineering strategies combined with RAG (Retrieval-Augmented Generation) technology to enhance response accuracy in scenarios like intelligent customer service and knowledge-based Q&A. Familiarity with LangChain/AutoGen frameworks is required.
- Lead model fine-tuning using parameter-efficient tuning techniques like LoRA/QLoRA to address long-tail issues in vertical domains. Proficiency in PEFT (Parameter-Efficient Fine-Tuning) methods is essential.
- Develop enterprise-grade internal toolchains, including code assistance/generation tools, code review systems, and private knowledge-based systems.
- Design external customer systems such as smart customer service platforms (integrating speech recognition, ticket management, and compliance auditing). Build multi-agent collaborative online assistants leveraging multi-agent task allocation mechanisms.
Requirements:
- Master's degree or above in Computer Science, Artificial Intelligence, or related fields.
- 3 years of professional experience in NLP/LLM projects.
- Proficiency in PyTorch/TensorFlow frameworks, with a deep understanding of the Transformer architecture and the optimization of attention mechanisms.
- Familiarity with CUDA programming and NVLink topology design. Experience in NVIDIA chip operator development (e.g. CUDA kernel optimization) is a plus.
- Mastery of development and deployment frameworks such as LangChain, vLLM, and SGLang, with the ability to independently develop and deploy API services.
- Pre-training or fine-tuning of open-source large models (e.g. LLaMA, DeepSeek).
- Development of intelligent customer service systems (knowledge of IVR, ACD, and call center technologies required).
- Enterprise-level code assistance tools (e.g. code generation, code review systems).
- Construction of knowledge graphs in domains like e-commerce or internet industries.
Here's what we offer:
- Start with 20 days of annual leave, with 2 additional days added each year, up to 30 days by your fifth year with us. Enjoy an additional 13 public holidays and time off for special events, including parental leave, sick leave, bereavement leave, and marriage leave.
- Stay Healthy: 10 sick leave days per year, no doctor's note required.
- Support for New Parents: 22 weeks of paid maternity leave, with the flexibility to work from home full-time until your child turns 1 year old; 4 weeks of paternity leave, plus the flexibility to work from home full-time until your child is 13 weeks old.
- Our office perks include on-site massages and frequent team-building activities in various locations.
Benefits & Perks:
- Monthly lunch allowance.
- English courses.
- On-site gym.
- Access to online learning platforms like Udemy for Business and LinkedIn Learning, and a budget for external training.
At EveryMatrix, we're committed to creating a supportive and inclusive workplace where you can thrive both personally and professionally. Come join us and experience the difference!