As a member of this team the successful candidate will: - Develop novel ML/DL models including LLMs for natural language and conversational understanding- Work with software and platform engineers to convert and compile the models to run on device and integrate them with runtime systems.- Optimize latency and performance of the models- Diagnose model errors and perform accuracy hill climbing for shipped products.
Solid understanding of state-of-the-art technology in machine learning deep learning (including LLMs) and natural language understanding.
Excellent problem-solving (e.g. via building forward-looking prototype systems) critical thinking strong communication and collaboration skills
3-5 years proven programming skills using standard ML tools such as C/C Python PyTorch Tensorflow Hugging Face etc.
Hands-on experience working (training fine-tuning optimizing deploying) with large models (e.g. LLMs).
Hands-on experience applying common machine learning optimization techniques like quantization and distillation to reduce the resource consumption and/or eliminate latency
Publication at top ML/DL/NLP conferences such as NeurIPS ACL EMNLP etc. is a plus.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.