You will work on advancing the capabilities of foundation models and steering them toward real-world applications in Apple products. This includes researching and developing methods to improve alignment reasoning and adaptation of large models to practical use cases while ensuring they meet Apples standards for efficiency scalability and privacy. A key part of your role will be to stay current with emerging research and identify techniques suitable for real-world deployment bridging the gap between state-of-the-art advancements and production-ready will design and optimize large-scale data pipelines to enable robust training and fine-grained evaluation of foundation models working with massive multimodal datasets to push the limits of performance. You will explore new ways to enhance reasoning and multimodal understanding adapting models for Apples unique ecosystem from cloud-scale infrastructure to on-device will be central to your workyou will partner with cross-functional teams of engineers and researchers bring these models to life ensuring seamless integration into Apple products and creating intelligent natural user experiences.
BS and a minimum of 3 years relevant industry experience.
Proficient programming skills in Python and experience with at least one modern deep learning framework (PyTorch JAX or TensorFlow).
Experience working with large-scale training pipelines and distributed systems.
PhD or equivalent practical experience in Computer Science Machine Learning or a related technical field.
Demonstrated expertise in deep learning with either: A publication record in relevant conferences (e.g. NeurIPS ICML ICLR CVPR ICCV ECCVCOLM etc) or a strong track record of applying deep learning techniques to real-world products.
Experience with foundation models (language vision-language or multimodal).
Familiarity with large-scale data pipelines including data curation preprocessing and efficient storage.
Experience fine-tuning or optimizing large models for production deployment.
Knowledge of retrieval-augmented generation (RAG) personalization or grounding techniques.
Familiarity with privacy-preserving or on-device machine learning.
Ability to work effectively in a cross-functional collaborative environment.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.