This position requires a highly motivated person who wants to help us advance the field of generative AI and multimodal foundation models. You will be responsible for designing implementing and evaluating foundation models based on the latest advancements in the fields taking into account future hardware design and product addition you will have an opportunity to engage and collaborate with several teams across Apple to deliver the best products.
Strong experience in deep learning with demonstrated work in at least one area of multimodal systems (e.g. vision language video etc.).
Proficiency in Python and in a modern deep learning framework such as PyTorch or JAX.
Ability to work in a collaborative environment.
Ability to communicate the results of analyses in a clear and effective manner.
BS and a minimum of 3 years relevant industry experience.
PhD or equivalent practical experience in Computer Science Computer Vision Machine Learning or related technical field.
Track record of impactful research published at top ML conferences (CVPR ICCV/ECCV NeurIPS ICML ICLR etc.).
Deep expertise in multimodal foundation models.
Strong research experience in at least one major area of model development (data curation pre-training fine-tuning alignment or evaluation) particularly as it applies to multimodal systems.
Experience with large-scale training pipelines including working with large datasets and scaling models across distributed systems.
Ability to work independently and drive research projects from conception to completion.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.