The computer vision algorithm engineer will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apples products. We balance research and product to deliver the highest quality state-of-the-art experiences innovating through the full stack and partnering with cross-functional teams to influence what brings our vision to life and into customers hands.
- M.S. or PhD in Electrical Engineering/Computer Science or a related field (mathematics physics or computer engineering) with a focus on computer vision and/or machine learning
- Rich experiences in video machine learning covering one of the topics: Agentic AI / Multi-Modal LLM / Video Foundation Model / Video Generative Editing
- Proven prototyping skills and proficient in coding (C C Python)
- Excellent written and verbal communications skills be comfortable presenting research to large audiences and have the ability to work hands-on in multi-functional teams
- Publication record in relevant venues (e.g. NeurIPS ICML ICLR CVPR ICCV ECCV SIGGRAPH)
- Industry experiences with multi-modal foundation model and frameworks
- Knowledge and understanding of generative AI multi-modal large language model video caption
- Solid understanding of state-of-the-arts in Video Understanding and familiar with the challenges of developing algorithms that run efficiently on resource constrained platforms
- Team oriented result oriented and self motivated
Required Experience:
Unclear Seniority
The computer vision algorithm engineer will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apples products. We balance research and product to deliver the highest quality state-of-the-art experiences inn...
The computer vision algorithm engineer will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apples products. We balance research and product to deliver the highest quality state-of-the-art experiences innovating through the full stack and partnering with cross-functional teams to influence what brings our vision to life and into customers hands.
- M.S. or PhD in Electrical Engineering/Computer Science or a related field (mathematics physics or computer engineering) with a focus on computer vision and/or machine learning
- Rich experiences in video machine learning covering one of the topics: Agentic AI / Multi-Modal LLM / Video Foundation Model / Video Generative Editing
- Proven prototyping skills and proficient in coding (C C Python)
- Excellent written and verbal communications skills be comfortable presenting research to large audiences and have the ability to work hands-on in multi-functional teams
- Publication record in relevant venues (e.g. NeurIPS ICML ICLR CVPR ICCV ECCV SIGGRAPH)
- Industry experiences with multi-modal foundation model and frameworks
- Knowledge and understanding of generative AI multi-modal large language model video caption
- Solid understanding of state-of-the-arts in Video Understanding and familiar with the challenges of developing algorithms that run efficiently on resource constrained platforms
- Team oriented result oriented and self motivated
Required Experience:
Unclear Seniority
View more
View less