drjobs Dolby Careers Senior Multimodal AI Researcher, Audio

Dolby Careers Senior Multimodal AI Researcher, Audio

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Atlanta, GA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Join the leader in entertainment innovation and help us design the future. At Dolby science meets art and high tech means more than computer code. As a member of the Dolby team youll see and hear the results of your work everywhere from movie theaters to smartphones. We continue to revolutionize how people create deliver and enjoy entertainment worldwide. To do that we need the absolute best talent. Were big enough to give you all the resources you need and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture challenging projects and excellent compensation and benefitsnot to mention a Flex Work approach that is truly flexible to support where when and how you do your best work.

The Advanced Technology Group (ATG) is the research division of the company. ATGs mission is to look ahead deliver insights and innovate technological solutions that will fuel Dolbys continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering such as AI/ML algorithms digital signal processing audio engineering image processing computer vision data science & analytics distributed systems cloud edge & mobile computing computer networking and IoT.

Dolby is looking for a talentedSenior Multimodal AI Researcher Audioto join Dolbys research efforts and drive innovation in multimodal AI for audio applications multimodal representations and generative modeling for audio speech and music. You will join the Machine Reasoning and Perception team to join a team of top-tier researchers working on challenging problems in multimodal AI for entertainment applications. You will focus on the creation and implementation of multimodal and audio AI technologies from the underlying theoretical concepts to the development of prototypes and demonstrations with the goal to create new experiences.

You will drive key innovations for Dolbys core business which allow Dolby and its customers to build products that push the boundaries of sound and multimedia experiences.

Summary

You will push the boundaries of the state-of-the-art in audio and multimodal technologies. The ideal candidate would have a strong background in deep learning both in terms of conceptual understanding as well as practical experience with previous exposure to audio applications. A core aspect of this role involves being able to keep up to date with the literature implement and innovate with the bleeding edge in generative models self-supervised learning and multi-modal learning.

With the explosion of large language models and natural language processing you will partner closely with Dolbys worldwide AI research staff which actively pursues the integration of such models into audio and media experiences. You will be able to hit the ground running innovate and contribute to such projects. Consequently experience with language models question answering vision-language models captioning etc. would be highly beneficial.

We are looking for candidates with experience inany ofthe following:

  • Generative modeling for audio applications (diffusion models autoregressive models masked generative transformers).
  • Multimodal semantic understanding and multimodal reasoning.
  • Multimodal representations (audio-video audio-text audio-video-text).
  • Multimodal AI architectures with a focus on generating audio music and speech (text-to-audio video-to-audio image-to-audio).
  • Self and semi-supervised learning.
  • AI driven audio enhancement processing and generation (for speech and music) such as speech enhancement and analysis source separation text-to-speech text-to-music music information retrieval audio classification.
  • LLMs for audio applications.

Main responsibilities

  • Partner closely with other domain experts to refine and execute Dolbys technical strategy in artificial intelligence and machine learning.
  • Use deep learning to create new solutions(including foundation models)and enhance existing applications.
  • Push the state-of-the-art and develop intellectual property.
  • Transfer technology to product groups.
  • Establish research collaborations with external university partners.
  • Mentor interns on novel research problems.
  • Publish papers in top-tier conferences and journals.
  • Advise internal leaders on recent deep learning advancements in the industry and academia to further influence research direction and business decisions.

Requirements

  • Ph.D. in Computer Science or similar field.
  • A strong background in deep learning both in terms of conceptual understanding as well as practical experience.
  • Technical knowledge of audio fundamentals.
  • Deep passion for audio music and multimedia applications.
  • Deep knowledge on current machine learning literature.
  • Strong publication record with publications in major machine learning conferences ( ICLR ICML) or top domain-specific conferences is desirable (e.g. ACL CVPR ICASSP Interspeech).
  • Highly skilled in Python and one or more popular deep learning frameworks (TensorFlow orPyTorch).
  • Ability to envision new technologies and turn them into innovative products.
  • Good communication and collaboration skills.

The Atlanta Area base salary range for this full-time position is$130700-$163000which can vary if outside this locationplus bonus benefits and some roles may also include equity. Our salary ranges are determined by role level and location. Within the range individual pay is determined by work location and additional factors including job-related skills competencies experience market demands internal parity and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process.

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code Article 49 and Administrative Code Article 12

Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race religious creed color age sex sexual orientation gender identity national origin religion marital status family status medical condition disability military service pregnancy childbirth and related medical conditions or any other classification protected by federal state and local laws and ordinances.


Required Experience:

Senior IC

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.