Multi Modal AI Senior Developer

Dentsu

Posted on : 02-05-2025

1 Vacancy

Job Location

Pune - India

Monthly Salary

Not Disclosed

Job Description

We are seeking a seasoned Multi Modal AI Senior Developer with deep expertise in leveraging Generative AI for creative and content generation at scale.
The ideal candidate will have a deep understanding of Multi Modal AI, the ability to leverage GenAI models for all creative output and content types, and the ability to work across all content and interaction modalities, including text, visuals, audio/speech, and video. A strong foundation in Large Language Models (LLMs) and Vision Language Models (VLMs) is also highly desirable.
As a Multi Modal AI Senior Developer, you will play a key role in building cutting-edge products and innovative solutions combining the full power of Creative AI workflows, Generative AI, LLMs, and Agentic AI.
Your primary focus will be on building bespoke products and creative workflows leveraging GenAI models to help build out our creative product portfolio for some of our largest, most strategic enterprise product solutions.
The candidate should have a good technical background in Product Development, Cloud-Native App Dev, and Front-End and Back-End Web Application Development, and the ability to build these solutions in cloud environments such as Azure and AWS, integrating with the appropriate Multi Modal AI services.
The candidate will also need strong expertise with Cloud AI services such as Azure OpenAI, AWS Bedrock, and Google Gemini, and the foundation models hosted within those services. Additionally, hands-on experience with a variety of foundation models, Vision Language Models (VLMs), Creative AI Services, and APIs, and the ability to seamlessly integrate all of these together in an automated workflow using APIs and AI Assistants, will be an essential skillset.

Key Skills Required:

Generative AI, Multi Modal AI

Creative AI solutions and workflows across all creative content types, including Copy/Text, Imagery, Key Visuals, Characters, Avatars, Audio, Speech, and Video AI

Creative AI Automation workflows with content creation and content editing at scale using AI services and AI APIs

Experience with multiple Multi Modal AI Foundation Models

LLM, LLM App Dev

AI Agents, Agentic AI Workflows

Responsibilities:

Design and build web apps and solutions that leverage Creative AI Services, Multi Modal AI models, and Generative AI workflows
Leverage Multi Modal AI capabilities supporting all content types and modalities, including text, imagery, audio, speech, and video
Build creative automation workflows that help produce creative concepts, creative production deliverables, and integrated creative outputs, leveraging AI and GenAI models
Integrate AI Image Gen Models and AI Image Editing models from key technology partners
Integrate Text / Copy Gen Models from key LLM providers
Integrate Speech / Audio Gen and Editing models for use cases such as transcription, translation, and AI-generated audio narration
Integrate AI-enabled Video Gen and Video Editing models
Fine-tune Multi Modal AI models for brand-specific usage and branded content generation
Constantly research and explore emerging trends and techniques in the field of generative AI and LLMs to stay at the forefront of innovation
Drive product development and delivery within tight timelines
Collaborate with full-stack developers, engineers, and quality engineers to develop and integrate solutions into existing enterprise products
Collaborate with technology leaders and cross-functional teams to develop and validate client requirements and rapidly translate them into working solutions
Develop, implement, and optimize scalable AI-enabled products
Integrate GenAI and Multi Modal AI solutions into Cloud Platforms, Cloud-Native Apps, and custom Web Apps
Execute implementation across all layers of the application stack, including front-end, back-end, APIs, data, and AI services
Build enterprise products and full-stack applications on the MERN and Python stacks with a clear separation of concerns across layers

Skills and Competencies:

Deep hands-on experience with Multi Modal AI models and tools
Hands-on experience in API integration with AI services
Multi Modal AI competencies:
- Hands-on experience with intelligent document processing, document indexing, and document content extraction and querying using Multi Modal AI models
- Hands-on experience using Multi Modal AI models and solutions for Imagery and Visual Creative, including text-to-image, image-to-image, image composition, image variations, etc.
- Hands-on experience with popular AI Image Composition and Editing models from providers such as Adobe Firefly, Getty Images, Shutterstock, Flux and Flux Pro, and Stable Diffusion, and the ability to integrate them programmatically over API calls and workflows
- Hands-on experience with Computer Vision and Image Processing using Multi Modal AI for use cases such as object detection, automated captioning, automated masking, and image segmentation, all done programmatically over API calls and workflows
- Hands-on experience using Multi Modal AI for Speech, including Text-to-Speech, Speech-to-Text, and use of prebuilt vs. custom voices
- Hands-on experience building voice-enabled and voice-activated experiences using Speech AI and Voice AI solutions
- Hands-on experience with AI Character and AI Avatar development using a variety of tools and platforms
- Fine-tuning Creative AI Content models for Custom Styles, Custom Characters, and Custom Brand-specific imagery
- Fine-tuning Speech Models for Custom Voices
- Good understanding of advanced fine-tuning techniques such as LoRA
- Ability to run fine-tuning workflows end-to-end, in particular for Image Gen and Image Editing models
- Hands-on experience leveraging APIs to orchestrate across Multi Modal AI models
- Hands-on experience building workflows that orchestrate across Multi Modal AI models
- Good experience using AI Assistants to drive natural language interactions and orchestration with Multi Modal AI models
- Good experience using AI Agents and Agentic AI workflows to drive dynamic orchestration across Multi Modal AI services and models
Programming Skills:
- Good expertise in the MERN stack (JavaScript), including client-side and server-side JavaScript
- Good expertise in Python-based development, including Python App Dev for Multi Modal AI integration
- Well-rounded in both programming languages
- Strong experience with client-side JavaScript apps and with building Static Web Apps and Dynamic Web Apps, both in JavaScript
- Hands-on experience in front-end and back-end development
- Minimum 2 years of hands-on experience working with full-stack MERN apps using both client-side and server-side JavaScript
- Minimum 2 years of hands-on experience in Python development
- Minimum 2 years of hands-on experience working with LLMs using Python
LLM Dev Skills:
- Solid hands-on experience building end-to-end RAG pipelines and custom AI indexing solutions to ground LLMs and enhance LLM output
- Good experience building AI-enabled and LLM-enabled workflows
- Hands-on experience integrating LLMs with external tools such as Web Search
- Ability to leverage advanced concepts such as tool calling and function calling with LLMs
- Hands-on experience with Conversational AI solutions and chat-driven experiences
- Experience with multiple LLMs, primarily GPT-4o, o1, and o3-mini, and preferably also Gemini, Claude Sonnet, etc.
- Experience and expertise in Cloud GenAI platforms, services, and APIs, primarily Azure OpenAI and preferably also AWS Bedrock and/or GCP Vertex AI
- Hands-on experience with Assistants and the use of Assistants in orchestrating with LLMs
- Hands-on experience working with AI Agents and Agent Services

Nice-to-Have Capabilities (Not Essential):

Hands-on experience building Agentic AI workflows that enable iterative improvement of output
Hands-on experience with both Single-Agent and Multi-Agent Orchestration solutions and frameworks
Hands-on experience with different Agent communication and chaining patterns
Ability to leverage LLMs for Reasoning and Planning workflows that enable higher-order goals and automated orchestration across multiple apps and tools
Ability to leverage Graph Databases and Knowledge Graphs as an alternative to, or replacement for, Vector Databases to enable more relevant semantic querying and outputs via LLMs
Good background in Machine Learning solutions
Good foundational understanding of Transformer Models
Good foundational understanding of Diffusion Models
Some experience with custom ML model development and deployment is desirable
Proficiency in deep learning frameworks such as PyTorch or Keras
Experience with Cloud ML Platforms such as Azure ML Service, AWS SageMaker, and NVIDIA AI Foundry

Location:

DGS India Pune Kharadi EON Free Zone

Brand:

Dentsu Creative

Time Type:

Full time

Contract Type:

Consultant

Required Experience:

Senior IC

Employment Type

Full-Time
