AI Engineer (Python focus), Futureproof Labs

Rawalpindi - Pakistan

Monthly Salary: PKR 750,000 - 1,200,000

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

About Futureproof Labs

Futureproof Labs is an AI-native product studio building human-first AI products designed for real-world speed and impact. You'll be joining a team focused on transforming raw AI capability into polished, commercially deployable products.

Role Summary

We are seeking a Mid-Senior AI Engineer focused on voice and speech to design, build, and maintain voice-based components (speech-to-text, text-to-speech, voice interface) for Futureproof Labs' AI products. This is a focused, non-generalist role: you will specialize in voice interaction and speech AI, not full-stack work across everything.

We are looking to add the absolute best engineering talent to bolster our team as we build AI products for the US and Canadian markets. You'll get to work with experienced and established founders in North America as you build cutting-edge solutions that can have a global impact.

Key Responsibilities

Build and optimize speech-to-text (STT) and text-to-speech (TTS) capabilities for real-time / near-real-time use.
Integrate voice I/O with existing AI backend (NLP / LLM-driven logic).
Design and implement voice-based user interactions and flows (voice queries, commands, prompts, clarifications).
Handle typical real-world speech challenges: noise, accents/dialects, latency, error recovery, and fallback to text if needed.
Ensure proper handling of voice data (privacy, security, consent), especially if audio or transcripts are stored or processed.
Write tests, monitoring, and logging for voice pipelines; track performance, accuracy, latency, and error rates.
Iterate on voice modules based on usage data and feedback; continuously improve performance and usability.

Required Experience & Skills

At least 3-5 years of experience in speech / voice engineering or related ML/AI roles.
Strong proficiency in backend programming (e.g., Python) for integrating speech modules with AI logic.
Understanding of real-time processing, audio pipelines, and latency optimization to make voice interactions smooth and responsive.
Ability to integrate voice front-end with AI/LLM-based backend systems.
Good collaboration and communication skills to work with product, design, and backend/ML teams.

Nice to Have

Experience with conversational AI / dialog flows (handling multi-turn conversation, context retention).
Exposure to multilingual / accented speech or handling dialect/accent variability.
Audio pre-processing skills (noise reduction, normalization) or familiarity with speech-domain challenges.
Previous experience building voice-based tools, voice assistants, voice-driven workflows, or voice-enabled enterprise tools.
