AI Developer with Python for Customer Care AI Platform team (hybrid)
Job Summary
Bei IONOS arbeitest Du bei dem führenden europäischen Anbieter von Cloud-Infrastruktur Cloud-Services und Hosting-Dienstleistungen partnerschaftlich mit unterschiedlichen Teams zusammen. Wir bieten Dir eine Perspektive in einer der zukunftssichersten Branchen. Uns zeichnen offene Arbeitsstrukturen Duz-Kultur und flache Hierarchien mit unvergleichlichem Team-Spirit aus. Wir sind fest davon überzeugt dass Job und Spaß vereinbar sind und bieten Dir hierfür das entsprechende Umfeld. Bei ständigem Wachstum sind wir stets auf der Suche nach neuen Kolleginnen und Kollegen. Werde Teil von IONOS und lass uns gemeinsam wachsen.
About the team:
Our mission is to build a modern ecosystem used for all IONOS customer support needs. The tools developed by us are used in over 20 locations by more than 2.000 users supporting 8 million customer contracts in 10 markets.
The development team has full responsibility for the development lifecycle. This means we plan develop test and deploy our software without any other internal or external dependencies.
Our portfolio revolves around an internally built CRM which is now being enhanced with AI capabilities.
About the product you will be building:
We are building a next-generation AI platform designed to redefine how our company interacts with customers. This isnt just a chatbot; its a high-performance multimodal AI ecosystem powered by state-of-the-art Speech-to-Speech (S2S) models advanced Large Language Models (LLMs) and intelligent orchestration frameworks. Our platform will understand reason and respond across text and voice while seamlessly executing real-time actions to resolve customer needs.
We are aiming for a hybrid architecture of Open Source LLMs industry-leading proprietary models and Model Context Protocol (MCP) to enable contextual reasoning tool invocation and seamless orchestration across systems. The goal is not just to talk to the customer but to act on their needs.
What makes this project unique:
The Voice Frontier: We are building low-latency emotive speech-to-speech pipelines for a truly natural voice channel experience.
Deep System Integration: Our platform connects directly to the companys core systems via MCPs allowing the AI to access real-time customer context and execute complex workflows.
Self-Evolving Logic: We are developing an automated QA and evaluation module that continuously analyzes interactions across programmatically measuring quality accuracy latency and resolution outcomes we can close the feedback loop and adapt system behavior in hours not weeks.
Hybrid Innovation: Youll work at the intersection of build vs. buy integrating the best of the open-source community with custom-built internal infrastructure.
Whats in it for you:
You wont just be shipping code; youll be part of making this concept evolve and shift.
Youll join a friendly experienced team where your voice matters and your contribution shapes real-world outcomes. Youll work in a modern environment with technologies and practices that help us ship reliable software efficiently.
Role description:
As an AI Engineer on this team you will build the core intelligence systems behind our multimodal AI will be responsible for moving beyond simple chat interfaces to build high-performance real-time systems that handle complex reasoning deep context retrieval LLM orchestration retrieval-augmented generation (RAG) and seamless voice interactions.
Main responsibilities:
- Design Agentic Workflows: Design and implement LLM-based systems that go behind response generation - enabling structured tool usage workflow orchestration and secure interaction with internal services via MCP (Model Context Protocol).
- Build and Optimize RAG & CAG: Develop high-performance Retrieval-Augmented Generation and Context-Augmented Generation pipelines to ensure accurate relevant and low-latency responses. Continuously improve context management ranking strategies and grounding mechanisms to support complex multi-step interactions.
- Voice Channel Mastery: Develop and optimize real-time Speech-to-Speech (S2S) pipelines focusing on streaming architectures latency reduction (including Time to First Word - TTFW) and maintaining a natural conversational flow.
- Evaluation Quality & Alignment: Build and maintain an automated QA module including LLM-as-a-judge patterns to measure accuracy safety latency and resolution quality at scale. Translate evaluation insights into systematic models and prompt improvements..
- Model Strategy & Hybrid Integration: Integrate and operate both commercial foundation models (e.g. OpenAI Anthropic Google) and open-source alternatives (e.g. Qwen Kimi DeepSeek Moonshot GLM) selecting and optimizing models based on performance latency cost and use-case requirements.
We are looking for some of:
- Strong Python and/or Java Engineering Skills: Advanced-level Python development experience including asynchronous programming (e.g. FastAPI asyncio) and building high-performance production-grade services. Experience with streaming architectures is a strong advantage.
- LLM Application & Multi-Agent Orchestration Experience: Hands-on experience building LLM-powered systems including multi-step workflows stateful agents and tool invocation. Familiarity with orchestration frameworks such as LangChain LlamaIndex or LangGraph particularly in building stateful multi-turn agents.
- Advanced Retrieval & Context Management: Deep understanding of vector databases (e.g. Weaviate Qdrant pgvector Elasticsearch) semantic search embedding strategies and re-ranking techniques. Experience designing and optimizing RAG pipelines.
- Real-Time & Low-Latency Systems: Experience in designing systems that operate under latency constraints including streaming APIs event-driven architectures and performance optimization. Understanding of trade-offs between quality cost and response time.
- Evaluation-Driven Development: Experience in implementing evaluation frameworks for LLM-based systems including automated QA pipelines and LLM-as-a-judge patterns.
- Familiar with API Design: knowledge of RESTful API design OAuth2
What we offer:
- Access to local/international trainings development and growth opportunities including access to e-learning platforms covering both technical and soft skills areas;
- Modern technologies product responsibility;
- Flexible work schedule;
- Hybrid work option;
- Medical services package from one of two private providers;
- 25 vacation days per year;
- Substitute days off for public holidays that occur on the weekend;
- Meal tickets;
- Internal referral program;
- Team events networking events organized to promote a passionate creative and diverse culture;
- Summerfest and Winterfest parties;
- Of course coffee soft drinks and fresh fruits are on us in the office.
Über IONOS
IONOS ist der führende europäische Digitalisierungs-Partner für kleine und mittlere Unternehmen (KMU). IONOS hat mehr als sechs Millionen Kundinnen und Kunden und ist mit einer weltweit verfügbaren Plattform in 18 Märkten in Europa und Nordamerika aktiv. Mit seinen Web Presence & Productivity-Angeboten agiert das Unternehmen als One-Stop-Shop für alle Digitalisierungs-Bedürfnisse - von Domains und Webhosting über klassische Website-Builder und Do-It-Yourself-Lösungen von E-Commerce bis zu Online-Marketing-Tools. Darüber hinaus bietet IONOS Cloud-Lösungen für Firmen die im Zuge der Weiterentwicklung ihres Geschäfts in die Cloud wechseln möchten.
Wir wertschätzen Vielfalt und begrüßen alle Bewerbungen unabhängig von z. B. Geschlecht Nationalität ethnischer und sozialer Herkunft Religion Behinderung Alter sowie sexueller Orientierung und Identität körperlichen Merkmalen Familienstand oder einem anderen sachfremden Kriterium nach geltendem Recht.
About Company
IONOS ist mit mehr als acht Millionen Kundenverträgen der führende europäische Anbieter von Cloud-Infrastruktur, Cloud-Services und Hosting-Dienstleistungen