Systems Software Engineer (Rust, ML Inference)

Ai-coustics

Job Location:

Berlin - Germany

Yearly Salary: EUR 75000 - 95000

Posted on: 2 days ago

Vacancies: 1 Vacancy

Job Summary

ai-coustics is building the reliability layer for Voice AI the system that closes the gap between raw audio input and reliable machine understanding in production. By combining state-of-the-art speech and audio research with real-time production-grade SDKs we test observe and enable Voice AI systems to work in any environment. Our software is used by fast-growing Voice AI companies across Europe and the United States whose products require reliable performance at scale: call center agents voice agents telephony apps and enterprise voice assistants. We believe voice will become the main interface for technology and ai-coustics is building the foundational infrastructure to make audio input reliable measurable and easy to deploy.

We are backed by leading early-stage investors including Connect Ventures Partech Inovia Capital as well as angel investors from HuggingFace DeepMind and Amazon with deep expertise in AI and developer infrastructure. These partners share our vision and are helping us build a world-class team operating with high levels of responsibility and velocity. We look for people who take ownership think systemically and want to solve challenging real-world problems in close collaboration with our customers. If youre motivated by developing technology that is used in practice shaping an emerging category and setting a new standard for how voice AI works in the real world youll feel at home at ai-coustics.

Role Overview

ai-coustics is seeking a Systems Software Engineer to join our Systems team working at the core of our real-time Audio AI SDK and inference this role you will help maintain optimize and expand the SDK that powers ai-coustics speech enhancement and Voice AI products across a wide range of platforms runtimes and languages.

You will work primarily on our Rust-based inference and systems codebase which underpins the Airten real-time inference engine DSP modules telemetry model execution pipeline and public SDKs used by developers worldwide. Your work will directly impact model performance runtime efficiency reliability developer experience and our ability to deploy neural audio models in latency-critical production environments.

This role sits at the intersection of systems programming ML inference real-time audio and developer infrastructure. You do not need to be an ML researcher but you should be excited about making neural networks run fast safely and predictably in real-world applications.

Ideal starting date: August/September

Tasks

ML Inference Engine & Runtime Development

Design implement and optimize systems-level components of the ai-coustics SDK and inference runtime
Improve the performance memory usage and stability of the Airten real-time inference engine
Work on model execution tensor operations scheduling streaming inference and runtime abstractions
Support deployment of neural audio models across CPU WASM and other constrained runtime environments
Explore and integrate ideas from modern inference engines and ML runtimes such as Burn ONNX Runtime tract TensorRT or similar systems
Help bridge the gap between research models and production-ready low-latency inference

Audio DSP & Real-Time ML Systems

Develop and maintain DSP modules and supporting audio-processing infrastructure
Optimize streaming workloads under strict latency jitter and memory constraints
Build tooling to validate numerical correctness real-time behavior and model quality across platforms
Collaborate with ML researchers to make models easier to export test benchmark and deploy
Contribute to model conversion and deployment workflows including formats such as ONNX internal model formats or Rust-native representations

Language Bindings & Platform Support

Maintain and expand our C API and public C library generated from our internal Rust codebase
Improve and support SDK wrappers and bindings for C Python and Rust via the public C API
Maintain WASM and SDKs built directly from the internal Rust source
Ensure consistent behavior performance and API guarantees across Linux macOS Windows WASM and embedded-adjacent environments

Testing Reliability & Tooling

Design implement and extend our testing pipeline including unit tests integration tests numerical tests and performance benchmarks
Build tooling to validate real-time constraints memory usage model outputs and cross-language consistency
Improve CI workflows to ensure safe and fast iteration on a closed-source core with public-facing SDKs
Create benchmarks and profiling workflows that help us understand runtime bottlenecks and performance regressions
Improve observability and diagnostics for SDK integrations in customer environments

Documentation & Developer Experience

Write and maintain technical documentation for SDK APIs runtime internals model deployment and integration guides
Collaborate with product and developer-facing teams to improve onboarding and usability
Support internal teams and external developers by diagnosing SDK and inference issues and proposing robust fixes
Contribute to API design with a focus on ergonomics safety portability and long-term maintainability

Requirements

Technical Skills

Strong experience in systems programming ideally with Rust
Solid understanding of C/C interoperability ABIs and FFI design
Experience building or maintaining SDKs libraries inference runtimes or developer-facing systems
Familiarity with real-time systems performance optimization memory management and profiling
Experience writing tests and benchmarks for low-level or performance-critical code
Comfortable working across multiple platforms such as Linux macOS Windows and WASM
Ability to reason about API design unsafe boundaries ownership error handling and long-term maintainability

ML Inference & Audio Systems

Familiarity with ML inference runtimes or deploying neural networks in production
Experience with model formats or inference engines such as ONNX Burn tract TensorRT TFLite Core ML or similar systems
Understanding of how neural networks are represented executed optimized and benchmarked
Exposure to real-time audio constraints such as latency jitter buffering streaming workloads and deterministic processing
Interest in making ML models portable efficient and reliable outside of Python research environments

Mindset & Collaboration

Strong ownership mentality and attention to detail
Comfortable working in a closed-source core with open SDK surfaces
Ability to reason about trade-offs between performance safety portability and developer experience
Clear written communication skills for documentation and technical design discussions
Enjoys working in a fast-moving startup environment with real-world production impact
Excited about building infrastructure that helps Voice AI systems work reliably in messy real-world audio conditions

Benefits

Opportunity to work at a rapidly growing Voice AI startup backed by top investors.
Compensation and equity: Competitive salary package additional benefits and stock options enabling you to take part in the companys success.
Startup Culture: Dynamic fast-paced environment with passionate and collaborative colleagues.
High Impact: Groundbreaking startup at a pivotal growth stage making a real difference in how people experience audio.
Ownership & Autonomy: Take full ownership of projects and ship fast.
Work With the Best: World-class team of engineers and builders with ample room for professional growth.
Contribute to the Future: Define the landscape of Voice AI technology.

If you are ready to lead the charge in revolutionizing Voice AI and drive our startup to new heights we would love to hear from you. Apply today to join the ai-coustics team!

ai-coustics is building the reliability layer for Voice AI the system that closes the gap between raw audio input and reliable machine understanding in production. By combining state-of-the-art speech and audio research with real-time production-grade SDKs we test observe and enable Voice AI systems...

Role Overview

Ideal starting date: August/September

Tasks

ML Inference Engine & Runtime Development

Design implement and optimize systems-level components of the ai-coustics SDK and inference runtime
Improve the performance memory usage and stability of the Airten real-time inference engine
Work on model execution tensor operations scheduling streaming inference and runtime abstractions
Support deployment of neural audio models across CPU WASM and other constrained runtime environments
Explore and integrate ideas from modern inference engines and ML runtimes such as Burn ONNX Runtime tract TensorRT or similar systems
Help bridge the gap between research models and production-ready low-latency inference

Audio DSP & Real-Time ML Systems

Develop and maintain DSP modules and supporting audio-processing infrastructure
Optimize streaming workloads under strict latency jitter and memory constraints
Build tooling to validate numerical correctness real-time behavior and model quality across platforms
Collaborate with ML researchers to make models easier to export test benchmark and deploy
Contribute to model conversion and deployment workflows including formats such as ONNX internal model formats or Rust-native representations

Language Bindings & Platform Support

Maintain and expand our C API and public C library generated from our internal Rust codebase
Improve and support SDK wrappers and bindings for C Python and Rust via the public C API
Maintain WASM and SDKs built directly from the internal Rust source
Ensure consistent behavior performance and API guarantees across Linux macOS Windows WASM and embedded-adjacent environments

Testing Reliability & Tooling

Design implement and extend our testing pipeline including unit tests integration tests numerical tests and performance benchmarks
Build tooling to validate real-time constraints memory usage model outputs and cross-language consistency
Improve CI workflows to ensure safe and fast iteration on a closed-source core with public-facing SDKs
Create benchmarks and profiling workflows that help us understand runtime bottlenecks and performance regressions
Improve observability and diagnostics for SDK integrations in customer environments

Documentation & Developer Experience

Write and maintain technical documentation for SDK APIs runtime internals model deployment and integration guides
Collaborate with product and developer-facing teams to improve onboarding and usability
Support internal teams and external developers by diagnosing SDK and inference issues and proposing robust fixes
Contribute to API design with a focus on ergonomics safety portability and long-term maintainability

Requirements

Technical Skills

Strong experience in systems programming ideally with Rust
Solid understanding of C/C interoperability ABIs and FFI design
Experience building or maintaining SDKs libraries inference runtimes or developer-facing systems
Familiarity with real-time systems performance optimization memory management and profiling
Experience writing tests and benchmarks for low-level or performance-critical code
Comfortable working across multiple platforms such as Linux macOS Windows and WASM
Ability to reason about API design unsafe boundaries ownership error handling and long-term maintainability

ML Inference & Audio Systems

Familiarity with ML inference runtimes or deploying neural networks in production
Experience with model formats or inference engines such as ONNX Burn tract TensorRT TFLite Core ML or similar systems
Understanding of how neural networks are represented executed optimized and benchmarked
Exposure to real-time audio constraints such as latency jitter buffering streaming workloads and deterministic processing
Interest in making ML models portable efficient and reliable outside of Python research environments

Mindset & Collaboration

Strong ownership mentality and attention to detail
Comfortable working in a closed-source core with open SDK surfaces
Ability to reason about trade-offs between performance safety portability and developer experience
Clear written communication skills for documentation and technical design discussions
Enjoys working in a fast-moving startup environment with real-world production impact
Excited about building infrastructure that helps Voice AI systems work reliably in messy real-world audio conditions

Benefits

Opportunity to work at a rapidly growing Voice AI startup backed by top investors.
Compensation and equity: Competitive salary package additional benefits and stock options enabling you to take part in the companys success.
Startup Culture: Dynamic fast-paced environment with passionate and collaborative colleagues.
High Impact: Groundbreaking startup at a pivotal growth stage making a real difference in how people experience audio.
Ownership & Autonomy: Take full ownership of projects and ship fast.
Work With the Best: World-class team of engineers and builders with ample room for professional growth.
Contribute to the Future: Define the landscape of Voice AI technology.

If you are ready to lead the charge in revolutionizing Voice AI and drive our startup to new heights we would love to hear from you. Apply today to join the ai-coustics team!

Apply Now

About Company

Ai-coustics

ai-coustics is a Berlin-based startup pioneering Generative Audio AI technology. Were on a mission to redefine the way people experience speech quality and intelligibility in real-time communication and media content by providing solutions to our millions of people across various vert ... View more

View Profile View Profile

AI AutoApply

Apply to 100+ jobs with one click

AI Resume Builder

Create an ATS-ready CV in minutes

AI Cover Letter

Write a personalized letter instantly

Systems Software Engineer (Rust, ML Inference)

Berlin - Germany

Job Summary

Tasks

Requirements

Benefits

Tasks

Requirements

Benefits

About Company

Related Jobs