MVA Multi-Modality Interaction Developer

Beijing, Beijing, China • Posted June 03, 2026

Job Type: Full-time

Location: Beijing, Beijing

Posted: June 03, 2026

Category: Computer Occupations

Application Deadline: July 13, 2026

Role Description

Field of activity: Research & Development incl. Design
Department: RD China
Company: Mercedes-Benz Group China Ltd.
Location: Mercedes-Benz Group China Ltd., Beijing
Start date: Immediately
Publication date: ..6
Job number: MERJ6
Working hours: Full-time

Key Responsibilities

Develop on top of current mainstream speech system components, including SSPE, wake-up, VAD, ASR, NLU, DM, TTS, LLM, etc.
Design and implement multimodal fusion that combines speech, DMS camera, OMS camera, dash camera, microphone, sensor, audio system state, voiceprint, and vehicle state data.
Normalize and structure multimodal inputs into system context representations suitable for LLM reasoning, to support future LLM-based assistant use cases such as context-aware dialogue and the collection and application of assistant memory.
Design and maintain consistent multimodal data pipelines, handling time alignment, normalization, and state coherence as data flows from vehicle systems into LLM...
                
