MVA Multi-Modality Interaction Developer
Beijing, Beijing, China • Posted June 03, 2026
Job Type:
Full-time
Location:
Beijing, Beijing
Posted:
June 03, 2026
Category:
Computer Occupations
Application Deadline:
July 13, 2026
Role Description
Tätigkeitsbereich:Forschung & Entwicklung incl. DesignFachabteilung:RD ChinaGesellschaft:Mercedes-Benz Group China Ltd.Standort:Mercedes-Benz Group China Ltd., BeijingStartdatum:sofortVeröffentlichungsdatum:..6Stellennummer:MERJ6Arbeitszeit:Vollzeit BewerbenAufgabenKey ResponsibilitiesDevelop based on the current mainstream speech systems, including SSPE, wakeup, vad, asr, nlu, dm, tts, LLM, and etc. Design and implement multimodal fusion combining speech, DMS camera, OMS camera, Dash camera, microphone, sensors, audio system state, voice print, and vehicle state data. Normalize and structure multimodal inputs into system context representations suitable for LLM reasoning to support future LLM-based assistant use cases, such as; context-aware dialogue, assistant memory collection and apply, and etc. Design and maintain consistent multimodal data pipelines, handling time alignment, normalization, and state coherence as data flows from vehicle systems into LLM...
Interested in this role?
Click the button below to start your application for MVA Multi-Modality Interaction Developer at Mercedes-Benz.
Apply Now