Interview with Lin Dahua, co-founder of SenseTime: Multi-modality is the necessary path to AGI and an essential component.

date
28/07/2025
During the 2025 World Artificial Intelligence Conference, SenseTime Technology released the "XenAI" embodied intelligence platform. It is reported that the platform is based on SenseTime's embodied world model as the core engine, relying on SenseTime's large devices to provide end-side and cloud-side computing power support, and can provide perception, visual navigation, and multimodal interaction capabilities for robots and intelligent devices, promoting intelligent terminals to evolve to a higher level of autonomy and intelligence. In communication with media such as Sina Technology, SenseTime Technology co-founder and chief scientist Lin Dahua stated, "Multimodality is the only way to AGI, it is an indispensable part. SenseTime has been doing computer vision for many years, with good multimodal models, AI technology, and also many collaborations with hardware companies, including our work in intelligent driving, which has also accumulated a lot of model application and control technology systems. This is also the reason why we have put forward the embodied intelligence platform, to enable these capabilities to support ecological and intelligent development in a platform-based manner."