Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...
🤔 Can autoregressive visual generation supervision improve VLMs' understanding capability? 🚀 Reconstructing the visual semantics of images leads to better visual comprehension. Abstract. Typical ...
ABSTRACT: As morphemes are the smallest phonetic and semantic word formation units in Chinese, the study of morphemes has always been an important part of Chinese language acquisition research. Taking ...
This article describes a combined visual and haptic localization experiment that addresses the area of multimodal cueing. The aim of the present investigation was to characterize two-dimensional (2D) ...
Abstract: The task of Audio-Visual Event Localization (AVEL) aims to detect and locate the temporal segments of events through the interaction of audio and visual modalities. However, existing methods ...
The manuscript presents a short report investigating mismatch responses in the auditory cortex, following previous studies focused on visual cortex. By correlating mouse locomotion speed with acoustic ...
The congruency sequence effect (CSE) refers to the reduction in the congruency effect in the current trial after an incongruent trial compared with a congruent trial. Although previous studies widely ...
Abstract: How to effectively interact audio with vision has garnered considerable interest within the multi-modality research field. Recently, a novel audio-visual video segmentation (AVS) task has ...
International Publicity contains a variety of modal symbols including text, pictures and sound, and their meanings are expressive. It is conducive for the Communist Party of China to use international ...
Omni-modal large language models (LLMs) are at the forefront of artificial intelligence research, seeking to unify multiple data modalities such as vision, language, and speech. The primary goal is to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results