Abstract: Conversational emotion recognition (CER) is an important research topic in human-computer interactions. Although recent advancements in transformer-based cross-modal fusion methods have ...
@inproceedings{Tang2024DRMF, title={DRMF: Degradation-Robust Multi-Modal Image Fusion via Composable Diffusion Prior}, author={Tang, Linfeng and Deng, Yuxin and Yi, Xunpeng and Yan, Qinglong and Yuan, ...
Multimodal Large Language Models (MLLMs) have attracted much attention for their multifunctionality. However, traditional Transformer architectures incur significant overhead due to their secondary ...
Once data is loaded into Excel, Copilot allows users to ask questions in natural language instead of building new formulas.
Abstract: Automatic speech recognition (ASR) has been significantly improved in the past years. However, most robust ASR systems are based on air-conducted (AC) speech, and their performances in low ...