| GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding Conference paper, Athens, Greece, 2024-09 Authors: He-Sen Dai; Xiao-Hui Li; Fei Yin; Xudong Yan; Shuqi Mei; Cheng-Lin Liu Adobe PDF(967Kb)  |  Views/Downloads: 18/5  |  Submitted: 2024/06/05 Keywords: Visual information extraction; Self-supervised pre-training; Multi-level page layouts |
| MULTIMODAL CROSS- AND SELF-ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION Conference paper, Toronto, Canada, 6-12 June 2021 Authors: Licai Sun; Bin Liu; Jianhua Tao; Zheng Lian Adobe PDF(1078Kb)  |  Views/Downloads: 10/3  |  Submitted: 2024/06/03 |
| Coarse-to-Fine Recurrently Aligned Transformer with Balance Tokens for Video Moment Retrieval and Highlight Detection Conference paper, Yokohama, Japan, 2024-6 Authors: Pan Yi; Zhang Yujia; Chang Hui; Shiying Sun; Zhou Feihu; Zhao Xiaoguang Adobe PDF(1027Kb)  |  Views/Downloads: 25/9  |  Submitted: 2024/05/31 |
| MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition Conference paper, Ottawa, ON, Canada, October 29-November 3, 2023 Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao Adobe PDF(1960Kb)  |  Views/Downloads: 18/4  |  Submitted: 2024/05/31 |
| Neighbor-view Enhanced Model for Vision and Language Navigation Conference paper, Proceedings of the ACM International Conference on Multimedia, Chengdu, China, 2021-10-20 Authors: Dong An; Yuankai Qi; Yan Huang; Qi Wu; Liang Wang; Tieniu Tan Adobe PDF(2412Kb)  |  Views/Downloads: 9/3  |  Submitted: 2024/05/28 |
| Second-Order Global Attention Networks for Graph Classification and Regression Conference paper, Beijing, China, August 27-28, 2022 Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  Views/Downloads: 208/70  |  Submitted: 2023/07/06 |
| PAN: Prototype-based Adaptive Network for Robust Cross-Modal Retrieval Conference paper, Virtual Event, 2021.7.11 Authors: Zhixiong Zeng; Shuai Wang; Nan Xu; Wenji Mao Adobe PDF(1417Kb)  |  Views/Downloads: 142/33  |  Submitted: 2023/06/14 |
| Event-Driven Network for Cross-Modal Retrieval Conference paper, Virtual Event, 2020.10.19 Authors: Zhixiong Zeng; Nan Xu; Wenji Mao Adobe PDF(2431Kb)  |  Views/Downloads: 134/57  |  Submitted: 2023/06/13 |
| VigilanceNet: Decouple Intra- and Inter-Modality Learning for Multimodal Vigilance Estimation in RSVP-Based BCI Conference paper, Lisboa, Portugal, 2022-10-10 Authors: Cheng XY(程昕钰); Wei W(魏玮); Du CD(杜长德); Qiu S(邱爽); Tian SL(田三力); Ma XJ(马小军); He HG(何晖光) Adobe PDF(2488Kb)  |  Views/Downloads: 208/53  |  Submitted: 2023/05/31 Keywords: Vigilance Estimation; Multimodal; EEG; EOG; Rapid Serial Visual Presentation |
| A Prototype-Based Generalized Zero-Shot Learning Framework for Hand Gesture Recognition Conference paper, Online, 2021-1 Authors: Jinting Wu; Yujia Zhang; Xiaoguang Zhao Adobe PDF(2914Kb)  |  Views/Downloads: 190/48  |  Submitted: 2022/09/02 |