CASIA OpenIR

浏览/检索结果: 共262条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
基于多对多生成对抗网络的非对称跨域迁移行人再识别 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 1, 页码: 103-120
作者:  梁文琦;  王广聪;  赖剑煌
Adobe PDF(20818Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
行人再识别  多对多跨域迁移  非监督迁移学习  生成对抗网络  
从视频到语言:视频标题生成与描述研究综述 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 2, 页码: 375-397
作者:  汤鹏杰;  王瀚漓
Adobe PDF(8546Kb)  |  收藏  |  浏览/下载:3/1  |  提交时间:2024/05/20
视频描述  卷积神经网络  循环神经网络  语段生成  情感表达  逻辑语义  
自适应特征融合的多模态实体对齐研究 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 4, 页码: 758-770
作者:  郭浩;  李欣奕;  唐九阳;  郭延明;  赵翔
Adobe PDF(7063Kb)  |  收藏  |  浏览/下载:12/3  |  提交时间:2024/04/28
多模态知识图谱  实体对齐  预训练模型  特征融合  
CASIA-Iris-Africa: A Large-scale African Iris Image Database 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 383-399
作者:  Jawad Muhammad;  Yunlong Wang;  Junxing Hu;  Kunbo Zhang;  Zhenan Sun
Adobe PDF(8969Kb)  |  收藏  |  浏览/下载:11/0  |  提交时间:2024/04/23
African iris recognition, racial bias, iris image database, biometrics, iris recognition  
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:20/1  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:18/3  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Long-term Visual Tracking: Review and Experimental Comparison 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 512-530
作者:  Chang Liu;  Xiao-Fan Chen;  Chun-Juan Bo;  Dong Wang
Adobe PDF(7769Kb)  |  收藏  |  浏览/下载:10/2  |  提交时间:2024/04/23
Visual object tracking  long-term tracking  short-term tracking  re-detection  online update  
Causal Reasoning Meets Visual Representation Learning: A Prospective Study 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 485-511
作者:  Yang Liu;  Yu-Shen Wei;  Hong Yan;  Guan-Bin Li;  Liang Lin
Adobe PDF(3224Kb)  |  收藏  |  浏览/下载:14/2  |  提交时间:2024/04/23
Causal reasoning  visual representation learning  reliable artificial intelligence  spatial-temporal data  multi-modal analysis