CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Zero-Shot Predicate Prediction for Scene Graph Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 3140-3153
作者:  Li, Yiming;  Yang, Xiaoshan;  Huang, Xuhui;  Ma, Zhe;  Xu, Changsheng
收藏  |  浏览/下载:134/0  |  提交时间:2023/11/17
Deep learning  zero-shot  scene graph  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:118/10  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2580-2594
作者:  Yun, Xiao-Long;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
收藏  |  浏览/下载:241/0  |  提交时间:2022/07/25
Handwriting recognition  Task analysis  Grammar  Semantics  Image segmentation  Trajectory  Text recognition  Online handwritten diagram recognition  symbol segmentation  symbol recognition  freehand sketch analysis  graph neural networks  
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:  Zheng, Aihua;  Hu, Menglan;  Jiang, Bo;  Huang, Yan;  Yan, Yan;  Luo, Bin
收藏  |  浏览/下载:236/0  |  提交时间:2022/03/17
Visualization  Task analysis  Measurement  Speech recognition  Videos  Location awareness  Image recognition  Adversarial learning  audio-visual matching  cross-modal learning  metric learning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:315/62  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Adaptive Deep Metric Learning for Affective Image Retrieval and Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1640-1653
作者:  Yao, Xingxu;  She, Dongyu;  Zhang, Haiwei;  Yang, Jufeng;  Cheng, Ming-Ming;  Wang, Liang
收藏  |  浏览/下载:215/0  |  提交时间:2021/08/15
Measurement  Visualization  Semantics  Feature extraction  Task analysis  Image analysis  Image retrieval  Affective image retrieval  convolutional neural network  deep metric learning  visual sentiment analysis  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:208/32  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 2, 页码: 380-393
作者:  Zhang, Shifeng;  Xie, Yiliang;  Wan, Jun;  Xia, Hansheng;  Li, Stan Z.;  Guo, Guodong
浏览  |  Adobe PDF(6651Kb)  |  收藏  |  浏览/下载:321/54  |  提交时间:2020/04/07
Benchmark testing  Detectors  Training  Urban areas  Cameras  Task analysis  Deep learning  Pedestrian detection  dataset  rich diversity  high density  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 9, 页码: 2419-2431
作者:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  收藏  |  浏览/下载:342/42  |  提交时间:2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition  
Three-Dimensional Attention-Based Deep Ranking Model for Video Highlight Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 10, 页码: 2693-2705
作者:  Jiao,Yifan;  Li,Zhetao;  Huang,Shucheng;  Yang,Xiaoshan;  Liu,Bin;  Zhang,Tianzhu
浏览  |  Adobe PDF(4692Kb)  |  收藏  |  浏览/下载:519/185  |  提交时间:2018/10/10
Video Highlight Detection  Attention Model  Deep Ranking