CASIA OpenIR

浏览/检索结果: 共62条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Hierarchical Attention Networks for Fact-based Visual Question Answering 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 18
作者:  Yao, Haibo;  Luo, Yongkang;  Zhang, Zhi;  Yang, Jianhang;  Cai, Chengtao
收藏  |  浏览/下载:62/0  |  提交时间:2023/11/17
Fact-based Visual Question Answering  Hierarchical attention networks  Self-attention  Multiple attention interaction  Positional encoding  
Real-time continuous detection and recognition of dynamic hand gestures in untrimmed sequences based on end-to-end architecture with 3D DenseNet and LSTM 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 38
作者:  Lu, Zhi;  Qin, Shiyin;  Lv, Pin;  Sun, Liguo;  Tang, Bo
收藏  |  浏览/下载:42/0  |  提交时间:2023/11/17
Continuous detection  Gesture recognition  Long short-term memory  3D densely connected convolutional networks  Connectionist temporal classification  Canonical correlation analysis  
Self-attention mechanism in person re-identification models 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 页码: 19
作者:  Chen, Wenbai;  Lu, Yue;  Ma, Hang;  Chen, Qili;  Wu, Xibao;  Wu, Peiliang
收藏  |  浏览/下载:144/0  |  提交时间:2021/03/29
Person re-identification  Deep neural network  Self-attention  Computer vision  
A hybrid convolutional architecture for accurate image manipulation localization at the pixel-level 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 期号: 80, 页码: 23377–23392
作者:  Zhang, Yixuan;  Zhang, Jiguang;  Xu, Shibiao
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:263/57  |  提交时间:2021/03/08
Manipulation localization  Top-down detection  Bottom-up segmentation  DenseCRFs  
Cross-domain personalized image captioning 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 卷号: 79, 期号: 45-46, 页码: 33333-33348
作者:  Long, Cuirong;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:160/0  |  提交时间:2021/03/02
Personalization  Image captioning  Domain adaptation  
User behavior fusion in dialog management with multi-modal history cues 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 卷号: 74, 期号: 22, 页码: 10025-10051
作者:  Yang, Minghao;  Tao, Jianhua;  Chao, Linlin;  Li, Hao;  Zhang, Dawei;  Che, Hao;  Gao, Tingli;  Liu, Bin
收藏  |  浏览/下载:72/0  |  提交时间:2020/10/27
Dialog Management (Dm)  Multi-modal Data Fusion  Human Computer Interaction (Hci)  Emotion Detection  
Emotional head motion predicting from prosodic and linguistic features 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 卷号: 75, 期号: 9, 页码: 5125-5146
作者:  Yang, Minghao;  Jiang, Jinlin;  Tao, Jianhua;  Mu, Kaihui;  Li, Hao
收藏  |  浏览/下载:52/0  |  提交时间:2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering  
Detecting Uyghur text in complex background images with convolutional neural network 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 卷号: 76, 期号: 13, 页码: 15083-15103
作者:  Fang, Shancheng;  Xie, Hongtao;  Chen, Zhineng;  Zhu, Shiai;  Gu, Xiaoyan;  Gao, Xingyu
收藏  |  浏览/下载:39/0  |  提交时间:2020/10/27
Uyghur  Text Detection  Text Localization  Convolutional Neural Network  
OS-LFFD: a light and fast face detector with Ommateum structure 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 页码: 20
作者:  Xu, Dezhong;  Wu, Lifang;  He, Yonghao;  Zhao, Qing;  Jian, Meng;  Yan, Junchi;  Zhao, Liang
收藏  |  浏览/下载:166/0  |  提交时间:2020/08/21
Edge devices  Face detector  Effective receptive field  Ommateum block  
A robust real-time facial alignment system with facial landmarks detection and rectification for multimedia applications 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 页码: 23
作者:  Chou, Kuang Pen;  Prasad, Mukesh;  Yang, Jie;  Su, Sheng-Yao;  Tao, Xian;  Saxena, Amit;  Lin, Wen-Chieh;  Lin, Chin-Teng
收藏  |  浏览/下载:206/0  |  提交时间:2020/08/03
Face alignment  Facial feature localization  Head pose estimation  Face recognition