CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Automatic Recognition of Concealed Fish Bones under Laryngoscopy: A Practical AI Model Based on YOLO-V5 期刊论文
LARYNGOSCOPE, 2023, 页码: 8
作者:  Tao, Xiaoyao;  Zhao, Xu;  Liu, Hairui;  Wang, Jinqiao;  Tian, Chunhui;  Liu, Longsheng;  Ding, Yujie;  Chen, Xue;  Liu, Yehai
收藏  |  浏览/下载:39/0  |  提交时间:2024/02/22
deep learning  YOLO  fish bones  laryngoscopy  computer vision  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:151/20  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:147/29  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Rice Yield Prediction and Model Interpretation Based on Satellite and Climatic Indicators Using a Transformer Method 期刊论文
REMOTE SENSING, 2022, 卷号: 14, 期号: 19, 页码: 21
作者:  Liu, Yuanyuan;  Wang, Shaoqiang;  Chen, Jinghua;  Chen, Bin;  Wang, Xiaobo;  Hao, Dongze;  Sun, Leigang
收藏  |  浏览/下载:195/0  |  提交时间:2022/11/14
crop yield prediction  remote sensing  deep learning  feature importance  attention  
Global-Guided Selective Context Network for Scene Parsing 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 4, 页码: 1752-1764
作者:  Jiang, Jie;  Liu, Jing;  Fu, Jun;  Zhu, Xinxin;  Li, Zechao;  Lu, Hanqing
收藏  |  浏览/下载:261/0  |  提交时间:2022/06/10
Semantics  Task analysis  Decoding  Logic gates  Image color analysis  Fuses  Feature extraction  Attention mechanism (AM)  contextual selection  global guidance (GG)  scene parsing  
MSCap: Multi-Style Image Captioning with Unpaired Stylized Text 会议论文
, 美国长滩, 2019.06.16
作者:  Longteng, Guo;  Jing, Liu;  Peng, Yao;  Jiangwei, Li;  Hanqing, Lu
Adobe PDF(914Kb)  |  收藏  |  浏览/下载:125/25  |  提交时间:2021/06/25
Normalized and Geometry-Aware Self-Attention Network for Image Captioning 会议论文
, 线上, 2020.06.14
作者:  Guo LT(郭龙腾);  Liu J(刘静);  Zhu XX(朱欣鑫);  Yao P(姚鹏);  Lu SC(卢诗晨);  Lu HQ(卢汉清)
Adobe PDF(574Kb)  |  收藏  |  浏览/下载:324/79  |  提交时间:2021/06/25
Image captioning  Self-attention  
Dynamic Warping Network for Semantic Video Segmentation 期刊论文
COMPLEXITY, 2021, 卷号: 2021, 页码: 10
作者:  Li, Jiangyun;  Zhao, Yikai;  He, Xingjian;  Zhu, Xinxin;  Liu, Jing
收藏  |  浏览/下载:280/0  |  提交时间:2021/04/21
Efficient Face Alignment with Fast Normalization and Contour Fitting Loss 期刊论文
ACM Transactions on Multimedia Computing, Communications, and Applications, 2019, 期号: 3, 页码: 16
作者:  Liu, Zhiwei;  Zhu, Xiangyu;  Tang, Ming;  Lei, Zhen;  Wang, Jinqiao
浏览  |  Adobe PDF(1359Kb)  |  收藏  |  浏览/下载:242/64  |  提交时间:2020/09/10
Face alignment, convolutional neural networks, real-time, semantic meaning  
A Novel Data Augmentation Scheme for Pedestrian Detection with Attribute Preserving GAN 期刊论文
Neurocomputing, 2020, 卷号: 401, 期号: 11, 页码: 123-132
作者:  Liu, Songyan;  Guo, Haiyun;  Hu, Jian-Guo;  Zhao, Xu;  Zhao, Chaoyang;  Wang, Tong;  Zhu, Yousong;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(2691Kb)  |  收藏  |  浏览/下载:392/85  |  提交时间:2020/06/10
Generative Adversarial Networks  Pedestrian detection  Data augmentation