CASIA OpenIR

浏览/检索结果: 共212条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 2-16
作者:  Ye, Shuo;  Wang, Yu;  Peng, Qinmu;  You, Xinge;  Chen, C. L. Philip
收藏  |  浏览/下载:13/0  |  提交时间:2024/03/26
Fine-grained visual categorization  deep learning  weakly supervised learning  
Involving Distinguished Temporal Graph Convolutional Networks for Skeleton-Based Temporal Action Segmentation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 647-660
作者:  Li, Yun-Heng;  Liu, Kai-Yuan;  Liu, Sheng-Lan;  Feng, Lin;  Qiao, Hong
收藏  |  浏览/下载:6/0  |  提交时间:2024/03/26
Feature extraction  Motion segmentation  Correlation  Convolution  Topology  Convolutional neural networks  Solid modeling  Skeleton-based temporal action segmentation  enhanced spatial graph structure  segmented encoding  
Learning Gait Representation From Massive Unlabelled Walking Videos: A Benchmark 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14920-14937
作者:  Fan, Chao;  Hou, Saihui;  Wang, Jilong;  Huang, Yongzhen;  Yu, Shiqi
收藏  |  浏览/下载:11/0  |  提交时间:2024/03/26
Gait recognition  self-supervised  contrastive learning  GaitSSB  GaitLU-1M  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:29/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Enhancing Dimensional Emotion Recognition from Speech through Modulation-Filtered Cochleagram and Parallel Attention Recurrent Network 期刊论文
ELECTRONICS, 2023, 卷号: 12, 期号: 22, 页码: 15
作者:  Peng, Zhichao;  Zeng, Hua;  Li, Yongwei;  Du, Yegang;  Dang, Jianwu
收藏  |  浏览/下载:33/0  |  提交时间:2024/02/22
modulation-filtered cochleagram  parallel attention recurrent neural network  dimensional emotion recognition  auditory signal processing  noise-robust  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:78/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Face Forgery Detection by 3D Decomposition and Composition Search 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8342-8357
作者:  Zhu, Xiangyu;  Fei, Hongyan;  Zhang, Bin;  Zhang, Tianshuo;  Zhang, Xiaoyu;  Li, Stan Z.;  Lei, Zhen
收藏  |  浏览/下载:108/0  |  提交时间:2023/11/17
Faces  Forgery  Three-dimensional displays  Face recognition  Feature extraction  Lighting  Computer architecture  Composition search  differentiable search  fake face  forgery detection  3D decomposition  3D face model  
Towards Fine-Grained Optimal 3D Face Dense Registration: An Iterative Dividing and Diffusing Method 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 21
作者:  Fan, Zhenfeng;  Peng, Silong;  Xia, Shihong
收藏  |  浏览/下载:98/0  |  提交时间:2023/11/17
3D face  Dense correspondence  Non-rigid registration  3D morphable model  
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:172/55  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:151/41  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation