2020
Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification [Code] [Slides] [Supp]
Y. Zou, X. Yang, Z. Yu, V. Kumar, J. Kautz
European Conference on Computer Vision (ECCV), 2020 (Oral)
Contrastive Learning for Weakly Supervised Phrase Grounding [Code] [Project] [Video]
T. Gupta, A. Vahdat, G. Checkik, X. Yang, J. Kautz, D. Hoiem
European Conference on Computer Vision (ECCV), 2020 (Spotlight)
UFO2: A Unified Framework towards Omni-supervised Object Detection [Code] [Project] [Supp]
Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, A. Schwing, J. Kautz
European Conference on Computer Vision (ECCV), 2020
Simulating Content Consistent Vehicle Datasets with Attribute Descent [Code] [Demo]
Y. Yao, L. Zhang, X. Yang, M. Naphade, T. Gedeon
European Conference on Computer Vision (ECCV), 2020
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection [Code] [Project]
Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, Y. J. Lee, A. Schwing, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
2019
Dancing to Music [Code] [Data] [Blog] [Supp]
H.-Y. Lee, X. Yang, M.-Y. Liu, T.-C. Wang, Y.-D. Lu, M.-H. Yang, J. Kautz
Advances in Neural Information Processing Systems (NeurIPS), 2019
Joint Discriminative and Generative Learning for Person Re-Identification [Code] [Video] [Supp]
Z. Zheng, X. Yang, Z. Yu, L. Zheng, Y. Yang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)
STEP: Spatio-Temporal Progressive Learning for Video Action Detection [Code] [Supp]
X. Yang, X. Yang, M.-Y. Liu, F. Xiao, L. Davis, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)
CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification [Supp] [Data]
Z. Tang, M. Naphade, M.-Y. Liu, X. Yang, S. Birchfield, S. Wang, R. Kumar, D. Anastasiu, J.-N. Hwang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)
A Delay Metric for Video Object Detection: What Average Precision Fails to Tell [Code]
H. Mao, X. Yang, W. Dally
IEEE International Conference in Computer Vision (ICCV), 2019
PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Randomized Synthetic Data [Code]
Z. Tang, M. Naphade, S. Birchfield, J. Tremblay, W. Hodge, R. Kumar, S. Wang, X. Yang
IEEE International Conference in Computer Vision (ICCV), 2019
Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation [Code]
D. Sun, X. Yang, M.-Y. Liu, J. Kautz
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Discovering Spatio-Temporal Action Tubes
Y. Ye, X. Yang, Y. Tian
Journal of Visual Communication and Image Representation (JVCI), 2019
2018
Making Convolutional Networks Recurrent for Visual Sequence Learning [Supp]
X. Yang, P. Molchanov, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
PWC-Net: CNNs for Optical Flow using Pyramid, Warping and Cost Volume [Code] [Supp] [Project]
D. Sun, X. Yang, M.-Y. Liu, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 (Oral)
MoCoGAN: Decomposing Motion and Content for Video Generation [Code]
S. Tulyakov, M.-Y. Liu, X. Yang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Budget-Aware Activity Detection with A Recurrent Policy Network [Supp]
B. Mahasseni, X. Yang, P. Molchanov, J. Kautz
British Machine Vision Conference (BMVC), 2018 (Oral)
Video You Only Look Once: Overall Temporal Convolutions for Action Recognition
L. Jing, X. Yang, Y. Tian
Journal of Visual Communication and Image Representation (JVCI), 2018
2017
Super Normal Vector for Human Activity Recognition with Depth Cameras [Code]
X. Yang, Y. Tian
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017
Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network [Blog] [Project]
J. Gu, X. Yang, S. De Mello, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Evaluation of Low-Level Features for Real-World Surveillance Event Detection [Code]
Y. Xian, X. Rong, X. Yang, Y. Tian
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2017
3D Convolutional Neural Network with Multi-Model Framework for Action Recognition
L. Jing, Y. Ye, X. Yang, Y. Tian
IEEE International Conference on Image Processing (ICIP), 2017 (Oral)
2016
Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification
X. Yang, P. Molchanov, J. Kautz
ACM Multimedia, 2016 (Oral)
Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks [Data] [Project]
P. Molchanov, X. Yang, S. Gupta, K. Kim, S. Tyree, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
Region Trajectories for Video Semantic Concept Detection
Y. Ye, X. Rong, X. Yang, Y. Tian
ACM International Conference on Multimedia Retrieval (ICMR), 2016
Towards Selecting Robust Hand Gestures for Automotive Interfaces
S. Gupta, P. Molchanov, X. Yang, K. Kim, S. Tyree, J. Kautz
IEEE Intelligent Vehicles Symposium (IV), 2016
2015
Feature Representations for Human Activity Recognition in Color and Depth Sequences
X. Yang
Ph.D. Dissertation, 2015
Discriminative Hierarchical K-Means Tree for Large-Scale Image Classification
S. Chen, X. Yang, Y. Tian
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2015
Exploring Pooling Strategies based on Idiosyncrasies of Spatio-Temporal Interest Points
Y. Ye, X. Yang, Y. Tian
ACM International Conference on Multimedia Retrieval (ICMR), 2015
Hybrid Example based Single Image Super Resolution
Y. Xian, X. Yang, Y. Tian
International Symposium on Visual Computing (ISVC), 2015 (Oral)
CCNY at TRECVID 2015: Video Semantic Concept Localization
Y. Ye, X. Rong, X. Yang, Y. Tian
NIST TREC Video Retrieval Evaluation (TRECVID), 2015
2014
Super Normal Vector for Activity Recognition Using Depth Sequences [Code]
X. Yang, Y. Tian
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
Action Recognition Using Super Sparse Coding Vector with Spatio-Temporal Awareness [Code]
X. Yang, Y. Tian
European Conference on Computer Vision (ECCV), 2014
Effective 3D Action Recognition Using EigenJoints
X. Yang, Y. Tian
Journal of Visual Communication and Image Representation (JVCI), 2014 (Best Paper Award Runner-Up)
Scene Text Recognition in Multiple Frames based on Text Tracking
X. Rong, C. Yi, X. Yang, Y. Tian
IEEE International Conference on Multimedia & Expo (ICME), 2014
Assistive Clothing Pattern Recognition for Visually Impaired People
X. Yang, S. Yuan, Y. Tian
IEEE Transactions on Human-Machine Systems (THMS), 2014
CCNY at TRECVID 2014: Surveillance Event Detection
Y. Xian, X. Rong, X. Yang, Y. Tian
NIST TREC Video Retrieval Evaluation (TRECVID), 2014
2013
Histogram of 3D Facets: A Characteristic Descriptor for Hand Gesture Recognition
C. Zhang, X. Yang, Y. Tian
IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2013 (Oral)
Feature Representations for Scene Text Character Recognition: A Comparative Study
C. Yi, X. Yang, Y. Tian
International Conference on Document Analysis and Recognition (ICDAR), 2013
AT&T Research at TRECVID 2013: Surveillance Event Detection
X. Yang, Z. Liu, E. Zavesky, D. Gibbon, B. Shahraray, Y. Tian
NIST TREC Video Retrieval Evaluation (TRECVID), 2013
Texture Representations Using Subspace Embeddings
X. Yang, Y. Tian
Pattern Recognition Letters (PRL), 2013
Toward A Computer Vision based Wayfinding Aid for Blind Persons to Access Unfamiliar Indoor Environments
Y. Tian, X. Yang, C. Yi, A. Arditi
Machine Vision and Applications (MVA), 2013
2012
Recognizing Actions Using Depth Motion Maps based Histograms of Oriented Gradients [Code]
X. Yang, C. Zhang, Y. Tian
ACM Multimedia, 2012
EigenJoints based Action Recognition Using Naive-Bayes-Nearest-Neighbor
X. Yang, Y. Tian
IEEE CVPR Workshop on Human Activity Understanding from 3D Data, 2012
MediaCCNY at TRECVID 2012: Surveillance Event Detection [Code] [Data]
X. Yang, C. Yi, L. Cao, Y. Tian
NIST TREC Video Retrieval Evaluation (TRECVID), 2012
Robust and Effective Component based Banknote Recognition for the Blind
F. Zaman, X. Yang, Y. Tian
IEEE Transactions on System, Man, and Cybernetics (TSMC) Part C, 2012
2011
Recognizing Clothes Patterns for Blind People by Confidence Margin based Feature Combination [Data]
X. Yang, S. Yuan, Y. Tian
ACM Multimedia, 2011
Robust and Effective Component based Banknote Recognition by SURF Features
F. Zaman, X. Yang, Y. Tian
IEEE Wireless and Optical Communication Conference (WOCC), 2011
2010
Context based Indoor Object Detection as an Aid to Blind Persons Accessing Unfamiliar Environments [Data]
X. Yang, Y. Tian, C. Yi, and A. Arditi
ACM Multimedia, 2010
Robust Door Detection in Unfamiliar Environments by Combining Edge and Corner Features
X. Yang, Y. Tian
IEEE CVPR Workshop on Computer Vision Applications for Visually Impaired, 2010
Patents
Dynamic Hand Gesture based Region of Interest Localization
US9354711 Issued on 2016/03/31
Hand Gesture based Region of Interest Localization
US9778750 Issued on 2017/10/03
Online Detection and Classification of Dynamic Gestures with Recurrent Convolutional Neural Networks
US10157309 Issued on 2018/12/18
Systems and Methods for Dynamic Facial Analysis Using A Recurrent Neural Network
US10373332 Issued on 2019/08/06
Fusing Multilayer and Multimodal Deep Neural Networks for Video Classification
US10402697 Issued on 2019/09/03
System and Method for Optical Flow Estimation
US10424069 Issued on 2019/09/24
System and Method for Optical Flow Estimation
US10467763 Issued on 2019/11/05
System and Method for Content and Motion Controlled Action Video Generation
US10595039 Issued on 2020/03/17
Budget-Aware Method for Detecting Activity in Video
US10860859 Issued on 2020/12/08
Iterative Spatio-Temporal Action Detection in Video
US11017556 Issued on 2021/05/25
Transforming Convolutional Neural Networks for Visual Sequence Learning
US11049018 Issued on 2021/06/29
Cross-Domain Image Processing for Object Re-Identification
US11367268 Issued on 2022/06/21
Image Identification Using Neural Networks
US Patent App. 16/357047, 2019
Weakly-Supervised Object Detection Using One or More Neural Networks
US Patent App. 16/443346, 2019
Neural Architecture for Self-Supervised Event Learning and Anomaly Detection
US Patent App. 16/453913, 2019
Self-Supervised Hierarchical Motion Learning for Video Action Recognition
US Patent App. 16/998914, 2020
Joint Representation Learning from Images and Text
US Patent App. 17/000048, 2020
Method and Apparatus for Generating Interactive Scenario, and Electronic Device
US Patent App. 17/032726, 2020
Method and System for Self-Supervised Learning of Pillar Motion for Autonomous Driving
US Patent App. 17/231271, 2022
System and Method for 3D Multi-Object Tracking in LiDAR Point Clouds
US Patent App. 17/395626, 2023