Publications

(2022). X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation. European Conference on Computer Vision (ECCV).

(2022). SketchSampler: Sketch-based 3D Reconstruction via View-dependent Depth Sampling. European Conference on Computer Vision (ECCV).

(2022). Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation. European Conference on Computer Vision (ECCV).

(2022). 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Oral Presentation.

(2022). DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer. AAAI Conference on Artificial Intelligence (AAAI).

(2022). VPU: A Video-Based Point Cloud Upsampling Framework. IEEE Transactions on Image Processing (IEEE T-IP).

(2021). VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds. ACM International Conference on Multimedia (ACM MM), Oral Presentation.

(2021). StyleFormer: Real-time Arbitrary Style Transfer via Parametric Style Composition. IEEE/CVF International Conference on Computer Vision (ICCV).

(2021). 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds. IEEE/CVF International Conference on Computer Vision (ICCV).

(2021). Sequential Point Cloud Upsampling by Exploiting Multi-Scale Temporal Dependency. IEEE Transactions on Circuits and Systems for Video Technology (IEEE T-CSVT).

(2021). Transformer3D-Det: Improving 3D Object Detection by Vote Refinement. IEEE Transactions on Circuits and Systems for Video Technology (IEEE T-CSVT).

(2021). ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Oral Presentation.

(2021). Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2021). PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization. IEEE Transactions on Image Processing (IEEE T-IP).

(2021). Motion Compensated Virtual View Synthesis Using Novel Particle Cell. IEEE Transactions on Multimedia (IEEE T-MM).

(2021). IncreACO: Incrementally Learned Automatic Check-out with Photorealistic Exemplar Augmentation. IEEE Winter Conference on Applications of Computer Vision (WACV).

(2020). High-Quality Video Generation from Static Structural Annotations. International Journal of Computer Vision (IJCV).

(2020). Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues. European Conference on Computer Vision (ECCV).

(2020). Powering One-Shot Topological NAS with Stabilized Share-Parameter Proxy. European Conference on Computer Vision (ECCV).

(2020). Morphing and Sampling Network for Dense Point Cloud Completion. AAAI Conference on Artificial Intelligence (AAAI).

(2019). Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM. IEEE/CVF International Conference on Computer Vision (ICCV).

(2019). Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization. IEEE/CVF International Conference on Computer Vision (ICCV).

(2019). CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval. IEEE/CVF International Conference on Computer Vision (ICCV).

(2019). Bags of tricks for learning depth and camera motion from monocular videos. Virtual Reality & Intelligent Hardware (VRIH).

(2019). Cascaded regression using landmark displacement for 3D face reconstruction. Pattern Recognition Letters (PRL).

(2019). Video Generation From Single Semantic Label Map. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2019). Semantics Disentangling for Text-To-Image Generation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Oral Presentation.

(2019). GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2019). Context and Attribute Grounded Dense Captioning. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2019). Visibility Constrained Generative Model for Depth-Based 3D Facial Pose Tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE T-PAMI).

(2018). Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection. ACM International Conference on Multimedia (ACM MM).

(2018). Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition. European Conference on Computer Vision (ECCV).

(2018). Spatio-Temporal Disocclusion Filling Using Novel Sprite Cells. IEEE Transactions on Multimedia (IEEE T-MM).

(2018). Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2018). Exploring Disentangled Feature Representation Beyond Face Identification. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2018). Avatar-Net: Multi-scale Zero-Shot Style Transfer by Feature Decoration. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

(2017). HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis. IEEE International Conference on Computer Vision (ICCV).

(2017). A Generative Model for Depth-Based Robust 3D Facial Pose Tracking. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

(2016). Real-Time Head Pose Tracking with Online Face Template Reconstruction. IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE T-PAMI).

(2015). Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure. IEEE Transactions on Image Processing (IEEE T-IP).

(2015). A disocclusion filling method using multiple sprites with depth for virtual view synthesis. IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

(2015). Accelerating the Distribution Estimation for the Weighted Median/Mode Filters. Asian Conference on Computer Vision (ACCV).

(2014). Temporal depth video enhancement based on intrinsic static structure. IEEE International Conference on Image Processing (ICIP), Oral Presentation.

(2014). Screen-camera calibration using a thread. IEEE International Conference on Image Processing (ICIP).

(2013). Depth enhancement based on hybrid geometric hole filling strategy. IEEE International Conference on Image Processing (ICIP).

(2013). A Head Pose Tracking System Using RGB-D Camera. International Conference on Computer Vision Systems (ICVS).
