Publications

2025

 

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model
Xun Liang, Simin Niu, Zhiyu Li, Sensen Zhang, Hanyu Wang, Feiyu Xiong, Zhaoxin Fan, Bo Tang, Jihao Zhao, Jiawei Yang, Shichao Song, Mengwei Wang
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
[Paper] [Code]

 

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System
Jihao Zhao, Zhiyuan Ji, Zhaoxin Fan, Hanyu Wang, Simin Niu, Bo Tang, Feiyu Xiong, Zhiyu Li
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
[Paper] [Code]

 

EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers
Daiheng Gao, Shilin Lu, Wenbo Zhou, Jiaming Chu, Jie Zhang, Mengxi Jia, Bang Zhang, Zhaoxin Fan (corresponding author), Weiming Zhang
Forty-second International Conference on Machine Learning (ICML), 2025.
[Paper] [Code]

 

GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer
Yihong Lin, Zhaoxin Fan (Equal Contribution), Xianjia Wu, Lingyu Xiong, Liang Peng, Xiandong Li, Wenxiong Kang, Songju Lei, Huang Xu
34th International Joint Conference on Artificial Intelligence (IJCAI), 2025.
[Paper] [Code]

 

Meta-Learning Empowered Meta-Face: Personalized Speaking Style Adaptation for Audio-Driven 3D Talking Face Animation
Xukun Zhou, Fengxin Li, Ziqiao Peng, Xinyu Wang, Hongyan Liu, Zhaoxin Fan (corresponding author), Jun He
IEEE International Conference on Multimedia and Expo (ICME), 2025.
[Paper] [Code]

 

Twin Progressive Generative Adversarial Network For High-Resolution Image Inpainting
Zhiying Li, Weibin Chen, Zhaoxin Fan, Kaichuan Kong, Xiaobo Jin, Guanggang Geng
IEEE International Conference on Multimedia and Expo (ICME), 2025.
[Paper] [Code]

 

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations
Ziqiao Peng, Yanbo Fan, Haoyu Wu, Xuan Wang, Hongyan Liu, Jun He, Zhaoxin Fan (corresponding author)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[Paper] [Code]

 

MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement Training Smoothing
Shuo Wang, Wanting Li, Yongcai Wang, Zhaoxin Fan (corresponding author), Zhe Huang, Xudong Cai, Jian Zhao, Deying Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[Paper] [Code]

 

JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems
Yifan Wang, Jian Zhao, Zhaoxin Fan (corresponding author), Xin Zhang, Xuecheng Wu, Yudian Zhang, Lei Jin, Xinyue Li, Gang Wang, Mengxi Jia, Ping Hu, Zheng Zhu, Xuelong Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[Paper] [Code]

 

VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS
Ming Meng, Ke Mu, Yonggui Zhu, Zhe Zhu, Haoyu Sun, Heyang Yan, Zhaoxin Fan (corresponding author)
Computational Visual Media (CVMJ), 2025.
[Paper] [Code]

 

Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen, Xiang Li, Xiaojun Ye, Zhaoxin Fan (corresponding author), Hao Zhao
The 31st International Conference on Computational Linguistics (COLING), 2025.
[Paper] [Code]

2024

 

MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling
Jian Yang, Jiakun Li, Guoming Li, Zhen Shen, Huai-Yu Wu, Zhaoxin Fan (corresponding author)
European Conference on Computer Vision (ECCV), 2024.
[Paper] [Code]

 

Human Pose Driven Object Effects Recommendation
Zhaoxin Fan, Fengxin Li, Hongyan Liu, Jun He, and Xiaoyong Du
ACM International Conference on Multimedia Retrieval (ICMR), 2024.
[Paper] [Code]

 

ACR-Pose: Adversarial Canonical Representation Reconstruction Network for Category Level 6D Object Pose Estimation
Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, and Jun He
ACM International Conference on Multimedia Retrieval (ICMR), 2024.
[Paper]

 

STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation
Xukun Zhou, Zhenbo Song, Jun He, Hongyan Liu, Zhaoxin Fan (corresponding author)
ACM International Conference on Multimedia Retrieval (ICMR), 2024.
[Paper] [Code]

 

CoDancers: Music-Driven Coherent Group Dance Generation with Choreographic Unit
Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan (corresponding author)
ACM International Conference on Multimedia Retrieval (ICMR), 2024.
[Paper] [Code]

 

BeatDance: A Beat-Based Model-Agnostic Contrastive Learning Framework for Music-Dance Retrieval
Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan (corresponding author)
ACM International Conference on Multimedia Retrieval (ICMR), 2024.
[Paper] [Code]

 

SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Ziqiao Peng, Wentao Hu, Yue Shi, Xiangyu Zhu, Xiaomei Zhang, Hao Zhao, Jun He, Hongyan Liu, Zhaoxin Fan (corresponding author)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[Paper] [Code]

 

MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3D Object Detection
Han Sun, Zhaoxin Fan (equal contribution), Zhenbo Song, Zhicheng Wang, Kejian Wu, and Jianfeng Lu
IEEE Transactions on Instrumentation & Measurement (TIM), 2024.
[Paper] [Code]

 

Everything2Motion: Synchronizing Diverse Inputs via a Unified Framework for Human Motion Synthesis
Zhaoxin Fan, Longbin Li, Pengxin Xu, Fan Shen, Kai Chen
Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024.
[Paper]

 

FuRPE: Learning Full-body Reconstruction from Part Experts
Zhaoxin Fan, Yuqing Pan, Hao Xu, Zhenbo Song, Zhicheng Wang, Kejian Wu, Hongyan Liu, and Jun He
Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI) Workshop, 2024.
[Paper] [Code]

 

Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation
Yixing Lu, Zhaoxin Fan (equal contribution), Min Xu
International Conference on Multimedia Modeling (MMM), 2024.
[Paper]

 

A Novel Transformer Autoencoder for Multi-modal Emotion Recognition with Incomplete Data
Cheng Cheng, Wenzhe Liu, Zhaoxin Fan, Lin Feng, Ziyu Jia
Neural Networks, 2024.
[Paper]

2023

 

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
Ziqiao Peng, Haoyu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan (corresponding author)
International Conference on Computer Vision (ICCV), 2023.
[Paper] [Code]

 

D-IF: Uncertainty-aware Human Digitization via Implicit Distribution Field
Xueting Yang, Yihao Luo, Yuliang Xiu, Wei Wang, Hao Xu, Zhaoxin Fan (corresponding author)
International Conference on Computer Vision (ICCV), 2023.
[Paper] [Code]

 

SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces
Ziqiao Peng, Yihao Luo, Yue Shi, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan (corresponding author)
ACM International Conference on Multimedia (ACM MM), 2023.
[Paper] [Code]

 

Deep Semantic-aware Remote Sensing Image Deblurring
Zhenbo Song, Zhenyuan Zhang, Feiyi Fang, Zhaoxin Fan, Jianfeng Lu
Signal Processing, 2023.
[Paper]

 

Reconstruction-Aware Prior Distillation for Semi-supervised Point Cloud Completion
Zhaoxin Fan, Yulin He, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
International Joint Conference on Artificial Intelligence (IJCAI), 2023.
[Paper]

 

Robust Single Image Reflection Removal Against Adversarial Attacks
Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Wenqi Ren, Jianfeng Lu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
[Paper]

 

GIDP: Learning a Good Initialization and Inducing Descriptor Post-enhancing for Large-scale Place Recognition
Zhaoxin Fan, Zhenbo Song, Hongyan Liu, Jun He
International Conference on Robotics and Automation (ICRA), 2023.
[Paper]

2022

 

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image
Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
European Conference on Computer Vision (ECCV), 2022.
[Paper] [Code]

 

RPR-Net: A Point Cloud-based Rotation-Aware Large Scale Place Recognition Network
Zhaoxin Fan, Zhenbo Song, Wenping Zhang, Hongyan Liu, Jun He, Xiaoyong Du
European Conference on Computer Vision Workshop (ECCV Workshop), 2022.
[Paper]

 

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Zhaoxin Fan, Zhenbo Song, Zhiwu Lu, Hongyan Liu, Jun He, Xiaoyong Du
AAAI Conference on Artificial Intelligence (AAAI), 2022.
[Paper] [Code]

 

Unsupervised Multi-task Learning for 3D Subtomogram Image Alignment, Clustering and Segmentation
Haoyi Zhu, Chuting Wang, Yuanxin Wang, Zhaoxin Fan, Mostofa Rafid Uddin, Xin Gao, Jing Zhang, Xiangrui Zeng, Min Xu
IEEE International Conference on Information Processing (ICIP), 2022.
[Paper]

 

PilotAttnNet: Multi-Modal Attention Network for End-to-End Steering Control
Jincan Zhang, Zhenbo Song, Jianfeng Lu, Xingwei Qu, Zhaoxin Fan
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2022.
[Paper]

 

Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview
Zhaoxin Fan, Yazhi Zhu, Yulin He, Qi Sun, Hongyan Liu, Jun He
ACM Computing Surveys (CSUR), 2022.
[Paper]

2020-2021

 

SRNet: A 3D Scene Recognition Network using Static Graph and Dense Semantic Fusion
Zhaoxin Fan, Hongyan Liu, Jun He, Qi Sun, Xiaoyong Du
Computer Graphics Forum (CGF), 2020.
[Paper]

 

A Graph‐based One‐Shot Learning Method for Point Cloud Recognition
Zhaoxin Fan, Hongyan Liu, Jun He, Qi Sun, Xiaoyong Du
Computer Graphics Forum (CGF), 2020.
[Paper]

 

MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation
Zhaoxin Fan, Hongyan Liu, Jun He, Min Zhang, Xiaoyong Du
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.
[Paper]

 

DAGC: Employing Dual Attention and Graph Convolution for Point Cloud based Place Recognition
Qi Sun, Hongyan Liu, Jun He, Zhaoxin Fan, Xiaoyong Du
ACM International Conference on Multimedia Retrieval (ICMR), 2020.
[Paper]

 

PointFPN: A Frustum-based Feature Pyramid Network for 3D Object Detection
Zhaoxin Fan, Hongyan Liu, Jun He, Siwei Jiang, Xiaoyong Du
International Conference on Tools with Artificial Intelligence (ICTAI), 2020.
[Paper]

Patents