Here are details of some works 📚 Thanks to all the co-authors for our works.



  1. Xing J, Wang M, Mu B, et al. Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition[J]. AAAI 2023. Corresponding Author


  1. Mengmeng Wang, Jiazheng Xing, Jing Su, Jun Chen, Yong Liu*. Learning SpatioTemporal and Motion Features in a Unified 2D Network for Action Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.🎉🎉🎉

  2. Wang, M., Mei, J., Liu, L., Tian, G., Liu, Y., & Pan, Z. (2022). Delving Deeper Into Mask Utilization in Video Object Segmentation. IEEE Transactions on Image Processing (TIP), 31, 6255-6266.

  3. Li, Z., Wang, M., Pi, H., Xu, K., Mei, J., & Liu, Y. (2022, November). E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXV (pp. 267-284). Cham: Springer Nature Switzerland. Equal first contributor.

  4. Xu, C., Zhang, J., Wang, M., Tian, G., & Liu, Y. (2022). Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 32(11), 7809-7820.

  5. Ma, T., Geng, S., Wang, M., Xu, S., Li, H., Zhang, B., ... & Qiao, Y. (2022). Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022.

  6. Yang Y, Wang M, Mei J, et al. Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos[J]. Applied Intelligence, 2022: 1-21.

  7. Lin H, Wang M, Liu Y, et al. Correlation-based and content-enhanced network for video style transfer[J]. Pattern Analysis and Applications, 2022: 1-13.


  1. Wang, Mengmeng, Jiazheng Xing, and Yong Liu. "Actionclip: A new paradigm for video action recognition." arXiv preprint arXiv:2109.08472 (2021).
  2. Deng C, Wang M*, Liu L, et al. Extended feature pyramid network for small object detection[J]. IEEE Transactions on Multimedia (TMM), 2021. corresponding author
  3. Li Z, Wang M, Mei J, et al. Mail: A unified mask-image-language trimodal network for referring image segmentation[J]. arXiv preprint arXiv:2111.10747, 2021. Equal first contributor.
  4. Tian, G., Sun, Y., Liu, Y., Zeng, X., Wang, M., Liu, Y., ... & Chen, J. (2021). Adding before pruning: Sparse filter fusion for deep convolutional neural networks via auxiliary attention. IEEE Transactions on Neural Networks and Learning Systems (TNNLS).
  5. Mei J, Wang M, Lin Y, et al. Transvos: Video object segmentation with transformers[J]. arXiv preprint arXiv:2106.00588, 2021.
  6. Huang, T., Zou, H., Cui, J., Yang, X., Wang, M., Zhao, X., ... & Liu, Y. (2021). RFNet: recurrent forward network for dense point cloud completion. In Proceedings of the IEEE/CVF international conference on computer vision (ICCV) (pp. 12508-12517).
  7. Liu L, Song X, Wang M, et al. Self-supervised monocular depth estimation for all day images using domain separation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 2021: 12737-12746.
  8. Xiaoyang Lyu, Liang Liu, Mengmeng Wang, Xin Kong, Lina Liu, Yong Liu*, Xinxin Chen, Yi Yuan, HR-Depth : High Resolution Self-Supervised Monocular Depth Estimation,The Association for the Advance of Artificial Intelligence (AAAI), 2021
  9. Lina Liu, Xibin Song, Xiaoyang Lyu, Junwei Diao, Mengmeng Wang, Yong Liu*, Liangjun Zhang, FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion, The Association for the Advance of Artificial Intelligence (AAAI), 2021
  10. Jilin Tang, Yi Yuan*, Tianjia Shao, Yong Liu, Mengmeng Wang, Kun Zhou, Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation, the Association for the Advance of Artificial Intelligence (AAAI), 2021
  11. Guangming Yao†, Tianjia Shao†, Yi Yuan*, Shuang Li, Shanqi Liu, Yong Liu, Mengmeng Wang, Kun Zhou, One-shot Face Reenactment Using Appearance Adaptive Normalization,the Association for the Advance of Artificial Intelligence (AAAI), 2021
  12. Xu, C., Wu, X., Li, Y., Jin, Y., Wang, M*, & Liu, Y. (2021). Cross-modality online distillation for multi-view action recognition. Neurocomputing, 456, 384-393. corresponding author

Before 2021

  1. Hao Zhang, Mengmeng Wang, Yong Liu*, Yi Yuan. FDN: Feature Decoupling Network for Head Pose Estimation, Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI), New York, USA, 7-12 Feb. 2020.
  2. Zhang, J., Xu, C., Liu, L., Wang, M., Wu, X., Liu, Y., & Jiang, Y. (2020). Dtvnet: Dynamic time-lapse video generation via single still image. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part V 16 (pp. 300-315). Springer International Publishing.
  3. Xianfang Zeng, Yusu Pan, Mengmeng Wang, Jiangning Zhang, Yong Liu*. Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose, Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI), New York, USA, 7-12 Feb. 2020
  4. Jiangning Zhang, Xianfang Zeng, Mengmeng Wang, Yusu Pan, Liang Liu,Yong Liu*, Yu Ding, Changjie Fan. FReeNet: Multi-Identity Face Reenactment, 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 16 - 18 June, 2020, Equal First Author
  5. Jiangning Zhang, Chao Xu, Lina Liu, Mengmeng Wang, Xia Wu, Yong Liu*, DTVNet: Dynamic Time-lapse Video Generation via Single Still Image, European Conference on Computer Vision (ECCV), 2020,
  6. Xianfang Zeng, Yusu Pan, Hao Zhang, Mengmeng Wang, Guanzhong Tian, Yong Liu*, Unpaired Salient Object Translation via Spatial Attention Prior, Neurocomputing
  7. Kong, X., Yang, X., Zhai, G., Zhao, X., Zeng, X., Wang, M., ... & Wen, F. (2020). Semantic graph based place recognition for 3d point clouds. In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 8216-8223).
  8. Boyuan Jiang, Mengmeng Wang *, Weihao Gan, Wei Wu, Junjie Yan. STM: SpatioTemporal and motion encoding for action recognition, Proceedings of the IEEE International Conference on Computer Vision (ICCV). 2019: 2000-2009. Corresponding Author
  9. Mengmeng Wang, Yong Liu*, Daobilige Su, Yufan Liao, Lei Shi and Jinhong Xu. Accurate and Real-time 3D Tracking for the Following Robots by Fusing Vision and Ultra-sonar Information. IEEE/ASME Transactions on Mechatronics, 2018, 23(3): 997 - 1006.(IF=4.943,SCI)
  10. Mengmeng Wang, Yong Liu*, Zeyi Huang. Large Margin Object Tracking with Circulant Feature Maps, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, Hawaii, 22-25 July, 2017.
  11. Mengmeng Wang, Daobilige Su, Lei Shi, Yong Liu*, Jaime Valls Miro. Real-Time 3D Human Tracking for Mobile Robots with Multisensors, 2017 IEEE International Conference on Robotics & Automation (ICRA), Singapore, May 29-June 3, 2017.
  12. Mengmeng Wang, Yong Liu, Rong Xiong. Robust object tracking with a hierarchical ensemble framework, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Korean, Oct.9 - Oct. 14, 2016, 2016: 438-445.