Yaosi Hu (胡姚姒)
Post-Doctoral Fellow

The Hong Kong Polytechnic University

Location: The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Education | Publications | Standard Proposals | Services

Email: youncyhu@gmail.com
[Google Scholar] [GitHub] [Research Team]

About Me

I am currently a Post-Doctoral Fellow at the Hong Kong Polytechnic University, working with Prof. Chang Wen Chen. I received my Ph.D. degree from Wuhan University, China, supervised by Prof. Zhenzhong Chen. I have also served as a research intern at Microsoft Research Asia, supervised by Dr. Chong Luo during 2021-2022 working on video generation.
My current research interests include computer vision and image/video processing, especially focusing on generative models and video quality assessment.

Education


Publications

Conferences:

A Lightweight No-reference Video Quality Assessment Method
Huiying Shi, Yaosi Hu, Yingxue Zhang, Zhenzhong Chen
IEEE International Conference on Visual Communications and Image Processing (VCIP), 2023.
[PDF] [ Bibtex]

Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Yaosi Hu, Chong Luo, Zhenzhong Chen
IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022.
[PDF] [Code] [ Bibtex]

Subjective Quality Assessment of One-to-One Video-Telephony Service
Mengying Liu, Jose Joskowicz, Rafael Sotelo, Yaosi Hu, Zhenzhong Chen, Lei Yang
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), 2022.
[PDF] [ Bibtex]

Video Quality Assessment based on Quality Aggregation Networks
Wei Wu, Yingxue Zhang, Yaosi Hu, Zhenzhong Chen, Shan Liu
IEEE Visual Communications and Image Processing (VCIP), 2022.
[PDF] [ Bibtex]

Learn to Look Around: Deep Reinforcement Learning Agent for Video Saliency Prediction
Yiran Tao, Yaosi Hu, Zhenzhong Chen
IEEE Visual Communications and Image Processing (VCIP), 2021.
[PDF] [ Bibtex]

MAPS: Joint Multimodal Attention and POS Sequence Generation for Video Captioning
Cong Zou, Xuchen Wang, Yaosi Hu, Zhenzhong Chen, Shan Liu
IEEE Visual Communications and Image Processing (VCIP), 2021.
[PDF] [ Bibtex]

Subjective Study of Perceptual Quality for Micro-Video Applications
Yaosi Hu, Yingxue Zhang, Zizheng Liu, Zhenzhong Chen, Shan Liu
IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 2020.
[PDF] [ Bibtex]

A Multimodal Variational Encoder-Decoder Framework for Micro-video Popularity Prediction
Jiayi Xie, Yaochen Zhu, Zhibin Zhang, Jian Peng, Jing Yi, Yaosi Hu, Hongyi Liu, Zhenzhong Chen
Proceedings of The Web Conference (WWW), 2020.
[PDF] [ Bibtex]

Hierarchical Global-Local Temporal Modeling for Video Captioning
Yaosi Hu, Zhenzhong Chen, Zheng-Jun Zha, Feng Wu
ACM International Conference on Multimedia (ACM MM), 2019.
[PDF] [ Bibtex]

Two-Stream Refinement Network for RGB-D Saliency Detection
Di Liu, Yaosi Hu, Kao Zhang, Zhenzhong Chen
IEEE International Conference on Image Processing (ICIP), 2019.
[PDF] [ Bibtex]

RGB-D Semantic Segmentation: A Review
Yaosi Hu, Zhenzhong Chen, Weiyao Lin
IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2018.
[PDF] [ Bibtex]




Journals:

Memory-guided Representation Matching for Unsupervised Video Anomaly Detection
Yiran Tao, Yaosi Hu, Zhenzhong Chen
Journal of Visual Communication and Image Representation (JVCI), 2024.
[PDF] [ Bibtex]

A Benchmark for Controllable Text-Image-to-Video Generation
Yaosi Hu, Chong Luo, Zhenzhong Chen
IEEE Transactions on Multimedia (TMM), 2023.
[PDF] [Code] [ Bibtex]

Multiple Visual Relationship Forecasting and Arrangement in Videos
Wanping Ouyang, Yaosi Hu, Yangjun Ou, Zhenzhong Chen
Neurocomputing, 2023.
[PDF] [ Bibtex]

Decomposing Style, Content, and Motion for Videos
Yaosi Hu, Dacheng Yin, Yuwang Wang, Zhenzhong Chen, Chong Luo
Journal of Visual Communication and Image Representation (JVCI), 2022.
[PDF] [Demo] [ Bibtex]

Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360o Videos: ITU-T Rec. P.919
Jesús Gutiérrez, Pablo Pérez, Marta Orduna, Ashutosh Singla, Carlos Cortés, Pramit Mazumdar, Irene Viola, Kjell Brunnström, Federica Battisti, Natalia Cieplińska, Dawid Juszka, Lucjan Janowski, Mikołaj Leszczuk, Anthony Adeyemi-Ejeye, Yaosi Hu, Zhenzhong Chen, Glenn Van Wallendael, Peter Lambert, César Díaz, John Hedlund, Omar Hamsis, Stephan Fremerey, Frank Hofmeyer, Alexander Raake, Pablo César, Marco Carli, and Narciso García
IEEE Transactions on Multimedia (TMM), 2022.
[PDF] [Data] [ Bibtex]

Predicate Correlation Learning for Scene Graph Generation
Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen
IEEE Transactions on Image Processing (TIP), 2022.
[PDF] [ Bibtex]

Exploiting the Local Temporal Information for Video Captioning
Ran Wei, Li Mi, Yaosi Hu, Zhenzhong Chen
Journal of Visual Communication and Image Representation (JVCI), 2020.
[PDF] [ Bibtex]


Arxiv:

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang
arXiv, 2024.
[PDF] [ Bibtex]

LaMD: Latent Motion Diffusion for Video Generation
Yaosi Hu, Zhenzhong Chen, Chong Luo
arXiv, 2023.
[PDF] [ Bibtex]

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit
Yaosi Hu, Zhenzhong Chen
arXiv, 2022.
[PDF] [ Bibtex]


Standard Proposals

  • ITU-T P.919. Subjective test methodologies for 360º video on head-mounted displays, 2020.10. Pablo Pérez, Jesús Gutiérrez, Ashutosh Singla, Irene Viola, Federica Battisti, Dawid Juszka, Marta Orduna, Zhenzhong Chen, Yaosi Hu.
  • Zhenzhong Chen, Yaosi Hu, Yingxue Zhang, Yi Han, Xuan Yan, Shan Liu, Xiaozhong Xu. Subjective Quality Assessment Database for PGC Video Content Towards Mobile Video Applications. AVS M5083, 2019.12, Shenzhen, CN.
  • Yingxue Zhang, Yaosi Hu, Zhenzhong Chen, Xiaozhong Xu, Shan Liu, Xuan Yan, Yi Han. Subjective Test Methodology for PGC Video Content Towards Mobile Video Applications. AVS M5082, 2019.12, Shenzhen, CN.
  • Zizheng Liu, Ruigang Yao, Wei Wu, Yingxue Zhang, Yaosi Hu, Zhenzhong Chen, Xiaozhong Xu, Shan Liu, Xuan Yan, Yi Han. No-reference Video Quality Aseessment Model for PGC Video Content Towards Mobile Video Applications. AVS M5084, 2019.12, Shenzhen, CN.

Services

  • Reviewer - International Journal of Computer Vision (IJCV)
  • Reviewer - IEEE Transactions on Image Processing (TIP)
  • Reviewer - IEEE Transactions on Multimedia (TMM)
  • Reviewer - Expert Systems with Applications (ESWA)
  • Reviewer - Neurocomputing
  • Reviewer - Entertainment Computing
  • Reviewer - IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • Reviewer - IEEE/CVF International Conference on Computer Vision (ICCV)
  • Reviewer - European Conference on Computer Vision (ECCV)
  • Reviewer - AAAI Conference on Artificial Intelligence (AAAI)
  • Reviewer - IEEE International Conference on Multimedia and Expo (ICME)
  • Reviewer - Asian Conference on Computer Vision (ACCV)
  • Reviewer - IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR)
  • 2020: Program Coordinator - IEEE 3rd International Conference on Multimedia Information Processing and Retrieval (MIPR)
  • 2019: Program Coordinator and Local Arrangement - Video Quality Experts Group Meeting