IAPR/IEEE Winter School on Biometrics 2025

Foundations of Vision-Language Models: Concepts and Roadmap

Slides (pdf)  Size: 42.4MB

Biography

Dr. Kaiyang Zhou is an Assistant Professor in the Department of Computer Science at Hong Kong Baptist University (HKBU), working on computer vision and machine learning. He has published more than 30 technical papers in top-tier journals and conferences in relevant fields, including CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, AAAI, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), and International Journal of Computer Vision (IJCV), with over 9,000 citations in total. He is an Associate Editor of IJCV, the flagship journal in computer vision, and regularly serves as an area chair and senior program committee member for top-tier computer vision and machine learning conferences, such as NeurIPS, CVPR, ECCV, and AAAI. He is also the creator of several impactful AI software packages, such as Torchreid (the most popular person re-identification project on GitHub), Dassl (a multifunctional machine learning framework), and CoOp (a prompt learning tool for improving vision-language models). Prior to joining HKBU, he was a postdoc at Nanyang Technological University, Singapore, working with Prof. Ziwei Liu and Prof. Chen Change Loy. He received his PhD in computer science from the University of Surrey, UK, under the supervision of Prof. Tao Xiang.

Fields of specialization: multimodal models, domain generalization, domain adaptation.

Kaiyang Zhou
Hong Kong Baptist University, Hong Kong, China