Publications

My work spans multimodal LLMs, computer vision, biometric security, and applied multimedia understanding. For citation counts and the most up-to-date list, please refer to my public research profiles.

Google Scholar OpenReview ORCID

Selected Recent and Representative Work

A compact view of the topics that have shaped my research trajectory, from multimodal reasoning to biometric security.

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Jieyu Wang, Xiaobing Zha, Xian Wei, Tingting Qiao, Guangcan Mai, and Xiaochun Cao. ICLR 2026 Poster / arXiv 2025

Region-level multimodal understanding with a focus on precise pixel grounding and contextual reasoning.

OpenReview arXiv

SecureFace: Face Template Protection

Guangcan Mai, Kai Cao, Xiangyuan Lan, and Pong C. Yuen. IEEE Transactions on Information Forensics and Security, 2021

Template protection for face recognition with strong attention to privacy and security requirements.

DOI

On the Reconstruction of Face Images from Deep Face Templates

Guangcan Mai, Kai Cao, Pong C. Yuen, and Anil K. Jain. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019

A study of the reconstruction risks behind deep face templates and their security implications.

DOI arXiv

Journal Papers

SecureFace: Face Template Protection

Guangcan Mai, Kai Cao, Xiangyuan Lan, and Pong C. Yuen. IEEE Transactions on Information Forensics and Security, 2021

DOI

On the Reconstruction of Face Images from Deep Face Templates

Guangcan Mai, Kai Cao, Pong C. Yuen, and Anil K. Jain. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019

DOI arXiv

Binary Feature Fusion for Discriminative and Secure Multi-biometric Cryptosystem

Guangcan Mai, Meng-Hui Lim, and Pong C. Yuen. Image and Vision Computing, 2017

DOI

Learning Discriminability-preserving Histogram Representation from Unordered Features for Multibiometric Feature-fused-template Protection

Meng-Hui Lim, Sunny Verma, Guangcan Mai, and Pong C. Yuen. Pattern Recognition, 2016

DOI

Conference Papers

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Jieyu Wang, Xiaobing Zha, Xian Wei, Tingting Qiao, Guangcan Mai, and Xiaochun Cao. ICLR 2026 Poster / arXiv 2025

OpenReview arXiv

LeapDetect: An Agile Platform for Inspecting Power Transmission Lines from Drones

Guangcan Mai, Renjie Gou, Liya Ji, Hua Wu, Fei Cao, Qifeng Chen, and Jun Luo. IEEE International Conference on Data Mining Workshops, Demo Session, 2019

DOI

On the Guessability of Binary Biometric Templates: A Practical Guessing Entropy based Approach

Guangcan Mai, Meng-Hui Lim, and Pong C. Yuen. International Joint Conference on Biometrics, 2017

DOI

Fusing Binary Templates for Multi-biometric Cryptosystem

Guangcan Mai, Meng-Hui Lim, and Pong C. Yuen. IEEE International Conference on Biometrics: Theory, Applications and Systems, 2015

DOI

Thesis

Biometric System Security and Privacy: Data Reconstruction and Template Protection

Guangcan Mai. Ph.D. Thesis, Hong Kong Baptist University, 2018