Short Bio

I am an Algorithm Engineer with TikTok. My research interests include multimodal large language models, computer vision, and audio-video content understanding.

Before joining TikTok, I was a Senior Computer Vision Algorithm Engineer with YY Live (Baidu Group), Guangzhou, and a Staff Researcher with Lenovo Machine Intelligence Center, Hong Kong SAR. I received the Ph.D. degree in computer science from Hong Kong Baptist University in 2018 and the B.Eng. degree in computer science and technology from South China University of Technology in 2013. I was also a visiting scholar at Michigan State University.

Services

Invited Reviewer for
  • IEEE TPAMI, IEEE TIFS, IEEE TIP, IEEE TCYB
  • Pattern Recognition, IEEE/CAA Journal of Automatica Sinica
  • ICME, ICPR, ICASSP

Selected Publications

  • Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
    arXiv preprint, 2024
  • SecureFace: Face Template Protection
    IEEE Transactions on Information Forensics and Security (TIFS), 2020
  • On the Reconstruction of Face Images from Deep Face Templates
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
  • LeapDetect: An Agile Platform for Inspecting Power Transmission Lines from Drones
    IEEE ICDM Demo, 2019
  • Binary Feature Fusion for Discriminative and Secure Multi-biometric Cryptosystem
    Image and Vision Computing (IVC), 2017
  • On the Guessability of Binary Biometric Templates: A Practical Guessing Entropy based Approach
    IEEE IJCB, 2017
  • Fusing Binary Templates for Multi-biometric Cryptosystem
    IEEE BTAS, 2015