Many real-world applications like video surveillance and urban governance need to address the recognition of masked faces. In this research, we build the first large-scale MAsked FAce dataset (MAFA) in the world and propose novel solutions(e.g., LLE-CNNs, identity-diversity inpainting, de-occlusion distillation) to alleviate the effect of large occlusions in understanding masked faces, which are published in CVPR, ACM MM and IEEE TCSVT.
Low-resolution face images lack sufficient information required for recognition and have a certain degree of ambiguity. In this research, we propose cross-quality knowledge distillation framework and devise a series of effective algorithms (e.g., selective knowledge distillation, bridge distillation, few-shot distillation, hybrid order relational distillation, etc) to facilitate low-resolution face recognition in the wild, which are published in IEEE TIP, TCSVT, ACM MM and AAAI.
Tracking and understanding tiny objects plays an important role in many visual intelligence applications like video surveilance, military and medical. In this research, we aim to propose effective solutions to facilitate the robustness, accuracy and efficiency of deep visual tracking in the real-world scenarios like UAV and unmanned systems, which are publised in ACM MM, IEEE TNNLS and TIP, and have been deployed in some unmanned devices. Our solutions won the first places in two international challenges (VisDrone-SOT@ICCV2019 and Anti-UAV@CVPR2020).
The rapid developping Deepfake technologies can easily generate large-scale high-fidelity fake videos, causing serious social influence. This research aims to discove the latent patterns inside Deepfake videos, proposes effective Deepfake detection approach by using temporal dropout 3DCNN and predictive representation learning, and develops explainable tool (XAI) to make the prediction more understandable for human. Our solution delivers state-of-the-art performance on popular benchmarks. These works are published in IEEE TIP, ACM TOMM, ACM MM and IJCAI.
The uncontrollability in real scenario seriously limits the ability of the representation learning, such as the unexplainability for human, the vulnerability to adversarial attack, the difficulty for learning from imperfect data, and other situations where the effective, fair and safe representation is very hard to learn. We work on constructing a series of controllable representation learning algorithms (e.g., selective-supervised contrastive learning, coupled-view learning, soft decision trees embedding, multi-granularity representation) to solve the above problems in real-world applications, which are publised in NeurIPS, CVPR, AAAI, ACM MM, IEEE TMM and IEEE TIP.
When it comes to machine learning, the importance of data is self-evident, but most data is not wanted to be made public, a situation known as a "data island". In this research, we propose a controlled shared learning framework and design a series of effective algorithms (such as shared distillation, maximum local sharing, compression sharing) for machine learning under controlled conditions (data controlled, model controlled). These methods are suitable for various tasks (classification, semantic segmentation) and experiments on various datasets with excellent performance. which are published in IEEE TIP, MMSP, and so on.