Previous: Jia Li, Yin Chen, Xuesong Zhang, et al. Multimodal feature extraction and fusion for emotional reaction intensity estimation and expression classification in videos with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2023: 5837-5843.
Next: Li Jiaxiu (undergraduate student), Li Kun, Li Jia* (corresponding author), Chen Guoliang, Wang Meng, Guo Dan*. Dual-Path Temporal Map Optimization for Make-up Temporal Video Grounding[J]. Multimedia Systems, 30, 140 (2024).