计算机视觉论文索引 [Visual Pretrain] Next-Embedding Prediction Makes Strong Vision Learners [Video Action] Modeling Video Evolution For Action Recognition Convolutional Two-Stream Network Fusion for Video Action Recognition Written on December 19, 2025