| 13:30 – 13:45 | Session 1 | S19201 | TALENT: Table VQA via Augmented Language-Enhanced Natural-text Transcription | Yutong Guo, Wanying Wang, Yue Wu, Zichen Miao, and Haoyu Wang | Oral | 15 min |
| 13:45 – 14:00 | S19202 | Smart Vision-Language Reasoners | Denisa Olteanu Roberts and Lucas Roberts | Oral | 15 min |
| 14:00 – 14:15 | S19203 | BrAMA: A Data-Efficient Brain-Inspired Architecture for Semi-Supervised Multi-Modal Association | Jonathan Grienay, Marina Reyboz, Martial Mermillod, Laurent Rodriguez, and Benoit Miramond | Oral | 15 min |
| 14:15 – 14:30 | S19204 | A Practical Synthesis of Detecting AI-Generated Textual, Visual, and Audio Content | Lele Cao | Oral | 15 min |
| 14:30 – 14:45 | Coffee Break | 15 min |
| 14:45 – 15:00 | Session 2 | S19206 | Evaluating Open-Source Vision-Language Models for Multimodal Sarcasm Detection | Saroj Basnet, Shafkat Farabi, Tharindu Ranasinghe, Diptesh Kanojia, and Marcos Zampieri | Oral | 15 min |
| 15:00 – 15:15 | S19207 | TTNS: Dynamic Three-Tier Negative Sampling for Scalable Multi-Modal Search Ranking in Production | Fengbin Chen, Liping Zhang, and Tracy King | Oral | 15 min |
| 15:15 – 15:30 | DM258 | A Study on Multimodal Emotion Recognition Model Incorporating Edge Noise Optimization | Chen Huang, Huijie Liu, Yan Zhang, Chao Yang, and Jianhua Song | Oral | 15 min |
| 15:30 – 15:40 | Poster Spotlights | S19205 | OPTiCAL: An Abstract Positional Reasoning Benchmark for Vision-Language Models | Christopher Driggers-Ellis, Gabriel Ayoubi, and Christan Grant | Poster Spotlight | 10 min |
| 15:40 – 15:50 | DM494 | Guided Manifold Alignment with Geometry-Regularized Twin Autoencoders | Jake S. Rhodes, Adam G. Rustad, Marshall S. Nielsen, Morgan McClellan, Dallan Gardner, and Dawson Hedges | Poster Spotlight | 10 min |
| 15:50 – 16:00 | DM949 | SVDLoRA: Data-Driven Low-Rank Adaptation via Spectral Decomposition | Fanglue Zhang, Shufan Shen, Chao Bi, Li Su, Qingming Huang, and Shuhui Wang | Poster Spotlight | 10 min |
| 16:00 – 16:30 | Joint Main Conference Coffee Break | 30 min |
| 16:30 – 17:25 | Poster Interaction & Discussion | 55 min |
| 17:25 – 17:30 | Closing Remarks | 5 min |