DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos Through Question Answering

Haochen Wang, Kai Hu, Liangcai Gao

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2025)

Key words

Video Question Answering, Document Understanding, Multi-modal Large Language Model, Dataset
