VLR-CVC/DocVQA-2026
Viewer • Updated • 73 • 8.91k • 62
Multimodal AI, Document Understanding, Reading Systems.
ComicsPAP: understanding comic strips by picking the correct panel
One missing piece in Vision and Language: A Survey on Comics Understanding