Open to Work

Aritra Dutta

dutta18 · https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

upvoted an article about 5 hours ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

2 days ago • 28

New activity in lmms-lab/DocVQA 5 days ago

DataFilesNotFoundError: No (supported) data files found in lmms-lab/DocVQA

#5 opened 5 days ago by

liked a model 5 days ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 25.3k • 1.59k

updated a dataset 9 days ago

dutta18/A-OKVQA-17K

Viewer • Updated 9 days ago • 18.2k • 135

published a dataset 9 days ago

dutta18/A-OKVQA-17K

Viewer • Updated 9 days ago • 18.2k • 135

updated a dataset 9 days ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated 9 days ago • 64.9k • 117

published a dataset 9 days ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated 9 days ago • 64.9k • 117

updated a dataset 9 days ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated 9 days ago • 23.7k • 108

published a dataset 9 days ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated 9 days ago • 23.7k • 108

upvoted a collection 10 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 561

New activity in google/gemma-3-4b-it about 1 month ago

Finetuning Code Link In Native PyTorch

#87 opened about 1 month ago by

liked a model about 1 month ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 203k • 1.58k

New activity in mistralai/Ministral-3-3B-Instruct-2512 about 1 month ago

How to use local image in the chat template?

#15 opened about 1 month ago by

updated a dataset 2 months ago

dutta18/multidomain-VQA-with-cot-trace-9K

Viewer • Updated Feb 6 • 10.8k • 14

published a dataset 2 months ago

dutta18/multidomain-VQA-with-cot-trace-9K

Viewer • Updated Feb 6 • 10.8k • 14

upvoted an article 2 months ago

Article

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Feb 1, 2022 • 15

New activity in lmms-lab/LongVA-7B 2 months ago

TypeError: unsupported operand type(s) for //: 'int' and 'NoneType' while calling the processor

#1 opened 2 months ago by

liked a model 2 months ago

mPLUG/mPLUG-Owl3-7B-240728

Image-Text-to-Text • 8B • Updated Sep 29, 2024 • 1.2k • 43

New activity in mPLUG/mPLUG-Owl3-7B-240728 2 months ago

KeyError: None for when the model.generate() method is executed in mPlugOwl3

#8 opened 2 months ago by

New activity in aws-prototyping/long-llava-qwen2-7b 3 months ago

Model's processor have small bug in the integer division.

#2 opened 3 months ago by