merve's picture

Building on HF

merve PRO

merve

huggingface

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

new activity about 17 hours ago

allenai/MolmoPoint-GUI-8B:Add eval for Screenspot-Pro

upvoted a collection about 17 hours ago

new activity about 22 hours ago

Hcompany/Holo2-235B-A22B:Add ScreenSpot-Pro evaluation result (Holo2-235B-A22B)

View all activity

Organizations

upvoted a collection about 17 hours ago

MolmoPoint

MolmoPoint models • 3 items • Updated about 21 hours ago • 6

upvoted an article about 23 hours ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Jan 3, 2025

•

24

upvoted an article 2 days ago

Article

State of Open Source on Hugging Face: Spring 2026

2 days ago

•

40

upvoted an article 7 days ago

Article

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

7 days ago

•

29

upvoted an article 9 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

10 days ago

•

182

upvoted an article 21 days ago

Article

Mixture of Experts (MoEs) in Transformers

+5

22 days ago

•

137

upvoted an article 24 days ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

29 days ago

•

18

upvoted a collection about 1 month ago

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated about 1 month ago • 64

upvoted 2 articles about 1 month ago

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Feb 7

•

22

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

Feb 4

•

88

upvoted a changelog about 1 month ago

Hugging Face Changelog

Community Evals and Benchmark Repositories

Feb 5

• 73

upvoted 5 articles about 1 month ago

Article

🚀 SyGra V2.0.0

Feb 5

•

8

Article

Introducing SyGra Studio

Feb 5

•

25

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Feb 4

•

28

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Feb 3

•

69

Article

H Company's new Holo2 model takes the lead in UI Localization

Feb 3

•

5

upvoted a paper about 2 months ago

C-RADIOv4 (Tech Report)

Paper • 2601.17237 • Published Jan 24 • 10

upvoted a collection about 2 months ago

Open Coding Agents

13 items • Updated 14 days ago • 51

upvoted an article about 2 months ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Nov 5, 2025

•

12

upvoted a collection about 2 months ago

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models • 3 items • Updated 3 days ago • 10