NAN

nan1248

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 5 days ago

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

Paper • 2604.00886 • Published 6 days ago • 5

submitted a paper to Daily Papers 5 days ago

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

Paper • 2604.00886 • Published 6 days ago • 5

authored 2 papers 5 days ago

GenX: Mastering Code and Test Generation with Execution Feedback

Paper • 2412.13464 • Published Dec 18, 2024 • 1

AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model

Paper • 2510.11496 • Published Oct 13, 2025 • 5

upvoted a paper about 2 months ago

POINTS-GUI-G: GUI-Grounding Journey

Paper • 2602.06391 • Published Feb 6 • 18

upvoted a paper 3 months ago

Sliding Window Attention Adaptation

Paper • 2512.10411 • Published Dec 11, 2025 • 21

upvoted 2 collections 3 months ago

AndesVL

Collection

AndesVL is a suite of mobile-optimized Multimodal Large Language Models (MLLMs) with 0.6B to 4B parameters. • 8 items • Updated Feb 1 • 15

Molmo2 Data

Collection

Artifacts for the Molmo2 data release • 13 items • Updated Mar 2 • 39

liked a model 4 months ago

tencent/HunyuanOCR

Image-Text-to-Text • Updated Jan 13 • 194k • 562

upvoted a paper 5 months ago

Can Visual Input Be Compressed? A Visual Token Compression Benchmark for Large Multimodal Models

Paper • 2511.02650 • Published Nov 4, 2025 • 10

upvoted a paper 6 months ago

DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents

Paper • 2510.19336 • Published Oct 22, 2025 • 17

liked 8 models 6 months ago

upvoted a paper 6 months ago

AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model

Paper • 2510.11496 • Published Oct 13, 2025 • 5
