Building on HF

1 42 60

Adarsh Zolekar

adarshzolekar

PhysiQuanty's profile picture

ahmadbabajo's profile picture

HustleMoney's profile picture

adarsh_zolekar
AdarshZolekar
adarshzolekar
adarshzolekar.bsky.social

AI & ML interests

Passionate about AI & ML, Deep Learning and related AI domains. Exploring models, datasets and applications while contributing to the Hugging Face community.

Recent Activity

liked a model 6 days ago

MiniMaxAI/MiniMax-M2.7

upvoted a paper 6 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

upvoted a paper 6 days ago

Adam's Law: Textual Frequency Law on Large Language Models

View all activity

Organizations

adarshzolekar 's collections 4

Multimodal AI Models

Purpose: Models that understand text + image + audio together.

llava-hf/llava-1.5-7b-hf

Image-Text-to-Text • 7B • Updated Jun 6, 2025 • 2.53M • 355
Salesforce/blip-image-captioning-base

Image-to-Text • Updated Feb 3, 2025 • 2.18M • 849
google/pix2struct-base

Image-to-Text • 0.3B • Updated Dec 24, 2023 • 2.34k • 79
microsoft/kosmos-2-patch14-224

Image-to-Text • Updated Nov 28, 2023 • 160k • 184

Vision Models (Image & Video)

Purpose: Text-to-image, image classification, detection, segmentation.

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.01M • • 7.65k
rupeshs/LCM-runwayml-stable-diffusion-v1-5

Text-to-Image • Updated Nov 12, 2023 • 146 • 30
openai/clip-vit-base-patch32

Zero-Shot Image Classification • Updated Feb 29, 2024 • 20.3M • 917
facebook/detr-resnet-50

Object Detection • 41.6M • Updated Apr 10, 2024 • 228k • • 946

Audio & Speech Models

Purpose: Speech recognition, text-to-speech, music, audio analysis.

openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.84M • • 5.61k
facebook/wav2vec2-base-960h

Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 1.21M • 396
coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 6.91M • 3.49k
microsoft/speecht5_tts

Text-to-Speech • Updated Nov 8, 2023 • 233k • 826

Text & Code Models (NLP)

Purpose: Text generation, summarization, translation, embeddings, coding.

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 9.46M • • 5.74k
mistralai/Mistral-7B-Instruct-v0.3

7B • Updated Dec 3, 2025 • 2.52M • 2.53k
google/gemma-7b

Text Generation • 9B • Updated Jun 27, 2024 • 138k • 3.3k
bigscience/bloom

Text Generation • 176B • Updated Jul 28, 2023 • 6.62k • 5k