AI & ML interests
Edge AI Compute, CNN, Visual Transformer, LLM, VLM
Organization Card
AXera Models Research
This is the home for Axera's NPU models (axmodel) and NPU tools (Pulsar2). Released models include:
- MiniCPM4 : MiniCPM4-0.5B
- Qwen3 : Qwen3-0.6B, Qwen3-1.7B, Qwen3-4B
- Qwen2.5 : Qwen2.5-0.5B, Qwen2.5-1.5B, Qwen2.5-3B, Qwen2.5-7B
- DeepSeek : DeepSeek-R1-Distill-Qwen-1.5B, DeepSeek-R1-Distill-Qwen-7B
- HuggingFaceTB : SmolLM, SmolVLM, SmolVLM2
- Multimodal Models : CLIP, MobileCLIP2, JinaCLIP, StableDiffusion, Qwen3-VL-2B/4B, InternVL3_5-1B/2B, FastVLM-0.5B/1.5B, Qwen2.5-VL-3B/7B, Janus-Pro-1B, MiniCPM4-V
- Vision Models : Ultralytics, Depth-Anything-V2, MixFormerV2, LivePortrait, Real-ESRGAN
- Audio Models : Whisper, SenseVoice, ZipFormer, CosyVoice2, MeloTTS, FireRed-AED, SileroVAD, Kokoro
Solutions
- Frigate NVR : AI NVR solution, supports AX650 and AXCL
- Immich : High performance self-hosted photo and video management solution
Tools
- Pulsar2 : The NPU Toolchain for AX650/AX8850, AX630C/AX620Q, AX615, AX637
- AXCL : The driver installation package for AX650/AX8850
- PPQ-XS : The NPU Toolchain for AX520/AX513
Other
Models (147)
- AXERA-TECH/3D-Speaker-MT.Axera : Audio-Text-to-Text
- AXERA-TECH/SenseVoice : Automatic Speech Recognition
- AXERA-TECH/yolo26-seg : Image Segmentation
- AXERA-TECH/3D-Speaker-Meeting-Summary : Audio-Text-to-Text
- AXERA-TECH/IGEV-plusplus : Depth Estimation
- AXERA-TECH/DeepSeek-R1-Distill-Qwen-1.5B : Text Generation
- AXERA-TECH/HY-MT1.5-1.8B_GPTQ_INT4 : Translation
- AXERA-TECH/siglip2-base-patch16-224 : Zero-Shot Classification
- AXERA-TECH/RAFT-stereo : Depth Estimation
- AXERA-TECH/gtcrn.axera
Datasets (0)
None public yet