NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. • 3 items • Updated 2 days ago
NVIDIA Jetson AGX Orin Collection Models optimized and benchmarked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. • 3 items • Updated 3 days ago
Article Benchmarks + Report: Optimized Cosmos-Reason2 (Qwen3-VL) for on-device inference on 8GB RAM (Jetson Orin Nano Super) • about 20 hours ago
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. • 2 items • Updated 2 days ago
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. • 15 items • Updated 2 days ago