NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. โข 3 items โข Updated 3 days ago โข 1
NVIDIA Jetson AGX Orin Collection Models optimized and bench-marked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. โข 3 items โข Updated 3 days ago โข 1
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. โข 2 items โข Updated 3 days ago โข 1
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. โข 15 items โข Updated 3 days ago โข 1