NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. • 5 items • Updated 8 days ago • 4
NVIDIA Jetson AGX Orin Collection Models optimized and bench-marked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. • 8 items • Updated 8 days ago • 3
view article Article How to Build a vLLM Plugin: A Guide to the general_plugins Entry Point 27 days ago • 2