Resources & References
Duration: 2 min
Curated resources to deepen your understanding and continue learning beyond this course.
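- vLLM: high-throughput, memory-efficient inference and serving engine for LLMs
- TensorRT: NVIDIA's SDK for optimizing deep learning inference on NVIDIA GPUs
- Triton Inference Server: NVIDIA's open-source server for deploying models from multiple frameworks
- ONNX Runtime: cross-platform inference engine for ONNX models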