Cross-Platform AI Accelerators — Hardware Acceleration
We curate 1 GitHub repository matching hardware acceleration · Cross-Platform AI Accelerators.
- vllm-project/vllm
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Python · amd · blackwell · cuda
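
The online-serving mode mentioned in the vLLM entry can be illustrated with a small client. This is a minimal sketch, not taken from the listing: it assumes a vLLM OpenAI-compatible server is already running locally (for example, started with `vllm serve <model>`), and the base URL, the helper names `build_completion_request` and `complete`, and the model name in the usage note are illustrative assumptions.

```python
import json
import urllib.request

# Assumption: a vLLM OpenAI-compatible server is listening here.
# Port 8000 is the usual default for `vllm serve`; adjust for your deployment.
BASE_URL = "http://localhost:8000/v1"


def build_completion_request(model: str, prompt: str, max_tokens: int = 64) -> urllib.request.Request:
    """Build (but do not send) a POST request for the /v1/completions endpoint."""
    payload = {
        "model": model,            # must match the model the server was started with
        "prompt": prompt,
        "max_tokens": max_tokens,  # upper bound on generated tokens
        "temperature": 0.0,        # greedy decoding
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(model: str, prompt: str) -> str:
    """Send the request and return the text of the first completion."""
    with urllib.request.urlopen(build_completion_request(model, prompt)) as resp:
        body = json.load(resp)
    return body["choices"][0]["text"]
```

With a server running, `complete("facebook/opt-125m", "Hello, my name is")` would return the generated continuation; the model name here is only a placeholder. Building the request separately from sending it keeps the payload easy to inspect without a live server.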