1 repo

Speculative Decoding Strategies — Inference Acceleration Techniques

We curate 1 GitHub repository matching inference acceleration techniques · Speculative Decoding Strategies. Refine with filters or upvote what's useful.

Speculative Decoding Strategies — Inference Acceleration Techniques

We'll search the best matching repositories with AI.
  • unslothai/unsloth

    unslothai/unsloth

    52,461GitHub

    Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade

    Pythonagentdeepseekdeepseek-r1