1 repo

Quantization Strategies — Model Optimization Techniques

We curate 1 GitHub repository matching model optimization techniques · Quantization Strategies. Refine with filters or upvote what's useful.

Quantization Strategies — Model Optimization Techniques

We'll search the best matching repositories with AI.
  • meta-llama/llama

    meta-llama/llama

    59,157GitHub

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Python