← All repositories
52,461 stars4,351 forksPythonapache-2.00 views
unsloth.ai/docs

Unsloth

Features

  • Language Model TrainingUnsloth enables fine-tuning of models with reduced memory usage and faster speeds by automating dataset creation and applying reinforcement learning techniques.
  • LLM Fine-Tuning EnginesA high-performance training framework that optimizes memory usage and compute speed for fine-tuning large language models on consumer hardware.
  • Model Fine-Tuning FrameworksUnsloth provides efficient training techniques to adapt existing models for specific tasks or domain knowledge while maintaining low hardware requirements.
  • No-Code Training InterfacesUnsloth trains text, vision, and audio models using optimized techniques by uploading documents or configuration files without writing code.
  • Multimodal Training PlatformsA specialized development environment that supports efficient fine-tuning of text, vision, and audio models through optimized kernels and data pipelines.
  • Reinforcement Learning ToolkitsA collection of automated workflows and reward-based training methods designed to improve model reasoning and output quality through iterative feedback.
  • Training Acceleration EnginesUnsloth integrates high-throughput inference engines into the training stack to enable simultaneous fine-tuning and fast model inference with lower memory requirements.
  • Efficient Training PipelinesFine-tuning large models with reduced memory usage and faster speeds to adapt them for specific tasks or domain knowledge.
  • Quantized AdaptersApplies low-precision weight updates to compressed model layers to enable training on consumer-grade hardware with minimal accuracy loss.
  • Mixture of Experts OptimizationsUnsloth applies split adapter techniques to mixture-of-experts models to reduce memory overhead and increase training speed by managing parameters efficiently.
  • Custom Kernel AcceleratorsExecutes low-level mathematical operations using hand-optimized kernels to maximize hardware throughput and minimize memory overhead during training.
  • GRPO TrainingUnsloth teaches models to reason by generating multiple response variations and updating weights based on numerical scores from custom reward functions.
  • Training Backend OptimizersUnsloth automatically chooses the best training method based on detected hardware to maximize efficiency using native implementations or custom high-speed kernels.
  • Local Model ExecutionUnsloth facilitates searching, downloading, and executing language models locally while utilizing integrated tools and API endpoints for custom workflows.
  • Gradient CheckpointingReduces peak memory consumption by recomputing intermediate activations during the backward pass instead of storing them in GPU memory.
  • Tool Call Auto-healingUnsloth fixes malformed or broken tool calls during inference to ensure reliable execution and prevent formatting errors in model responses.
  • Local Model ServingDeploying and running language models on local hardware with API endpoints for integration into existing software and tools.
  • Model Inference DeploymentUnsloth supports deploying large language models locally using various file formats with automated tool calling, parameter tuning, and output comparisons.
  • Model Inference ServersUnsloth allows starting an inference server from the command line to load models and expose them via an API endpoint with built-in authentication.
  • Speculative Decoding StrategiesUnsloth speeds up text generation by predicting multiple future tokens in parallel, reducing the total number of processing steps during inference.
  • Local Inference RuntimesA deployment environment that executes quantized language models locally while providing API endpoints and tool-calling capabilities for external integration.
  • Model Comparison InterfacesUnsloth evaluates performance differences by running the same prompt through two different models simultaneously to compare their outputs.
  • Model Management DashboardsA unified web-based dashboard that simplifies the process of downloading, training, benchmarking, and exporting language models across various hardware configurations.
  • Training Data PreparationUnsloth enables structuring raw text into organized question-answer pairs or standardized formats, including options to generate synthetic data using local resources.
  • Training Hyperparameter ConfigurationsUnsloth allows adjusting settings like batch size and learning rate to balance processing speed, memory consumption, and model accuracy during training.
  • Context Memory OptimizationsUnsloth manages memory during reinforcement learning by chunking data across sequences to support significantly longer context lengths without exceeding hardware limits.
  • Data Pipeline BuildersUnsloth designs data generation workflows by connecting blocks for seeding, processing, and validation to create custom datasets for fine-tuning.
  • Multimodal Fine-TuningAdapting vision, audio, and text models to specific datasets while balancing performance and accuracy through efficient training techniques.
  • Computational Graph OptimizersAnalyzes and rewrites model execution paths at runtime to improve processing speed and reduce latency for complex multimodal tasks.
  • Data Collator PipelinesStandardizes raw input processing by dynamically resizing, padding, and masking multimodal data to ensure consistent training batch structures.
  • Dataset Preparation ToolsStructuring and generating synthetic training data through visual workflows to ensure models learn effectively from organized information.
  • Code Execution SandboxesUnsloth runs Bash and Python code in a secure environment to verify model outputs, generate files, and perform computations for increased reliability.
  • Model ExportingUnsloth supports saving custom model weights and converting them into standard file formats for local inference or production deployment.
  • Reward FunctionsUnsloth allows creating custom scoring functions to evaluate model outputs and guide the training process toward specific reasoning or formatting goals.
  • Speech Model Fine-TuningUnsloth trains speech models using efficient techniques to capture unique vocal characteristics and speaking styles that standard cloning cannot replicate.
  • Reinforcement LearningApplying reinforcement learning and custom reward functions to teach models complex reasoning, formatting, and problem-solving capabilities.
  • Quantized Model FormatsUnsloth provides access to pre-optimized model files featuring improved chat templates and tokenization for efficient local inference and training.
  • Embedding Model Fine-TuningUnsloth trains embedding models using efficient techniques while maintaining compatibility with existing data pipelines and encoder-only architectures.
  • Model Selection UtilitiesUnsloth assists in choosing the right model version based on hardware constraints and performance requirements for efficient reasoning or inference.
  • Model Evaluation MetricsUnsloth tracks training progress through loss metrics and verifies model quality using manual chat sessions or automated test sets to prevent overfitting.
  • Training Progress MonitoringUnsloth tracks loss, gradient norms, and hardware utilization in real time to maintain control over the model development process.
  • Vision Model Fine-TuningUnsloth allows selecting specific modules within a vision model to fine-tune, balancing training performance and model accuracy.
  • Multimodal Input OptimizationsUnsloth optimizes how models process visual and audio data by adjusting token budgets and input ordering to respect specific processing limits.
  • Multimodal Context ProvidersUnsloth allows attaching documents, images, and audio files to chat conversations to provide multimodal context for model prompts and testing.
  • FP8 Training OptimizationUnsloth reduces memory usage and increases training throughput during reinforcement learning by using lower-precision numerical formats for model calculations.
  • Authentication StrategiesUnsloth provides authentication credential generation within settings to secure access to locally running model instances and ensure authorized requests.
  • Model Management DashboardsUnsloth provides a unified web interface to control model training, data preparation, and chat interactions across various hardware configurations and operating systems.