1 repo

Data Preprocessing Pipelines — Data Engineering

We curate 1 GitHub repository matching data engineering · Data Preprocessing Pipelines. Refine with filters or upvote what's useful.

  • karpathy/nanoGPT

    53,461 · GitHub

    nanoGPT is a lightweight engine for training transformer-based language models from scratch or fine-tuning existing ones. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, using self-attention and feed-forward layers to process sequences and predi

    Python