43 repos

Awesome GitHub repositories, curated.

A community-curated directory of interesting public GitHub repositories. Ask in plain English — AI ranks by relevance. Save what you find.

Browse repositories

We'll search the best matching repositories with AI.
  • opencv/opencv

    opencv/opencv

    86,238GitHub

    OpenCV is a comprehensive computer vision library designed for real-time performance and cross-platform deployment. It provides a native execution environment that leverages multi-threaded operations and automated memory management to handle intensive computational tasks, including image processing and machine learning

    Computer Vision LibrariesReal-Time Computer VisionReference-Counted Memory Management
  • rasbt/LLMs-from-scratch

    rasbt/LLMs-from-scratch

    85,529GitHub

    This repository serves as an educational framework for building large language models from the ground up. It provides a structured curriculum that guides learners through the end-to-end lifecycle of model development, including data processing, architecture design, and optimization. By focusing on low-level implementat

    Language Model DevelopmentBackpropagation ImplementationsDeep Learning Implementations
  • firecrawl/firecrawl

    firecrawl/firecrawl

    84,034GitHub

    Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi

    Autonomous Web AgentsAutonomous Web CrawlersAutonomous Web Researchers
  • microsoft/ML-For-Beginners

    microsoft/ML-For-Beginners

    83,800GitHub

    This project is an open-source educational curriculum designed to provide a structured path for developers to master machine learning and generative AI. It functions as a technical skill development platform, offering comprehensive study materials that guide learners through fundamental concepts, algorithms, and the pr

    Educational CurriculaGuided TutorialsMachine Learning Education
  • punkpeye/awesome-mcp-servers

    punkpeye/awesome-mcp-servers

    81,101GitHub

    This project serves as a centralized directory and interoperability hub for the Model Context Protocol, providing a curated collection of standardized service connectors that bridge artificial intelligence models with external software, databases, and APIs. It facilitates the integration of AI agents with diverse ecosy

    Model Context Protocol ServersTool Interoperability ProtocolsAI Agent Tool Integrations
  • hacksider/Deep-Live-Cam

    hacksider/Deep-Live-Cam

    79,568GitHub

    Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi

    Cinematic Video EnhancementsFace Swapping ToolsHigh-Performance AI Inference
  • fighting41love/funNLP

    fighting41love/funNLP

    78,999GitHub

    This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati

    Awesome ListsLLM Application FrameworksTechnical Resource Repositories
  • browser-use/browser-use

    browser-use/browser-use

    78,576GitHub

    Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows

    Autonomous Browser AgentsAutonomous Web AgentsCDP Automation Interfaces
  • hoppscotch/hoppscotch

    hoppscotch/hoppscotch

    77,888GitHub

    Hoppscotch is an open-source API development ecosystem designed for building, testing, and debugging REST, GraphQL, and real-time APIs. It provides a unified platform that functions across web browsers, desktop applications, and command-line interfaces, allowing developers to manage the entire API lifecycle from a sing

    API ClientsAPI Development EcosystemsAPI Development Platforms
  • netdata/netdata

    netdata/netdata

    77,812GitHub

    Netdata is a distributed observability platform designed for real-time infrastructure monitoring and performance tracking. It functions as a high-frequency agent that collects system, container, and application metrics with per-second precision, providing both local visualization and centralized aggregation across comp

    Distributed Metric AggregatorsDistributed Observability PlatformsInfrastructure Metric Collectors
  • tensorflow/models

    tensorflow/models

    77,684GitHub

    This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo

    Computer Vision ModelsComputer Vision Research ToolkitsDistributed Parameter Synchronisation
  • nomic-ai/gpt4all

    nomic-ai/gpt4all

    77,146GitHub

    GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a compreh

    C++ Inference BackendsChat Completion InterfacesLocal Inference Engines
  • elastic/elasticsearch

    elastic/elasticsearch

    76,163GitHub

    Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin

    Distributed Search EnginesApplication Performance MonitoringData Querying Engines
  • d2l-ai/d2l-zh

    d2l-ai/d2l-zh

    75,708GitHub

    This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners

    Attention MechanismsAutomatic DifferentiationAutomatic Differentiation Engines
  • zed-industries/zed

    zed-industries/zed

    75,634GitHub

    Zed is an AI-native, high-performance code editor designed for extreme responsiveness and keyboard-centric workflows. It functions as an extensible text processing workspace that integrates autonomous agents and predictive models directly into the development environment to automate complex engineering tasks, refactori

    AI-Assisted Software DevelopmentsAI-Native Development EnvironmentsExtensible Text Editors
  • mlabonne/llm-course

    mlabonne/llm-course

    75,340GitHub

    This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we

    AI Research RepositoriesFine-Tuning StrategiesLarge Language Models
  • Developer-Y/cs-video-courses

    Developer-Y/cs-video-courses

    74,064GitHub

    This project is a community-driven educational repository that serves as a comprehensive directory of university-level computer science video lectures. It provides a structured learning path for students and professionals, aggregating high-quality academic resources to facilitate self-paced study across a wide range of

    Computer Science CurriculaOpen Learning ResourcesCollaborative Knowledge Bases
  • infiniflow/ragflow

    infiniflow/ragflow

    73,425GitHub

    This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin

    AI Agent FrameworksChat AssistantsGrounded Answer Generation
  • redis/redis

    redis/redis

    73,096GitHub

    Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr

    Active-Active Database ClustersDistributed CachesDistributed State Management
  • awesomedata/awesome-public-datasets

    awesomedata/awesome-public-datasets

    72,846GitHub

    This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t

    Curated Data RepositoriesCurated Resource ListsModel Training Pipelines