17 repos

Search & Information Retrieval — Data & Databases

We curate 17 GitHub repositories matching data & databases · Search & Information Retrieval. Refine with filters or upvote what's useful.

Search & Information Retrieval — Data & Databases

We'll search the best matching repositories with AI.
  • EbookFoundation/free-programming-books

    EbookFoundation/free-programming-books

    382,801GitHub

    This project is a centralized, open-access repository that serves as a structured directory for technical education and professional development. It functions as a community-driven knowledge base, aggregating high-quality learning materials to support global accessibility to computer science and software engineering re

    Pythonbookseducationhacktoberfest
  • vinta/awesome-python

    vinta/awesome-python

    283,687GitHub

    This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th

    Pythonawesomecollectionspython
  • openclaw/openclaw

    openclaw/openclaw

    211,971GitHub

    Openclaw is a platform for managing agent execution environments, providing the infrastructure to control agent lifecycles, session state, and workspace persistence. It features a centralized gateway that handles model loops, tool invocation, and streaming events, while supporting multi-agent routing and persistent mem

    TypeScriptaiassistantcrustacean
  • Significant-Gravitas/AutoGPT

    Significant-Gravitas/AutoGPT

    181,891GitHub

    AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, i

    Pythonaiartificial-intelligenceautonomous-agents
  • f/prompts.chat

    f/prompts.chat

    145,637GitHub

    Prompts.chat is a community-driven repository and management platform for AI prompts and agent skills. It provides a centralized interface for users to search, retrieve, and save prompts, while offering structured storage for multi-file agent skills that include documentation and supporting assets. The platform distin

    HTMLaiartificial-intelligenceawesome-list
  • ripienaar/free-for-dev

    ripienaar/free-for-dev

    118,073GitHub

    This project is a community-maintained directory of technical resources, tools, and services that offer free tiers for developers. It serves as a centralized reference point for discovering infrastructure, software, and educational materials, helping individuals and teams minimize operational costs while building and s

    HTMLawesome-listfree-for-developers
  • Anduin2017/HowToCook

    Anduin2017/HowToCook

    98,028GitHub

    HowToCook is a structured culinary knowledge base and computational engine designed for the management and scaling of instructional cooking content. It provides a framework for organizing technical preparation procedures and ingredient data, allowing users to maintain consistent culinary standards across various meal s

    Dockerfilechinesecookbookcooking
  • supabase/supabase

    supabase/supabase

    97,908GitHub

    This project provides an integrated backend platform built around a relational database. It automatically generates REST and GraphQL APIs from database schemas, allowing for direct data interaction through standard requests and client libraries. The platform includes a comprehensive authentication system that manages u

    TypeScriptaialternativeauth
  • firecrawl/firecrawl

    firecrawl/firecrawl

    84,034GitHub

    Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi

    TypeScriptaiai-agentsai-crawler
  • macrozheng/mall

    macrozheng/mall

    82,926GitHub

    This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secu

    Javadockerelasticsearchelk
  • bregman-arie/devops-exercises

    bregman-arie/devops-exercises

    81,169GitHub

    This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding cha

    Pythonansibleawsazure
  • DopplerHQ/awesome-interview-questions

    DopplerHQ/awesome-interview-questions

    81,035GitHub

    This project is a comprehensive, community-sourced repository of technical interview questions and study materials. It serves as a centralized index for software engineers to prepare for technical assessments, benchmark their personal knowledge, and identify gaps in their expertise across a wide range of programming la

    android-interview-questionsangularjs-interview-questionsawesome
  • junegunn/fzf

    junegunn/fzf

    77,987GitHub

    This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe

    Gobashclifish
  • elastic/elasticsearch

    elastic/elasticsearch

    76,163GitHub

    Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin

    Javaelasticsearchjavasearch-engine
  • infiniflow/ragflow

    infiniflow/ragflow

    73,425GitHub

    This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin

    Pythonagentagenticagentic-ai
  • redis/redis

    redis/redis

    73,096GitHub

    Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr

    Ccachecachingdatabase
  • awesomedata/awesome-public-datasets

    awesomedata/awesome-public-datasets

    72,846GitHub

    This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t

    aaron-swartzawesome-public-datasetsdatasets