17 repos
Search & Information Retrieval — Data & Databases
We curate 17 GitHub repositories matching data & databases · Search & Information Retrieval. Refine with filters or upvote what's useful.
Search & Information Retrieval — Data & Databases
- EbookFoundation/free-programming-books
This project is a centralized, open-access repository that serves as a structured directory for technical education and professional development. It functions as a community-driven knowledge base, aggregating high-quality learning materials to support global accessibility to computer science and software engineering re
Pythonbookseducationhacktoberfest - vinta/awesome-python
This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th
Pythonawesomecollectionspython - openclaw/openclaw
Openclaw is a platform for managing agent execution environments, providing the infrastructure to control agent lifecycles, session state, and workspace persistence. It features a centralized gateway that handles model loops, tool invocation, and streaming events, while supporting multi-agent routing and persistent mem
TypeScriptaiassistantcrustacean - Significant-Gravitas/AutoGPT
AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, i
Pythonaiartificial-intelligenceautonomous-agents - f/prompts.chat
Prompts.chat is a community-driven repository and management platform for AI prompts and agent skills. It provides a centralized interface for users to search, retrieve, and save prompts, while offering structured storage for multi-file agent skills that include documentation and supporting assets. The platform distin
HTMLaiartificial-intelligenceawesome-list - ripienaar/free-for-dev
This project is a community-maintained directory of technical resources, tools, and services that offer free tiers for developers. It serves as a centralized reference point for discovering infrastructure, software, and educational materials, helping individuals and teams minimize operational costs while building and s
HTMLawesome-listfree-for-developers - Anduin2017/HowToCook
HowToCook is a structured culinary knowledge base and computational engine designed for the management and scaling of instructional cooking content. It provides a framework for organizing technical preparation procedures and ingredient data, allowing users to maintain consistent culinary standards across various meal s
Dockerfilechinesecookbookcooking - supabase/supabase
This project provides an integrated backend platform built around a relational database. It automatically generates REST and GraphQL APIs from database schemas, allowing for direct data interaction through standard requests and client libraries. The platform includes a comprehensive authentication system that manages u
TypeScriptaialternativeauth - firecrawl/firecrawl
Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi
TypeScriptaiai-agentsai-crawler - macrozheng/mall
This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secu
Javadockerelasticsearchelk - bregman-arie/devops-exercises
This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding cha
Pythonansibleawsazure - DopplerHQ/awesome-interview-questions
This project is a comprehensive, community-sourced repository of technical interview questions and study materials. It serves as a centralized index for software engineers to prepare for technical assessments, benchmark their personal knowledge, and identify gaps in their expertise across a wide range of programming la
android-interview-questionsangularjs-interview-questionsawesome - junegunn/fzf
This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe
Gobashclifish - elastic/elasticsearch
Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin
Javaelasticsearchjavasearch-engine - infiniflow/ragflow
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin
Pythonagentagenticagentic-ai - redis/redis
Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr
Ccachecachingdatabase - awesomedata/awesome-public-datasets
This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t
aaron-swartzawesome-public-datasetsdatasets