8 repos

Data Integration & Synchronization — Data & Databases

We curate 8 GitHub repositories matching data & databases · Data Integration & Synchronization. Refine with filters or upvote what's useful.

Data Integration & Synchronization — Data & Databases

We'll search the best matching repositories with AI.
  • immich-app/immich

    immich-app/immich

    92,953GitHub

    Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining ful

    TypeScriptbackup-toolfluttergoogle-photos
  • firecrawl/firecrawl

    firecrawl/firecrawl

    84,034GitHub

    Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi

    TypeScriptaiai-agentsai-crawler
  • macrozheng/mall

    macrozheng/mall

    82,926GitHub

    This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secu

    Javadockerelasticsearchelk
  • syncthing/syncthing

    syncthing/syncthing

    80,036GitHub

    Syncthing is a decentralized file synchronization engine that maintains consistent data states across multiple devices through peer-to-peer mesh networking. It operates as a background daemon that automatically replicates file creations, modifications, and deletions between trusted nodes without requiring central serve

    Gogop2ppeer-to-peer
  • junegunn/fzf

    junegunn/fzf

    77,987GitHub

    This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe

    Gobashclifish
  • nomic-ai/gpt4all

    nomic-ai/gpt4all

    77,146GitHub

    GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a compreh

    C++ai-chatllm-inference
  • redis/redis

    redis/redis

    73,096GitHub

    Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr

    Ccachecachingdatabase
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHub

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++hacktoberfestlstmmachine-learning