24 repos

Data Processing & Transformation — Data & Databases

We curate 24 GitHub repositories matching data & databases · Data Processing & Transformation. Refine with filters or upvote what's useful.

Data Processing & Transformation — Data & Databases

We'll search the best matching repositories with AI.
  • jwasham/coding-interview-university

    jwasham/coding-interview-university

    337,188GitHub

    This project is a comprehensive educational roadmap designed to guide software engineers through the mastery of computer science fundamentals and technical interview preparation. It provides a structured, dependency-aware learning path that organizes complex computing concepts into a hierarchical curriculum, enabling u

    algorithmalgorithmscoding-interview
  • vinta/awesome-python

    vinta/awesome-python

    283,687GitHub

    This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th

    Pythonawesomecollectionspython
  • TheAlgorithms/Python

    TheAlgorithms/Python

    217,914GitHub

    This project is a comprehensive repository of verified computational implementations designed to serve as an educational resource for computer science and algorithmic problem solving. It provides a structured collection of code examples that cover fundamental data structures, mathematical operations, and core programmi

    Pythonalgorithmalgorithm-competitionsalgorithms-implemented
  • tensorflow/tensorflow

    tensorflow/tensorflow

    193,864GitHub

    TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst

    C++deep-learningdeep-neural-networksdistributed
  • Significant-Gravitas/AutoGPT

    Significant-Gravitas/AutoGPT

    181,891GitHub

    AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, i

    Pythonaiartificial-intelligenceautonomous-agents
  • n8n-io/n8n

    n8n-io/n8n

    175,396GitHub

    n8n is a workflow automation platform that combines a visual interface with code-based extensibility to design, orchestrate, and manage automated processes. It provides a comprehensive suite of tools for data transformation, filtering, and storage, allowing users to build complex logic through conditional branching, lo

    TypeScriptaiapisautomation
  • avelino/awesome-go

    avelino/awesome-go

    165,543GitHub

    This project serves as a comprehensive language ecosystem index, functioning as a centralized, community-curated directory for the Go programming language. It organizes a vast landscape of software components, libraries, and development tools into a structured, navigable hierarchy, enabling developers to efficiently di

    Goawesomeawesome-listgo
  • langchain-ai/langchain

    langchain-ai/langchain

    127,015GitHub

    LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t

    Pythonagentsaiai-agents
  • d3/d3

    D3 is a modular library providing low-level primitives for creating data-driven visualizations. It functions as a flexible framework that allows for direct control over visual presentation by mapping abstract data dimensions to graphical properties, such as position, color, and size, without imposing predefined chart a

    Shellchartchartsd3
  • iptv-org/iptv

    iptv-org/iptv

    111,772GitHub

    This project is a community-maintained, open-source repository that functions as a centralized directory for streaming metadata. It aggregates publicly available network stream links and organizes them into standardized, machine-readable playlist formats. By acting strictly as a metadata-only index, the platform enable

    TypeScriptiptvm3uplaylist
  • godotengine/godot

    godotengine/godot

    106,855GitHub

    Godot is a comprehensive, node-based game engine designed for building interactive 2D and 3D applications. It provides an integrated development environment that utilizes a hierarchical scene system to organize objects, propagate spatial transformations, and manage lifecycle events. The engine functions as a cross-plat

    C++game-developmentgame-enginegamedev
  • GrowingGit/GitHub-Chinese-Top-Charts

    GrowingGit/GitHub-Chinese-Top-Charts

    106,216GitHub

    This project functions as a curated software directory and developer resource index, providing a centralized platform for discovering and evaluating high-quality open-source repositories. It serves as an aggregator that monitors trending software and educational resources, organizing them by technical domain and progra

    Java
  • iluwatar/java-design-patterns

    iluwatar/java-design-patterns

    93,757GitHub

    This project is a comprehensive educational knowledge base designed to help developers master software engineering excellence through a structured catalog of design patterns and architectural principles. It provides a curated repository of best practices, programming heuristics, and implementation examples, all organiz

    Javaawesome-listdesign-patternshacktoberfest
  • oven-sh/bun

    oven-sh/bun

    87,491GitHub

    Bun is a high-performance runtime environment designed to execute JavaScript and TypeScript applications with minimal latency and high throughput. Built on a native core implemented in Zig, it provides a unified execution engine that leverages JavaScriptCore for efficient memory management and low-latency startup. The

    Zigbunbundlerjavascript
  • firecrawl/firecrawl

    firecrawl/firecrawl

    84,034GitHub

    Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi

    TypeScriptaiai-agentsai-crawler
  • DopplerHQ/awesome-interview-questions

    DopplerHQ/awesome-interview-questions

    81,035GitHub

    This project is a comprehensive, community-sourced repository of technical interview questions and study materials. It serves as a centralized index for software engineers to prepare for technical assessments, benchmark their personal knowledge, and identify gaps in their expertise across a wide range of programming la

    android-interview-questionsangularjs-interview-questionsawesome
  • browser-use/browser-use

    browser-use/browser-use

    78,576GitHub

    Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows

    Pythonai-agentsai-toolsbrowser-automation
  • anuraghazra/github-readme-stats

    anuraghazra/github-readme-stats

    78,445GitHub

    This project is a serverless service that generates dynamic, themeable visual summaries of software development activity. It functions as an automated metadata visualizer, transforming raw platform logs and repository metrics into resolution-independent vector graphics that can be embedded directly into markdown enviro

    JavaScriptdynamicprofile-readmereadme-generator
  • junegunn/fzf

    junegunn/fzf

    77,987GitHub

    This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe

    Gobashclifish
  • hoppscotch/hoppscotch

    hoppscotch/hoppscotch

    77,888GitHub

    Hoppscotch is an open-source API development ecosystem designed for building, testing, and debugging REST, GraphQL, and real-time APIs. It provides a unified platform that functions across web browsers, desktop applications, and command-line interfaces, allowing developers to manage the entire API lifecycle from a sing

    TypeScriptapiapi-clientapi-rest