321 repos
Awesome GitHub repositories, curated.
A community-curated directory of interesting public GitHub repositories. Ask in plain English — AI ranks by relevance. Save what you find.
Browse repositories
- microsoft/markitdown
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine
AI-Powered Extraction EnginesDocument ParsersLLM-Integrated Extraction Pipelines - microsoft/ML-For-Beginners
This project is an open-source educational curriculum designed to provide a structured path for developers to master machine learning and generative AI. It functions as a technical skill development platform, offering comprehensive study materials that guide learners through fundamental concepts, algorithms, and the pr
Educational CurriculaGuided TutorialsMachine Learning Education - microsoft/playwright
Playwright is a comprehensive browser automation framework designed for end-to-end testing and web workflow automation. It provides a unified API to drive web applications across multiple browser engines, enabling developers to simulate complex user interactions, perform web scraping, and validate application behavior
Browser Automation FrameworksAccessibility-Tree-Based LocatorsActionability Engines - microsoft/PowerToys
PowerToys is a collection of background-resident system utilities designed to extend native operating system functionality and streamline desktop workflows. It operates as a modular toolkit, utilizing a central plugin-based host architecture that allows users to dynamically enable or disable specific features for syste
Cross-Device Input ControllersCross-Device Input SynchronizersDesktop Workflow Optimizers - microsoft/terminal
This project is a terminal emulator that provides a modern command-line interface with support for tabbed navigation, GPU-accelerated text rendering, and comprehensive Unicode character display. It functions as a host for multiple shell sessions, managing them as independent processes within a unified windowing environ
Modern Terminal EmulatorsCommand-Line Profile ManagersGPU-Accelerated Rendering Engines - microsoft/TypeScript
TypeScript is a language that extends standard syntax by adding a static type system. It identifies potential runtime errors by analyzing the behaviors and capabilities of values during the compilation process. The language supports object-oriented structures, including classes with inheritance and member visibility co
Conditional TypesPartial Type ConstructionAbstract Classes - microsoft/vscode
This project is a cross-platform code editor designed for software development, offering a comprehensive suite of tools for text editing, workspace management, and task automation. It includes native support for version control, an integrated terminal, and a flexible task runner that allows for the execution of build,
Code EditorsAutonomous Coding AgentsExtension Hosts - microsoft/Web-Dev-For-Beginners
This project is an open-source educational curriculum designed to facilitate technical skill acquisition through a structured, project-based learning framework. It serves as a centralized knowledge base that guides learners through foundational web development concepts, modern programming logic, and advanced technical
Curriculum ModulesOpen-Source CurriculaPedagogical Frameworks - minio/minio
MinIO is a software-defined, cloud-native object storage server designed to manage large volumes of unstructured data. It functions as a distributed storage cluster that aggregates multiple independent nodes into a unified, scalable pool, providing a high-performance infrastructure compatible with standard cloud storag
Cloud Native Object StorageDistributed Storage ClustersObject Storage Servers - Mintplex-Labs/anything-llm
This platform serves as a comprehensive environment for managing private language models, document knowledge bases, and automated agent workflows within secure local infrastructure. It functions as a document-aware workspace that enables users to ingest diverse file formats into searchable repositories, ensuring that a
Agentic Workflow EnginesAI Agent OrchestratorsDocument-Aware AI Workspaces - MisterBooo/LeetCodeAnimation
LeetCodeAnimation is an educational code archive and technical interview resource designed to help developers master complex programming concepts. It functions as a centralized repository of source code and instructional materials, providing a structured environment for self-paced learning of fundamental computer scien
Algorithm VisualizationsAlgorithmic Problem SetsEducational Code Archives - mlabonne/llm-course
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we
AI Research RepositoriesFine-Tuning StrategiesLarge Language Models - modelcontextprotocol/servers
The Model Context Protocol is a standardized communication framework designed to connect language models to external data sources, functional tools, and interactive user interfaces. It provides a vendor-neutral interface layer that enables AI hosts to discover and execute capabilities across heterogeneous service envir
AI Context Integration ProtocolsAI Agent Tool IntegrationsAI Interoperability Layers - mozilla/pdf.js
This project is a portable document rendering engine designed to parse and display complex document layouts directly within standard web browser environments. It functions as a web-native viewer that enables the presentation of documents without requiring external software or browser plugins. The engine utilizes a can
Browser-Based PDF EnginesWeb-Based Document ViewersCanvas Rendering Engines - mrdoob/three.js
This project is a high-level 3D graphics engine designed to render complex, hardware-accelerated environments within web browsers. It provides a comprehensive abstraction layer that manages scene graphs, cameras, and lighting, mapping high-level scene definitions onto low-level graphics APIs. By decoupling these defini
3D Graphics Engines3D Rendering EnginesAbstraction-Layer Rendering Backends - mtdvio/every-programmer-should-know
This project is a comprehensive, community-curated knowledge base designed to support software engineers in mastering both fundamental computer science principles and practical industry methodologies. It serves as a centralized reference library that aggregates technical resources, academic literature, and professional
Career GuidanceComputer Science FoundationsComputer Science Fundamentals - mui/material-ui
Material UI is a comprehensive component-based library designed to accelerate the construction of professional, accessible web interfaces. It provides a centralized design system framework that manages visual tokens, typography, and color palettes to ensure consistent branding and layout constraints across complex appl
Component LibrariesComponent-Based UI LibrariesAccessible Component Suites - MunGell/awesome-for-beginners
This project is a curated directory of software repositories specifically selected to help newcomers make their first open-source contributions. It serves as a collaborative knowledge base that aggregates entry-level development opportunities, providing a structured path for novice developers to practice version contro
Awesome ListsContributor Onboarding GuidesOpen Source Directories - n8n-io/n8n
n8n is a workflow automation platform that combines a visual interface with code-based extensibility to design, orchestrate, and manage automated processes. It provides a comprehensive suite of tools for data transformation, filtering, and storage, allowing users to build complex logic through conditional branching, lo
Visual Data-Flow SystemsWorkflow Automation PlatformsBuilt-in Integration Nodes - NationalSecurityAgency/ghidra
Ghidra is a software reverse engineering suite designed to analyze compiled binaries and reconstruct program logic without access to original source code. It provides an interactive environment for disassembly and decompilation, utilizing a platform-independent intermediate representation to maintain consistency across
Binary Analysis FrameworksBinary DisassemblyBytecode Intermediate Representations