1 repo
Document Processing Pipelines — Data Processing Pipelines
We curate 1 GitHub repository matching data processing pipelines · Document Processing Pipelines. Refine with filters or upvote what's useful.
Document Processing Pipelines — Data Processing Pipelines
We'll search the best matching repositories with AI.
- opendatalab/MinerU
MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w
Pythonai4sciencedocument-analysisextract-data