Release Notes

v1.0.0 - Initial Release (November 19, 2025)

Overview

First public release of PDF Tools for ComfyUI - a comprehensive suite of PDF processing, OCR, and AI vision analysis nodes.

Features

PDF Extraction

PDF Extractor v08/v09 - Advanced image extraction with quality assessment
- Automatic spread detection for scanned books
- Image quality scoring (sharpness, contrast, brightness)
- Duplicate detection
- Organize output by quality
- JSON metadata export
Simple PDF Extractor - Basic PDF image extraction

OCR Nodes

Surya OCR Layout Node - Multilingual OCR with 90+ languages
- Advanced layout detection
- Reading order analysis
- Table detection
- Multiple output formats (text, JSON, markdown)
PaddleOCR VL Remote - Visual-Language OCR
- Requires separate virtual environment (see PaddleOCR_VL_SETUP.md)
- CUDA 12.6 support

AI Vision & Layout Analysis

Florence-2 Cropper - AI-powered image cropping and region detection
LayoutLMv3 Node - Microsoft's document understanding model
Enhanced Layout Parser v06 - Advanced document layout analysis
Rectangle Detector - Geometric shape detection

Technical Details

Dependencies

Python 3.11.6+
PyMuPDF (AGPL v3)
Surya OCR (GPL v3)
Florence-2 (MIT)
transformers
torch
pillow
numpy

Installation

cd ComfyUI/custom_nodes/PDF_tools
.\install.ps1

Verification

.\check_install.ps1

Documentation

Complete documentation included:

README.md - Main documentation
INSTALLATION_GUIDE.md - Detailed installation instructions
QUICKSTART_SURYA.md - Quick start for Surya OCR
SURYA_OCR_NODE_GUIDE.md - Complete Surya guide
PaddleOCR_VL_SETUP.md - Separate venv setup for PaddleOCR
BATCH_PROCESSING_GUIDE.md - Batch processing workflows
PDF_LAYER_DETECTION_GUIDE.md - Layer detection details
LAYER_DETECTION_QUICKREF.md - Quick reference
CODE_OVERVIEW.md - Code structure
CONTRIBUTING.md - Contribution guidelines
CREDITS.md - Dependencies and licenses

License

Dual License:

Non-Commercial: Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
Commercial: Requires separate commercial license

Package Split Notice

Download functionality (gallery-dl, yt-dlp) has been separated into a standalone package called "download-tools". This package now focuses exclusively on PDF processing, OCR, and vision analysis.

Repository

GitHub: https://github.com/EricRollei/PDF-Tools
Release: https://github.com/EricRollei/PDF-Tools/releases/tag/v1.0.0

Installation from GitHub

cd ComfyUI/custom_nodes
git clone https://github.com/EricRollei/PDF-Tools.git PDF_tools
cd PDF_tools
.\install.ps1

Known Issues

PaddleOCR VL requires separate virtual environment due to CUDA version conflicts (12.6 vs 12.8)
See PaddleOCR_VL_SETUP.md for detailed setup instructions

Contributors

Created and maintained by Eric Rollei

Statistics

111 files
69,115 lines of code
15+ processing nodes
30+ documented dependencies
Complete license headers on all Python files

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Release Notes

v1.0.0 - Initial Release (November 19, 2025)

Overview

Features

PDF Extraction

OCR Nodes

AI Vision & Layout Analysis

Technical Details

Dependencies

Installation

Verification

Documentation

License

Package Split Notice

Repository

Installation from GitHub

Known Issues

Contributors

Statistics

Uh oh!

Releases: EricRollei/PDF-Tools

v1.0.0 PDF Tools v1.0.0 - Initial Release

Release Notes

v1.0.0 - Initial Release (November 19, 2025)

Overview

Features

PDF Extraction

OCR Nodes

AI Vision & Layout Analysis

Technical Details

Dependencies

Installation

Verification

Documentation

License

Package Split Notice

Repository

Installation from GitHub

Known Issues

Contributors

Statistics

Uh oh!