Long screenshot to formatted text. Specialized OCR for chat records, meeting transcripts, and articles.
long2text/
├── web/ # Next.js frontend (Vercel)
├── api/ # FastAPI + PaddleOCR backend (Railway/Docker)
cd web
npm install
npm run dev # http://localhost:3000

cd api
pip install -r requirements.txt
uvicorn main:app --reload # http://localhost:8000

cd api
docker build -t long2text-api .
docker run -p 8000:8000 long2text-api

cd web
npx vercel --prod

The Python backend requires PaddleOCR, which needs ~2GB RAM.
Option 1: Railway
cd api
railway up

Option 2: Docker on your Mac Mini
cd api
docker compose up -d

See web/.env.example for all configuration options.
- Smart splitting of ultra-long screenshots with overlap
- PaddleOCR for high-accuracy Chinese + English recognition
- Scene-specific formatting (chat, meeting, article)
- Markdown output preserving structure
- Bilingual UI (Chinese/English)
- Free tier + paid full results
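
The overlap-based splitting mentioned in the feature list can be sketched roughly as follows. This is an illustration only, not the project's actual code: the function name `split_ranges`, the parameters `tile_height` and `overlap`, and their default values are all assumptions. The idea is that each tile shares a band of pixels with the previous one, so a line of text cut by one tile boundary still appears whole in the neighbouring tile.

```python
def split_ranges(
    total_height: int,
    tile_height: int = 2000,  # assumed default, not taken from the project
    overlap: int = 200,       # assumed default, not taken from the project
) -> list[tuple[int, int]]:
    """Return (top, bottom) pixel ranges that cover an image of the given
    height, where each range overlaps the previous one by `overlap` pixels."""
    if tile_height <= 0:
        raise ValueError("tile_height must be positive")
    if not 0 <= overlap < tile_height:
        raise ValueError("overlap must be non-negative and smaller than tile_height")
    if total_height <= 0:
        return []

    step = tile_height - overlap  # how far the window advances each time
    ranges = []
    top = 0
    while True:
        bottom = min(top + tile_height, total_height)
        ranges.append((top, bottom))
        if bottom == total_height:  # the last tile reaches the image bottom
            break
        top += step
    return ranges


if __name__ == "__main__":
    for top, bottom in split_ranges(5000):
        print(top, bottom)
```

Each `(top, bottom)` pair could then be used to crop the screenshot before handing the tile to OCR. Text recognized twice inside the overlap band would still need to be deduplicated when the tiles are merged; that step is not shown here.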