A small tool to scan PDF to YAML
PDF Table Scanner
This tool captures data from course descriptions in PDF files and converts it into a machine-readable format (JSON or YAML).
Prerequisites
This is a development version. It is run with uv, so uv needs to be installed.
Usage
[me@machine pdf-scanner]$ uv run scanner ~/Documents/ModulHandBuch.pdf --to-json > data/ma_widt.json
Options
Usage: scanner [OPTIONS] FILENAME
╭─ Arguments ────────────────────────────────────────────────────────────────────────────────────────────────╮
│ * filename TEXT [required] │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ──────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --to-json --no-to-json [default: no-to-json] │
│ --to-yaml --no-to-yaml [default: no-to-yaml] │
│ --sanitize --no-sanitize [default: sanitize] │
│ --debug --no-debug [default: no-debug] │
│ --install-completion Install completion for the current shell. │
│ --show-completion Show completion for the current shell, to copy it or customize │
│ the installation. │
│ --help Show this message and exit. │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
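The invocation shown under Usage can also be driven from a script, for example to convert several module handbooks in one go. The following is a minimal sketch, not part of the project: `build_command` and `scan_to_json` are hypothetical helper names, and the sketch assumes that `scanner` writes nothing but the JSON document to standard output when `--to-json` is given (which the shell redirect in the usage example suggests, but which may not hold with `--debug`). Only flags listed in the help output above are used.

```python
import json
import subprocess
from pathlib import Path


def build_command(pdf_path, sanitize=True, debug=False):
    """Assemble the argument list for one scanner run with JSON output.

    Hypothetical helper: it only mirrors the flags shown in the help text.
    """
    cmd = ["uv", "run", "scanner", str(pdf_path), "--to-json"]
    if not sanitize:
        # sanitize is on by default, so only the negated flag is ever needed
        cmd.append("--no-sanitize")
    if debug:
        cmd.append("--debug")
    return cmd


def scan_to_json(pdf_path, out_path, **flags):
    """Run the scanner, parse its standard output as JSON, and save it.

    Assumption: standard output contains only the JSON document.
    json.loads raises an error if that assumption does not hold.
    """
    result = subprocess.run(
        build_command(pdf_path, **flags),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    data = json.loads(result.stdout)
    Path(out_path).write_text(
        json.dumps(data, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return data
```

Calling `scan_to_json("ModulHandBuch.pdf", "data/ma_widt.json")` from the project directory would then correspond to the shell command above, with the added check that the output is valid JSON before it is written to disk.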
© 2025-2026 Hartmut Seichter, published under the MIT License