New instructions / functions
This commit is contained in:
Sebastian Mondial 2025-02-19 21:57:03 +00:00
parent 4c18bd2cf9
commit 09e0f8e39b

View file

@ -1,10 +1,10 @@
# pdf-mass-cleanuptools v5 # pdf-mass-cleanuptools v6
Clean up that MetaDataMess Clean up that MetaDataMess
## Needs: ## Needs:
+ pip install pdf2image anthropic tqdm + pip install pdf2image anthropic tqdm PyPDF2
+ sudo apt-get install poppler-utils + sudo apt-get install poppler-utils
before running: export ANTHROPIC_API_KEY='your-api-key-here' before running: export ANTHROPIC_API_KEY='your-api-key-here'
@ -21,4 +21,10 @@ python pdf_processor.py -i /path/to/pdfs -o /path/to/output --test
python pdf_processor.py -i /path/to/pdfs -o /path/to/output --pattern "magazine_*.pdf" python pdf_processor.py -i /path/to/pdfs -o /path/to/output --pattern "magazine_*.pdf"
### Keep temporary files for inspection ### Keep temporary files for inspection
python pdf_processor.py -i /path/to/pdfs -o /path/to/output --no-cleanup python pdf_processor.py -i /path/to/pdfs -o /path/to/output --no-cleanup
### With MetaData
python pdf_processor.py -i /path/to/pdfs -o /path/to/output --write-metadata
### With MetaData - and skip Backups if you dare
python pdf_processor.py -i /path/to/pdfs -o /path/to/output --write-metadata --no-backup