piighost-proofreader
An LLM CV proofreader that anonymizes documents before any LLM call.
What it does
piighost-proofreader is an LLM-powered proofreader for CVs. You upload a PDF and get an annotated list of mistakes with click-to-highlight on the rendered pages. The document is anonymized with piighost-api before any LLM call.
How it works
- opendataloader-pdf converts the PDF to Markdown for the LLM.
- PyMuPDF renders each page and emits per-word bounding boxes.
- piighost-api anonymizes the Markdown before the LLM sees it.
- A LangChain and LiteLLM chain runs structured-output proofreading.
- A locator re-anchors each mistake to a page and bounding box.
- Streamlit renders the pages with overlays; clicking a mistake highlights it.
Run it
uv sync --group dev
cp .env.example .env # fill in LITELLM_API_KEY etc.
uv run streamlit run app.pyYou also need a running piighost-api at the URL declared in your .env.