DocuClipper logo

Managing Project Documents

Use the Documents tab inside a DocuClipper project to re-extract a single statement, delete a source PDF, download the original, and trace transactions back to their source.

Last updated

DocuClipper organizes every source PDF you upload to a project under a dedicated Documents sub-tab, alongside Bank/CC Transactions. From there you can re-extract a single statement, download the original PDF, delete a document, and see extraction status for each file.

Documents, transactions, and projects sit in a clear hierarchy: a project contains many documents, and each document contributes the transactions extracted from it. Understanding that relationship makes troubleshooting and cleanup much faster.

How are documents, transactions, and projects related?

A project is the workspace you create for a client, an account, or a review period. Inside that project, every PDF you upload becomes a document. The transactions that appear in the Bank/CC Transactions tab are extracted from those documents — each transaction is owned by exactly one source document.

When you delete a document, the transactions extracted from it go with it. When you re-extract a document, only that document's transactions are replaced; everything else in the project is untouched.

What is the Documents sub-tab?

Open any project and click Documents in the sub-tab row. You'll see a list of every PDF uploaded to the project, with columns for the file name, the page count, the extraction status (extracted, processing, failed), and whether the document is reconciled.

The list is the single source of truth for what's in the project. If a statement isn't here, its transactions aren't in the project either.

How do I re-extract a single bank statement?

Find the document row in the Documents tab and use the row action to re-extract it. DocuClipper will re-run the extraction pipeline on that PDF only, replace the transactions it previously produced, and update the reconciliation status when it finishes.

Use this when:

  • A statement extracted but the totals don't reconcile and you want a fresh pass.
  • You uploaded the wrong period and replaced the file — re-extract to refresh.
  • A new extraction model rolled out and you want a specific statement re-processed.

Re-extraction does not consume a fresh page allowance the same way a brand-new upload does in most plans, but billing behavior can vary — confirm against your plan's terms if pages are scarce.

How do I delete a document?

Use the delete action on the document row. DocuClipper will ask you to confirm, then remove the PDF and every transaction extracted from it. Tags, categorizations, and notes attached to those transactions are removed with them.

Delete is permanent within the project. If you might need the file again, download the original PDF first.

Can I download the original PDF I uploaded?

Yes. The Documents tab includes a download action that returns the exact PDF you uploaded, byte-for-byte. This is useful when you need to share the source with a client, send it to an auditor, or re-upload it to a different project.

How do I find which statement a transaction came from?

Open the Bank/CC Transactions tab and locate the transaction. The source document is shown on the transaction row (or in the row detail), and clicking it takes you to the document in the Documents tab. From there you can download the original PDF or re-extract it.

This trace-back is what makes DocuClipper auditable: every transaction in the project can be tied to the exact PDF page it came from.

What does reconciled vs not-reconciled mean for a document?

A document is reconciled when its extracted transactions match the printed beginning and ending balances on the statement. Not reconciled means the running total in the extraction doesn't tie to the printed balance — usually because a transaction was missed, duplicated, or amounts were OCR'd incorrectly.

A not-reconciled document is a signal to investigate before trusting downstream analysis. Re-extracting often resolves it; if it doesn't, see How to Fix "Not Reconciled" Statements.

FAQs

How do I re-extract a single bank statement?

Open the project, click the Documents sub-tab, find the statement in the list, and use the row's re-extract action. Only that document's transactions are replaced; the rest of the project is untouched.

What happens to transactions when I delete a document?

Deleting a document permanently removes every transaction that was extracted from it, along with any tags, categorizations, and notes attached to those transactions. The rest of the project's data is unaffected.

Can I download the original PDF I uploaded?

Yes. The Documents tab has a download action on each row that returns the original PDF exactly as you uploaded it.

How do I find which statement a transaction came from?

Each transaction is linked to its source document. From the Bank/CC Transactions tab, the source document is shown on the transaction row, and you can click through to the document in the Documents tab.

What does reconciled vs not-reconciled mean for a document?

Reconciled means the extracted transactions tie to the printed beginning and ending balances on the statement. Not reconciled means the running total doesn't match, usually due to a missed, duplicate, or mis-OCR'd transaction.

How many documents can I upload to a project?

There is no fixed per-project document limit; uploads are bounded by your plan's page allowance, not by the number of files. Very large projects with hundreds of statements work fine, though analysis screens may take longer to load.

Related