Skip to content

Validating Documents

Documents, images, and text files can be validated upon import. It can also be accessed by first checking the documents you wish to validate, clicking on the “Process” tab, and in the “Admin” section, clicking on “Check Documents”.

Then this window appears.


A brief description of the options:

  • Check Files Exist - Combines the root data path with the document's resource path to verify the file can be found. A warning will be reported if the file could not be found.

    • To view the field for natives, right click on a field name header, select “Field Chooser”, check the box for “HAS NATIVES*”, and then close out of the window.
  • Check Page Counts - The total number of frames found in all of a document's page files will be matched against the total number of pages imported for the document. A warning will be reported if there is a discrepancy between the page counts.

  • Check Image Compression - Attempts to decompress each image file and frame. A warning will be reported if an image frame can't be properly decompressed.

  • Check Document Ranges - Reports gaps in numbers between page and document identifiers. Note that this option is dependent upon the sort order of the documents.

  • Check for Duplicate Pages - Checks the list of page identifiers across the document set for duplicates. A warning will be reported if a duplicate page (bates) identifier was found.

  • Generate Hashes - Populates MD5 hashes for image files, native files, and/or text files. Those options will be available for selection after checking the “Generate Hashes” checkbox.

    • To view the file MD5 hash, right click on a field name header, select “Field Chooser”, check the box for “FILE HASH*”, and then close out of the window.
  • Detect Color Pages - Populates the “HAS COLOR” field for documents and pages that have color.

    • To view this field, right click on a field name header, select “Field Chooser”, check the box for “HAS COLOR*”, and then close out of the window.
  • Detect Blank Pages - Populates the “HAS BLANK” field for documents and pages that are blank.

    • To view this field, right click on a field name header, select “Field Chooser”, check the box for “HAS BLANK*”, and then close out of the window.
  • Detect Text in PDF Files - Populates the “TEXT LEVEL” field for PDF files that contain selectable text.

    • To view this field, right click on a field name header, select “Field Chooser”, check the box for “TEXT LEVEL*”, and then close out of the window.