Zone OCR is used to read document indexes or tags from text on the page. It is a great way to automate the data entry associated with scanning documents.
However, there are several limitations to zone OCR that must be overcome:
- Index information must be in the exact same place on every page
- Documents shift and skew during scanning, causing the zones to not line up
- If surrounding lines or text on the document are too close, they can encroach on the zone
SimpleIndex 11.4 Release Notes
Friday, 05 September 2025
SimpleIndex 11.4 new version, adds major new features like an easy option to customize what emails need to be processed from email boxes by setting dates and subfolders and automatically updating them each time these emails are processed. Additionally, ChatGPT integration was added. This allows you to extract index values and text from any document
- Published in Release Notes
Automatic Image Splitting
Tuesday, 10 October 2023
Speed up scanning of multi-page layouts and spreads with SimpleIndex’s built-in tools Magazines, pamphlets, booklets and similar documents can be a scanning headache, particularly when you want to capture each page, or even each block of text, as its own separate file. Splitting the pages and cropping each one to its final format adds hours
Zone OCR and Dynamic OCR
Monday, 07 November 2022
Other document scanning applications in this price range use Zone OCR to obtain index data from the page. SimpleIndex improves upon this time-tested but limited model with its Dynamic OCR feature. Let’s look at the difference between the two methods: Zone OCR Zone OCR is used to read document indexes or tags from text on
Language Pack for Standard/Tesseract OCR
Monday, 01 November 2021
Please refer to the Wiki Documentation for the complete Global Settings Wizard reference. All versions of the SimpleIndex software include OCR with the Standard/Tesseract OCR engine. The SimpleIndex download only includes a limited set of languages with the installation. If the language you would like to OCR with SimpleIndex isn’t one of the languages included then you can download
Languages Supported in SimpleSoftware OCR Engines
Monday, 02 December 2019
Please refer to the Wiki Documentation for the complete Languages reference. SimpleSoftware OCR engines are using two different systems for language support. In the end languages supported by your OCR is based on your version of SimpleIndex installed, any addons (SimpleIndex Server, SimpleCoversheet, and so on) do not add any additional language support. All SimpleSoftware products have Tesseract 3.02
Change the OCR Font or Type
Monday, 29 July 2019
Please refer to the Wiki Documentation for the complete OCR Options reference. This is used to changed the default OCR recognition font or type from the default, which is “To Be Detected”. This can be used to look for a specific type of OCR font and is especially useful for recognizing things like Dotmatrix, OCR A and OCR B.
I’m using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
Wednesday, 28 February 2018
Please refer to the Wiki Documentation for the complete Full-Page OCR reference. SimpleIndex version 7 solves this problem with the incorporation of the FineReader OCR engine. Full text in PDFs will now flow with the formatting of the PDF. Legacy Versions: SimpleIndex can also be used with other OCR applications and servers to improve accuracy, formatting and
- Published in OCR
How do you train the OCR engine for better accuracy?
Wednesday, 28 February 2018
Training has been removed with version 7 due to the addition of the ABBYY FineReader OCR engine.
- Published in OCR
How do you configure full text searching in Retrieval mode?
Wednesday, 28 February 2018
Please refer to the Wiki Documentation for the complete Database Settings reference. On the Database tab there dropdown in the lower portion of the panel for Full Text OCR Field. Put the name of the field that will store the full-text data there. This must be configured both for Insert and Retrieval mode configurations. The database field
- Published in Database & Retrieval, OCR
How can I improve recognition rates for my OCR fields?
Wednesday, 28 February 2018
There are several things you can do to improve accuracy for OCR. Scan at 300dpi, black & white for best results. Adjust the scan settings to remove background noise and improve the definition of characters. For Zone OCR, field recognition can often vary based on the surrounding white space and text in the zone. Try
- Published in OCR
Can OCR text be saved to Office, Text, HTML or other formats?
Wednesday, 28 February 2018
Yes. On the OCR step of the Job Settings Wizard you can select the text output format need in the “Full-page OCR file type” drop down. By default it is set to PDF, but can be changed to Text (txt), Word (docx), Rich Text (rtf), Open Office (odt), Excel (xlsx), PowerPoint (pptx), ePub Zip (epub),
- Published in Licensing & Installation, OCR
Can SimpleIndex create searchable PDF Image+Text files with hidden text?
Wednesday, 28 February 2018
Yes, it can. You can configure this setting in the Job Settings Wizard by going to the OCR step and checking “Enable full-page OCR”. There are many settings in the OCR step that you can used to customize the output and recognition of images. SimpleIndex has two different OCR engines (Standard and Professional) that can
- Published in Export, OCR, Office PDF Text Processing
- 1
- 2

