


Matters are also complicated by the fact that OCR computer software needs very sophisticated algorithms to translate the image of text into accurate actual text. OCR software is not mainstream so open source alternatives to proprietary heavyweight software are fairly thin on the ground. We cover OCR engines as well as front-end tools. This article focuses on desktop, open source OCR software that offer good recognition accuracy and file formats. For some, online OCR services may be useful, but there are privacy concerns and file size limitations. The selection of the right OCR tool is dependent on specific needs. OCR technology is vital for gaining access to paper-based information, as well as integrating that information in digital workflows. The benefit of scanning documents is not purely for archival reasons. There is computer software that makes this conversion possible. Paper documents contain a wealth of important management data and information that would be better stored electronically. Things have changed in the past few years, with a marked shift in the paperless office concept. However, the office environment has shown a resistance to remove the mountain of paper generated. We have witnessed talk of a paperless office for more than 40 years. For example, the vast majority of journeys on the London Underground are made using the Oyster card without a paper ticket being issued. The use of paper has been displaced from some activities. OCR software is able to recognise the difference between characters and images, and between characters themselves. Optical Character Recognition (OCR) is the conversion of scanned images of handwritten, typewritten or printed text into searchable, editable documents.
