Optical character recognition

Optical character recognition (OCR) is the recognition of printed or written characters by a computer running specialized software.

Process
Each page of text is converted to a digital image using a scanner and then OCR software is then run to process this image to produce a text file. This involves complex image processing algorithms and does not always achieve 100% accuracy so manual proof reading is recommended.