- http://dokupuppylinux.co.cc/programs:ocr (download)
- http://en.wikipedia.org/wiki/Ocrad
html man is included: type man ocrad to see
GNU Ocrad is an OCR (Optical Character Recognition) program and library based on a feature extraction method. It reads images in pbm (bitmap), pgm (greyscale) or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. The pbm, pgm and ppm formats are collectively known as pnm.
Ocrad includes a layout analyser able to separate the columns or blocks of text normally found on printed pages.