Document Scanner and OCR

Overview

A simple document scanner with OCR implemented using Python and OpenCV – OCR is implemented using tesseract.
Optical character recognition or optical character reader is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and “read” the text embedded in images.

Python-tesseract is a wrapper for Google’s Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Pillow and Leptonica imaging libraries, including jpeg, png, gif, bmp, tiff, and others. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file.

Project Link

 

Dependencies

  1. Numpy
  2. OpenCV
  3. imutils
  4. skimage

How to run

  1. Download the project
  2. Give appropriate image path in scanner,py main() function
  3. Run python scanner.py to execute the project

Courtesy: pyimagesearch.com

 

 

 

Free

xpertup

Let's Expert Up