• Priti Sahoo

An optimal tool for text recognition from images- TESSERACT using python

Earlier we came across some blogs about the OCR where we could do text recognition using python. But building an OCR system isn't always an easygoing as developers always come many challenges like different fonts in images, poor contrast , cursive writings, multiple objects in images, etc.

Here, we will discuss about an tool rather a library ,whose use can be really effective .

Tesseract does this by two processes text detection, where the textual part within the image is detected, then the text recognition where the text location is derived and text is extracted from the image.


  • 2.7 or above

  • Python Image library or Pillow

  • pytesseract

  • Tesseract OCR .exe