Sunday, November 30, 2008

Text to Image Converter (for Linux)

For several purposes of our OCR research we needed a converter that can generate an image from any given Bangla/ Bengali text. One of the reasons was to prepare artificial training and testing data. For this reason we finally created a very simple text to image converter. Actually the converter is a collection of two python script files.

Prerequisites:
pango-view

Feedback:
Please reply your feedback to the following email address: bangla.ocr@gmail.com

Download:
TTI for Linux

Interesting test on recognition performance of BanglaOCR : DPI & scan vs computer generated image.doc

I was planning to test the performance of BanglaOCR for different type image (scanned image and computer generated image) in different dpi.

First of all I generated 4 images with dpi 100, 200, 300 and 400 ant test the performance. The output is as follows:

100 dpi:

Output: আরো তুমি ভো সারাদিনই খেনা এ।ন আমাদের ধেনভে দাংবা

200 dpi:

Output: আরেঢূ তুমি তো সার।দিনই খেল । এখন আমাদের খেলতে দ।ও।

300 dpi:

Output: আরে! তুমি তো সারাদিনই খেল। এখন আমাদের খেলভে দাও।

400 dpi:

Output: আরে! তুমি তো সারাদিনই খেল। এখন আমাদের খেলতে দাও।

In the second of this testing I scanned a line of text (font: SolaimanLipi, size 16) with dpi 100, 200, 300 and 400 ant test the performance. The output is as follows:

100 dpi:

Output: জারে! ড়ুমি ড়ো পীরুাদিনই খেদীা এখন থীনীদের খেঢৗতে দভো

200 dpi:

Output: আরে! তুমি ভো সারাদিনই খেল। এখন অ।ম।দের খেলভে দ।ও।

300 dpi:

Output: আরে! তুমি ভো সারাদিনই খেল। এখন আমাদের খেলভে দাও।

400 dpi:

Output: আরে! তুমি তো সারাদিনই খেল। এখন আমাদের খেলভে দাও।

The above observations make this clear that if we increase the dpi then the output will be better.

Monday, November 24, 2008

BanglaOCR V 0.5 (for Linux users) | Released

This is the announcement about the release of BanglaOCR V 0.5 (for Linux users). The release information is provided below:

Perquisites:
1. Tesseract OCR
2. Tidy
3. Java Runtime Environment (v 1.6)
4. Font : SolaimanLipi

User's Manual:
The manual is available with the package (BanglaOCR User Guide.pdf).

Feedback:
Any feedback about the application is highly appreciable. At present you can post feedback on the following email address: bangla.ocr@gmail.com

Test Images:
A set of test images are also available in the same site where the OCR setup package is available. So, please feel free to download them and test. The images were randomly selected to test the application and obviously not based on the best performance.

Download Link:
BanglaOCR V 0.5 (Linux Users)
Test Images

Wednesday, November 19, 2008

BanglaOCR V 0.6 | Released

This is the announcement about the release of BanglaOCR V 0.6. We already fixed few problems in V 0.5 and after solving those we moved to the next release. The release information is provided below:

Perquisites:
1. Microsoft .NET Framework Version 2.0 Redistributable Package (x86)
2. Microsoft Visual C++ 2005 Redistributable Package (x86)
3. Java Runtime Environment

User's Manual:
The manual will be available after installing BanglaOCR. Go to Start>> All Programs >> BanglaOCR>> BanglaOCR User Guide.

Feedback:
Any feedback about the application is highly appreciable. At present you can post feedback on the following email address: bangla.ocr@gmail.com
I am planning to build a document image database. So, if you would like, then send us your document images.

Test Images:
A set of test images are also available in the same site where the OCR setup package is available. So, please feel free to download them and test. The images were randomly selected to test the application and obviously not based on the best performance.

Download Link:
BanglaOCR V 0.6
Test Images

Saturday, November 15, 2008

BanglaOCR V 0.5 | Internal Release

I would be happier if it is possible to avoid the word "Internal Release" now. But unfortunately after waiting for 15 days I am feeling that I should do this. This release is termed as internal as I am experiencing "Memory Leak" problem in my developed application and yet struggling to fix and solve that. However as I am trying to solve the memory leak problem therefore I am releasing this version for the purpose of internal testing and feedback.

Perquisites:
1. Microsoft .NET Framework Version 2.0 Redistributable Package (x86)
2. Microsoft Visual C++ 2005 Redistributable Package (x86)

Cautions:
1. Existence of memory leak problem: You might see that there are error messages about memory allocation. So, in that case you have to close the application and restart the OCR again. This is what the problem that I mentioned above. I am trying to solve this.

Feedback:
Any feedback about the application is highly appreciable. At present you can post feedback on the following email address: bangla.ocr@gmail.com

Test Images:
A set of test images are also available in the same site where the OCR setup package is available. So, please feel free to download them and test. The images were randomly selected to test the application and obviously not based on the best performance.

Download Link:
BanglaOCR V 0.5
Test Images

Saturday, November 1, 2008

Bangla tesseract training data v-2.0 have been uploaded

I just uploaded the Bangla training data for tesseract engine. To be honest there are lot more works to do to improve the training data so that the recognition performance increases. So, I hope we will be able to improve the training data and thus newer version of the data will be available soon. If anyone want to take part of this task (preparing training data) and need any help then please feel free to contact with me. The links of the training data are given below:
http://ocropus-bengali.googlecode.com/files/Bangla%20tesseract%20training%20data%20v-2.0.zip

or

http://mhasnat.googlepages.com/Bangla_training_data_v_2_0.zip