Showing posts with label ABBYY TextGrabber. Show all posts
Showing posts with label ABBYY TextGrabber. Show all posts

Thursday, December 1, 2016

KNFB Reader, a hands on experience

As mentioned in my previous post, I was looking for a mobile OCR apps to help me read the info  on packages.  I tried the TextGrabber (TG) and the result was very disappointing.  I mentioned that KNFB Reader may be an better OCR apps but I didn't want to buy it due to it's high cost.   Recently, I saw a notice on the KNFB website saying that Google is supporting  KNFB development, and the apps is selling for $20 for a limited time.  I thought the price was reasonable so I bought a copy.  I also found out that the apps has try before you buy option.   You can try the full function for 25 pictures, after that, you have to do in apps purchase in order to continue using the apps.

Unlike TG, KNFB Reader was designed for a blind user in mind.  There are couple ways to help a blind user to position the camera to capture a good image.   While TG only has manual mode, KNFB has three ways to capture an image, manual mode, auto mode, and stand mode.  There are extensive online video on on this app.   With that in mind, I would provide a link to each feature follow by my own hands on experience.  This demo video provides an overview of KNFB Reader's features. 

When you open the app, you are at the take picture screen.  In addition to take picture, this screen also let you change various options, like picture mode, language, file management, etc.  There are three ways to take a picture, manual mode, automatic mode, and stand mode.

Manual picture mode is the default mode.  If you can position your phone to take the picture manually, this is the fastest method.  After you press the shutter button, the OCR goes to work immediately and read the text on the reading screen within few seconds.  I use this mode to quickly sort out junk mails from important mails.

If you can't take the picture manually, you can take a picture with the automatic picture mode.  The camera will take a picture after it found all 4 corners of the paper.   This mode is fairly time consuming and doesn't work all the time, but if you put a light color paper on dark background, the automatic picture mode would work much better.  You can use the field of view and tilt assistance as additional help to take a picture.  Note:  the video demo the field of view usage with a split tap gesture on an iPhone, this gesture didn't work on my Android phone.

If you have a lot of material to scan, the stand mode would be very helpful.  You will need to buy an optional scan stand in order to use this mode.   Fopydo scanning stand appear to be a popular choice.  The photo below shows my document scanning station.  I use it read through junk mail and letters.   


Smartphone sits on top of Fopydo scanning stand


One of the major reason that I want a mobile OCR apps is to read info on various packages while shopping.  I tried it out on one of my Costco shopping trip and I was disappointed with result.  The automatic mode was pretty much useless in a shopping environment because of product placement and/or lighting condition.   For example, there were boxes of food placed side by side on a shelf, the automatic mode was not able to take a picture because, I guess, it couldn't find all 4 corners of a box.  I had to pull one box out, put it on my chopping cart, for the automatic mode to work some of the time.  I ended up using manual mode for the rest of the shopping trip.

If the package is not printed on a flat surface, like the text on plastic food packages, the KNFB OCR result is about the same as TextGrabber, that mean both were poor visual aids for shopping.  The  automatic mode didn't work on the long strip of register tape either because again it couldn't find the 4 corners.   So I had to use manual mode to scan the long tape section by section, and because the tape won't lay flat on a table, I had to put a coin at the top and bottom of the tape to hold it flat.   I think the whold shopping experience was more trouble than its worth.

I had a much better result using KNFB reading food package at home because I could remove the food and flatten the plastic package.  Both KNFB and TG had similar OCR result with flattened packages.  I was surprised that KNFB able to read most text on curve surfaces. I took a picture of a plastic jar of peanut butter, a can of soup, and a small bottle of medication, the OCR result were just fair to good but I had enough useful info.  TG  was not able to recognize any text on curve surface. 

In conclusion, if you only want to scan plain black text printed on flat white paper, both KNFB and TG produced about the same OCR result.   The KNFB did slightly better than TG on color magazine pages and news letter with graphics.  Both apps have trouble reading text printed on non flat surfaces.  Unlike TG, KNFB was able to read text printed on curve surfaces.

TG was not designed for blind user in mind, so you must to see well enough to point and shoot the document manually.  You also need to press couple more buttons to have the text read to you after you took the picture.  If you can't do point and shoot, KNFB is the way to go.  This apps would read out the text after a picture is taken without additional action.  I think a better camera and more user experience would probably improve the OCR result on either apps.

Finally, TG is a lot cheaper at $2 a copy, while KNFB costs $100 at regular price, but often on sale.  I bought my copy for only $20.  Only you can determine whether the much higher price of KNFB is worth  for you.  I think I would use a Bluetooth headset when using the KNFB in public to avoid unwanted attention. 

Update July 2017
Microsoft just released a free IOS app called SeeingAI. From what I saw in the following video, I think this app could make the KNFB Reader obsolete.  I don't have an iPhone so I couldn't try it out, hopefully Microsoft would release an Android version in the future.
SeeingAI Demo Video





 

 

Sunday, July 17, 2016

ABBYY TextGrabber, not I was hoping for

As an old man with severe visual impairment due to RP, I depend on technologies to help me read the printed material.  For example, I use the ZoomText screen reader to read the text on the monitor, I use a desktop CCTV to read printed material,   and I have a portable CCTV when I am away from my desk.  I also use the scanner and a OCR program called ABBYY FineReader to convert  the scanned printed material to text format for my Zoomtext reader to read it to me.

When I saw the ABBYY TextGrabber app in the Google play store selling for only $2.00, I bought a copy.  The FineReader works really well on my PC with the scanned material and I was hoping the TextGrabber would work equally well on a smartphone.  What I was hoping the TextGrabber to do is to let me take a photo of the text on a item and let the OCR generate a text file from the photo, after that I would let Talkback read the text to me.  I was hoping I can use this mobile app in the stores so I can read nutritional fact, cooking direction, and any other info before buying that item.   I knew the TextGrabber was designed to recognize text on flat paper, and I wanted to fine out whether it would recognize text on flat surfaces like text on front and back of various packages.

The TextGrabber was not designed for a blind user in mind.  A person who use this app must have enough vision to position the camera about 12 inches away from the text to take a photo.  The app's main user interface screen only has 2 large icons.The upper icon is a camera and the lower icon is a photo album.  The  operation of this app is very easy.  After you tap the camera icon, you  would position the camera and tap the shutter  button  to take a photo of the text that you want the OCR function to work on.   After the photo was taken, tap the read button, the OCR would convert the image into a text file in few seconds. When the text file appear on the screen, you tap the screen to let Talkback read the text to you.  The photo album has the previous photos you have taken.  The TextGrabber app also have a language translation function which I won't test at this time.

After taking photos of many type of packages, the result was very disappointing.  The TextGrabber's OCR function had problem recognize almost all the text at the front of the package.  The OCR did do a little bit better with text at the side of back of the packages.   I think this is not the fault of TextGrabber because it was not design for this purpose.  Anyway, I can't use this app to ID items or get useful info from packages.  This app is pretty much useless to me as a low vision aid, luckily it only cost me $2.00.

Here are few photos:
Box of tea bags

Box of cereal

Bottle of prune juice

Can of SPAM




The app didn't  do that much better with regular printed material, like junk mail, post card, menu, etc.  If I have to guess, it had at most 50% accuracy.  The best OCR result came from a perfectly flat paper, with high contrast text and good lighting.  The ABBYY FineReader running in my PC had much better OCR result, I estimated at least 90% accuracy.  I am sure the resolution of the camera and the skill of a user also affect the OCR result.  I ran the test on my LG 22C smartphone which has a 5MP camera.

Look like I would need to find something else if I like to ID stuff on my own.  I knew another mobile OCR app called KNFB Reader.  This app was specially designed for blind user.  This app usually cost $100 but often on sale for around $75.  I have no idea whether it would do any better  reading text on packages.  I don't want to spend that kind of money to find out at this time.  If anyone has experience on this app, feel free to comment below.  As far as I can tell, no OCR can read text on a curved surface, that mean I couldn't use it the read the cooking direction on a can of soup.
  KNFB Reader Demo Video

Another app have received a lot of media attention isTapTapSee.  This app let you take a photo of an object, send the photo to the cloud, and tell you what the object in the photo in about 10 seconds.
The last I checked, this service is no longer free.  Right now there are 2 price plan.  One costs $8.00 to ID 100 images with no time limit. The other costs $9.00 per month with unlimited number of image.
TapTapSee Demo Video

If you need live help to ID a item, BeMyEyes is an app let that let a blind user and a sighted volunteer  establish a video link on smartphone  so the volunteer can  tell the user what the camera is pointing at.  This is a free service.
BeMyEyes Demo video

The info at the directionsforme.com is what I was hoping in a mobile app.  The website contains product info on thousands of food items, health care items and other misc items.  You can search the info by product name or using an optional barcode  scanner.  Won't it be great if an app can use a smartphone's camera as a scanner and enter the UPC code directly into the database?  Probably someone is working on that reight now, I hope.
Directionsforme website

I really hate blindness,  I have jump through hoops just to get some cooking direction.



ABBYY TextGrabber, not I was hoping for

As an old man with severe visual impairment due to RP, I depend on technologies to help me read the printed material.  For example, I use the ZoomText screen reader to read the text on the monitor, I use a desktop CCTV to read printed material,   and I have a portable CCTV when I am away from my desk.  I also use the scanner and a OCR program called ABBYY FineReader to read the scanned printed material.

When I saw the ABBYY TextGrabber app in the Google play store selling for only $2.00, I bought a copy.  The FineReader works really well on my PC with the scanned material and I was hoping the TextGrabber would work equally well on a smartphone.  What I was hoping the TextGrabber to do is to let me take a photo of the text on a item and let the OCR generate a text file from the photo, after that I would let Talkback read the text to me.  I was hoping I can use this mobile app in the stores so I can read nutritional fact, cooking direction, and any other info before buying that item.   I knew the TextGrabber was designed to recognize text on flat paper, and I wanted to fine out whether it would recognize text on flat surfaces like text on front and back of various packages.

The TextGrabber was not designed for a blind user in mind.  A person who use this app must have enough vision to position the camera about 12 inches away from the text to take a photo.  The app's main user interface screen only has 2 large icons but they were not label, soTalkback unable to  tell you their function.  Anyway,  the upper icon is a camera and the lower icon is a photo album.  The  operation of this app is very easy.  After you tap the camera icon, you  would position the camera and tap the shutter  button  to take a photo of the text that you want the OCR function to work on.   After the photo was taken, tap the read button, the OCR would convert the image into a text file in few seconds. When the text file appear on the screen, you tap the screen to let Talkback read the text to you.  The photo album has the photos you took before and the OCR can work on those photo as well.  The TextGrabber app also have a language translation function which I won't test at this time.

After taking photos of many type of packages, the result was very disappointing.  The TextGrabber's OCR function had problem recognize almost all the text at the front of the package.  The OCR did do a little bit better with text at the side of back of the packages.   I think this is not the fault of TextGrabber because it was not design for this purpose.  Anyway, I can't use this app to ID items or get useful info from packages.  This app is pretty much useless to me as a shopping aid.  luckily it only cost me $2.00.

Here are few photos:
Box of tea bags

Box of cereal

Bottle of prune juice

Can of SPAM




The app didn't  do that much better with regular printed material either, like junk mail, post card, menu, etc.  If I have to guess, it had at most 50% accuracy.  The best OCR result came from a perfectly flat paper, with high contrast text and good lighting.  The ABBYY FineReader running in my PC had much better OCR result, I estimated at least 90% accuracy.  I am sure the resolution of the camera and the skill of a user also affect the OCR result.  I ran the test on my LG 22C smartphone which has a 5MP camera.

Look like I would need to find something else if I like to ID stuff on my own.  I aware of few alternative solutions.

I knew another mobile OCR app called KNFB Reader.  This app was specially designed for blind user.  This app usually cost $100 but often on sale for around $75.  I have no idea whether it would do any better  reading text on packages.  I don't want to spend that kind of money to find out at this time.  If anyone has experience on this app, feel free to comment below.  As far as I can tell, no OCR can read text on a curved surface, that mean I couldn't use it the read the cooking direction on can food.
  KNFB Reader Demo Video

Another app have received a lot of media attention isTapTapSee.  This app let you take a photo of an object, send the photo to the cloud, and tell you what the object in the photo in about 10 seconds.  This app is great if all you want is to ID an object. The last I checked, this service is no longer free.  Right now there are 2 price plan.  One costs $8.00 to ID 100 images with no time limit. The other costs $9.00 per month with unlimited number of image.
TapTapSee Demo Video

If you need live help to ID a item, BeMyEyes is an app let that let a blind user and a sighted volunteer  establish a video conference link on smartphone  so the volunteer can  tell the user what the camera is pointing at and provide additional info if needed.  This is a free service.
BeMyEyes Demo video

The info at the directionsforme.com is what I was hoping in a mobile app.  The website contains product info on thousands of food items, health care items and other misc items.  You can search the info by product name or using an optional barcode  scanner.  Won't it be great if an app can use a smartphone's camera as a scanner and enter the UPC code directly into the database?  Probably someone is working on that reight now, I hope.
Directionsforme website

I really hate RP,  I have to jump through hoops just to get some simple info.