Puppy Linux Discussion Forum Forum Index Puppy Linux Discussion Forum
Puppy HOME page : puppylinux.com
"THE" alternative forum : puppylinux.info
 
 FAQFAQ   SearchSearch   MemberlistMemberlist   UsergroupsUsergroups   RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

The time now is Sat 13 Feb 2016, 15:36
All times are UTC - 4
 Forum index » Advanced Topics » Additional Software (PETs, n' stuff) » Documents
Tesseract 2.04
Post new topic   Reply to topic View previous topic :: View next topic
Page 1 of 1 [7 Posts]  
Author Message
auriza


Joined: 05 Jan 2009
Posts: 46
Location: Surakarta, Java

PostPosted: Tue 19 Jan 2010, 04:57    Post subject:  Tesseract 2.04
Subject description: Optical character recognition (OCR) program
 

Pretty accurate OCR program for scanning document to text. I use it together with ImageMagick to break some silly visual CAPTCHA, and it works!

You must install two packages to get it work, main program and language file. I only packaged English language file, but you can get the others from http://code.google.com/p/tesseract-ocr/downloads/list.

Usage:
# tesseract input.tif output
The output will be written to the text file in the same directory.

More info: PuppyForum: tesseract-ocr

Download Mirror:
tesseract-2.04-i486.pet [600kB]
tesseract-2.00.eng.pet [1MB]

MD5sum:
7b8c127764e7c18f41726b2ca4faedc9 - tesseract-2.00.eng.pet
86db54dc487d8da7aaa378be11f23da2 - tesseract-2.04-i486.pet
Back to top
View user's profile Send private message Visit poster's website 
greengeek

Joined: 20 Jul 2010
Posts: 3579
Location: New Zealand

PostPosted: Thu 19 Nov 2015, 14:22    Post subject:  

Unfortunately these links are dead. I am keen to try this older version if anyone has it (sometimes older stuff works better in specific circumstances). cheers!
Back to top
View user's profile Send private message 
Pelo


Joined: 10 Sep 2011
Posts: 6497
Location: Mer méditerrannée (1 kms°)

PostPosted: Sun 31 Jan 2016, 00:56    Post subject: use Puppyocr, it does the job.  

use Puppyocr, it does the job.
_________________
Puppy is muscular, not fat !
Back to top
View user's profile Send private message Yahoo Messenger 
rcrsn51


Joined: 05 Sep 2006
Posts: 10506
Location: Stratford, Ontario

PostPosted: Sun 31 Jan 2016, 08:52    Post subject:  

Here is an OCR project that is still maintained.
Back to top
View user's profile Send private message 
Pelo


Joined: 10 Sep 2011
Posts: 6497
Location: Mer méditerrannée (1 kms°)

PostPosted: Wed 03 Feb 2016, 07:33    Post subject: feed back how applications work once installed.  

what i would like on this forum is to have feed back how applications work once installed. Tesseract is one of those i did'nt succed to use, but with Puppyocr i really convert old documents, very old document to text. Its not easy, much time must be spent words wrongly recognized, but it's possible..
If they are some users having success with tesseract, please point here.
Once installed.. auriza you are welcome
"Unpaper is a tool for straightening pages and removing black edges, including in the middle, where you have photocopied an open book! "
Sometimes you will wonder if you wont be faster by typing directly the text by hand. but there is something magick in OCR, not efficient, but pleasant,
What would be nice, it's voice recognition, you read, the computer writes (in french). Smile or in latin, for very very old papers.

_________________
Puppy is muscular, not fat !
Back to top
View user's profile Send private message Yahoo Messenger 
disciple

Joined: 20 May 2006
Posts: 6631
Location: Auckland, New Zealand

PostPosted: Wed 03 Feb 2016, 14:24    Post subject:  

What engine does Puppyocr use? Anything using tesseract or cuneiform should give you pretty good results if you feed it a good scan.
_________________
Classic Puppy quotes
-
root: n. the superuser or administrator account that has complete control over everything in the machine. Running as root is a taonga of Puppy Linux users.
Back to top
View user's profile Send private message 
rcrsn51


Joined: 05 Sep 2006
Posts: 10506
Location: Stratford, Ontario

PostPosted: Thu 04 Feb 2016, 23:54    Post subject:  

disciple wrote:
What engine does Puppyocr use?

PuppyOCR appears to be Tesseract 2.04 with a GUI front-end. For some alternatives, read here.
Back to top
View user's profile Send private message 
Display posts from previous:   Sort by:   
Page 1 of 1 [7 Posts]  
Post new topic   Reply to topic View previous topic :: View next topic
 Forum index » Advanced Topics » Additional Software (PETs, n' stuff) » Documents
Jump to:  

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You cannot attach files in this forum
You can download files in this forum


Powered by phpBB © 2001, 2005 phpBB Group
[ Time: 0.0707s ][ Queries: 11 (0.0059s) ][ GZIP on ]