
[Solved] (NOT but, not worth it) Apache OO OCR software

Posted: Sat Jul 13, 2019 6:15 am
by justdfy
Hi.
Is there an OCR (Optical Character Recognition) app in Apache OO I can download to take text from (uploaded) photographs and convert into an odt document?

If Apache doesn't have one, can anyone recommend a FREE one online that I can download?

Using a 2006 Toshiba L300 w later Windows 7 software.
Call me behind the times, I just like old s**t!

Cheers,
Jusro
'Down Asunder' (Australia)

(And, on posting this, I saw the notice at bottom about my using a Macbook Air. The local cop (Victoria Police) got to it and disabled every application's - maybe their drivers, AND stole my charger. Jus' sayin'..... So, having kept the old trusty Toshi, here it is!)

Re: Apache OO OCR software for photographed text?

Posted: Sat Jul 13, 2019 6:19 am
by justdfy
AND! I've updated OO to 4.1.6!

Re: Apache OO OCR software for photographed text?

Posted: Sat Jul 13, 2019 6:35 am
by robleyd
Is there an OCR (Optical Character Recognition) app in Apache OO
Nope.
can anyone recommend a FREE one online
These days most scanners come with software so I would imagine there is little need for a separate application. That said, you could try an internet search for free OCR software which will return a few sites telling you about the "best free OCR software" which might help you.

You might consider updating your signature to reflect your current software and OS.

Re: Apache OO OCR software for photographed text?

Posted: Sat Jul 13, 2019 6:54 am
by justdfy
Thanks David. I don't have my scanner (a Canon MG1500) on hand. I looked through my laptop's Program Files, etc., but with my low level of computing/programming skills I couldn't see which of the Canon files was the actual OCR app. So I'm asking in the hope that I can download one from Apache, so that I can upload photos taken with my digital camera (or dump them from the web), then use an OCR app to read them.

Having used Open Office for over 11 years, and reckoning it to be f**king fantastic, I hoped to stay within the same brand for an OCR app.

And yes, I'll do the internet search, as you suggest. It's just that I'm tired of picking an app and downloading it, only to find it's below acceptable and comes with unwanted signups etc. Not to mention a crap product. That's why I asked for a recommendation. And I'll update my profile here.

Cheers.

Re: Apache OO OCR software for photographed text?

Posted: Sat Jul 13, 2019 7:38 am
by RoryOF
On Linux I use Tesseract to OCR TIFF files, often using gImageReader as a front end if scanning a book. As far as I know, Tesseract is also available for Windows. I cannot say for gImageReader.
 Edit: Just now checked: gImageReader for Windows and Linux is at
https://github.com/manisandro/gImageReader

I find it best to scan at 200 or 300 dpi, in greyscale.
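
For anyone comfortable with a script, here is a minimal sketch of driving Tesseract's command-line tool from Python. It assumes the `tesseract` executable is installed and on the PATH. The function names and the file names `page1.tif` and `page1` are made up for illustration and are not part of Tesseract. The basic command form is the input image, then an output base name (Tesseract adds `.txt` itself), then options such as `-l` for the language.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Return the argument list for a basic Tesseract run.

    Hypothetical helper: it only assembles the arguments in the order
    the Tesseract command line expects (image, output base, options).
    """
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def ocr_to_text(image_path, output_base, lang="eng"):
    """Run Tesseract on one image and return the recognised text."""
    # Assumption: tesseract is installed separately and reachable via PATH.
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    # Tesseract writes its plain-text result to "<output_base>.txt".
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

Calling `ocr_to_text("page1.tif", "page1")` should leave the recognised text in `page1.txt`, which can then be opened or pasted into Writer and saved as an odt document.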

Re: Apache OO OCR software for photographed text?

Posted: Sun Jul 14, 2019 5:18 am
by justdfy
Hi RoryOF, and thanks for that Tesseract suggestion.

Just went to its page, and see that its "last update; 2013-04-25". Should that be a concern?

Whatever, I downloaded it anyway.

Being on the outer, I don't understand the terms you use, so I guess I'll do a google for them.

I'm not scanning a book, so the gimageReader is not necessary, yes/no?

And what's 'front end' mean?

Cheers. Wish me 'luck'!
 Edit: Additional off-topic questions beginning "On Linux, RoryOF?" moved to
Open source software: Safe? Easy to configure? -- MrProgrammer 2019-07-14 00:09 

Re: Apache OO OCR software for photographed text?

Posted: Sun Jul 14, 2019 6:34 am
by robleyd
And what's 'front end' mean?
It means a GUI, or Graphical User Interface, for a command-line tool.
I'm not scanning a book, so the gimageReader is not necessary, yes/no?
Depends - are you happy to use the command line tool?

GUIs were designed to allow the introduction of the "point and droolclick" method of using a computer :twisted:

Re: Apache OO OCR software for photographed text?

Posted: Sun Jul 14, 2019 8:02 am
by RoryOF
The GUI allows easy marking of an area or areas to be OCRed in the sequence of scanned images. Tesseract is well developed and reliable so the update date may relate only to minor tweaks. Its accuracy is excellent, given good quality scans. But if the application is the occasional OCR of small documents, it may be simpler and quicker to re-type these than to get to grips with setting up an OCR system. Be aware that there may always be OCR errors, so careful proofreading is required, particularly on numerical data.
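
As an aid to proofreading numbers, a rough sketch like the one below can flag words in the OCR output that mix digits with letters that are often mistaken for digits. The look-alike set and the function name are assumptions chosen for illustration. It is a simple heuristic, not a Tesseract feature, and it does not replace reading the text through.

```python
import re

# Letters often confused with digits in OCR output (illustrative, not exhaustive).
LOOKALIKES = set("OoIlSsBZ")


def suspicious_tokens(text):
    """Return words that contain both a digit and a look-alike letter."""
    flagged = []
    for token in re.findall(r"\w+", text):
        has_digit = any(ch.isdigit() for ch in token)
        has_lookalike = any(ch in LOOKALIKES for ch in token)
        if has_digit and has_lookalike:
            flagged.append(token)
    return flagged
```

Anything it returns is only a candidate for a closer look. Legitimate codes such as part numbers will be flagged too.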

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 9:14 am
by justdfy
Hi and thanks again, RoryOF and David!
Most of the answers above go straight over my head, requiring, 1st, that I try to fire up the Tesseract software I downloaded; 2, see if I can get it working; 3, see if I have a clue about what's in front of me.
So, I'll get back to you.......

Cheers

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 9:41 am
by RoryOF
I suspect you will find it easier to drive Tesseract using a GUI application, such as gImageReader. This distances the user from low-level driving of Tesseract.

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 9:45 am
by justdfy
Ha ha! Yeah...? Thanks Rory! Yet to try the downloaded package. Will do RIGHT NOW!

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 9:52 am
by justdfy
You know, there's one thing about geeks which quite sickens me, and that's when they mess with, as in HACK, those of us who are not computer savvy, as if they are superior beings.
Completely juvenile behavior, come to it.

PFO?

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 10:00 am
by justdfy
So, aside from 'someone' hacking my laptop today and 'screwing with' thus stopping a few keys from working (which magically returned to working later on), the Tesseract software I downloaded cannot be opened by windows.

I suspect someone has messed with that too.

Any ideas who, 'boys'?

This is why I refuse to play as my mafia family want! EDIT; What I began to write was "why I don't ask for help on these types of forums".

You're all softheaded and cowardly idiots.

"You're all ...."
is too much of a generalization, obviously. Knowing better than to expect any honesty from those who accept bribes to harass me online and off, it is pointless answering any calls for me to defend myself.

The same knowledge is increased by a large margin when such harassments are online, from those who can mask their identities and attack from the "softheaded and cowardly" cover of their advanced hacking and computer/internet skills in general.

The guaranteed reply to this will be you exhibiting your narcissistic, indeed 'superiorist' talents by countering to my 'disdain', with more insult, and banning me.

Your question to this reply, him from Mobtko SA, is that your way of 'justifying' my being banned from this Apache forum?

Why, you should both go into politics, or perhaps a big weapons corporation executive?

Lastly, I know you both message each other 'behind the screen' as it were, and have a good laugh at my WELL-KNOWN sorry fate, as per my 'mafia' comment.

But of course, you'll simply delete this reply in a lame attempt to further frustrate me into my ever-closer suicide.

And I, around 15 to 20 years ago, championed Open Source software above the mainstream systems. What a proven waste of my selflessness, and desire to improve the internet and computing.

Those who essentially are terrorizing me online are the reason why the elite (my mafia) want to decimate humanity down to some 500 million.

Of course, I submit the caveat that if it was neither of you, altho I think otherwise, who hacked, and screwed that Tesseract program to useless, I see no reason why you would be upset by my 'annoyed' words above.

But as it seems one party is asking, and thus is about to find cause to ban me, the likelihood of some 'foul play' on your behalf increases significantly.

Have a nice day, and play, but, please, TRY to be Ethical? That-is, try to be Intelligent. For the two, are the same.

Some people just cannot make life hard enough for the more Honorable.....

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 11:45 am
by robleyd
You're all softheaded and cowardly idiots.
Is this a reference to the people here who have been trying to help you?

Re: Apache OO OCR software for photographed text?

Posted: Mon Jul 15, 2019 1:45 pm
by RoryOF
You may not have downloaded the correct version of Tesseract for your operating system.

I can answer for myself that I did not touch either your computer or any version of Tesseract that you may have downloaded. I am sure the same applies to robleyd.

You are the person who wishes to install an OCR package: it is up to you to learn how to do that and how to use it. If the suggestions made do not suit you, search engines are available which will return a wide variety of choices of other packages; which you choose is your decision.

The Volunteers on this forum offer their advice on a pro bono basis, and are _entitled_ to be treated with respect.

I don't know which package you will choose; I don't care, and will post no further to this thread.

Re: Apache OO OCR software for photographed text?

Posted: Tue Jul 16, 2019 4:21 am
by justdfy
Yep to most all of what you wrote there. But someone was messing with things 1, as the app was downloaded and later on.
I get a lot of it, being completely exiled by seriously nasty cult members.
No-one by now, online especially, can expect me to believe that I am unknown to them, what with the CIA's MK-ULTRA programming back of my very existence - think zionism's insane protocols of zion and it's plan for a global monarch.
So anyone who denies knowledge of myself, what with my GENETIC names known worldwide as being Windsor and Rothschild, and thinks they can mess with me, is a liar, short and simple.
S**t happens, for some of us. But a**holes always make it so. Especially the egocentric type who love abusing the powers on better or wealthier people.
NOTE; I've lived alone in Australia's forests and deserts for over 25 years now, due to those familial connections and the scum out to harass, terrorize or assassinate me. But no-one knows, right?

Cheers.
Oh? Do you work for AVG by chance?

Re: Apache OO OCR software for photographed text?

Posted: Tue Jul 16, 2019 10:26 am
by RusselB
There's no need for the profanity, and who any of us works for is irrelevant to our trying to help you with the question you posted.
Additionally, please keep your comments relevant to the topic.

Re: Apache OO OCR software for photographed text?

Posted: Thu Jul 18, 2019 4:33 am
by justdfy
You had to have your say. Feel better?
Go back to sleep, thanks.