[Solved] (NOT but, not worth it) Apache OO OCR software

justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

[Solved] (NOT but, not worth it) Apache OO OCR software

Post by justdfy »

Hi.
Is there an OCR (Optical Character Recognition) app in Apache OO I can download to take text from (uploaded) photographs and convert into an odt document?

If Apache doesn't have one, can anyone recommend a FREE one online that I can download?

Using a 2006 Toshiba L300 w later Windows 7 software.
Call me behind the times, I just like old s**t!

Cheers,
Jusro
'Down Asunder' (Australia)

(And, on posting this, I saw the notice at bottom about my using a Macbook Air. The local cop (Victoria Police) got to it and disabled every application's - maybe their drivers, AND stole my charger. Jus' sayin'..... So, having kept the old trusty Toshi, here it is!)
Last edited by justdfy on Thu Jul 18, 2019 4:35 am, edited 1 time in total.
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

AND! I've updated OO to 4.1.6!
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
robleyd
Moderator
Posts: 5055
Joined: Mon Aug 19, 2013 3:47 am
Location: Murbko, Australia

Re: Apache OO OCR software for photographed text?

Post by robleyd »

"Is there an OCR (Optical Character Recognition) app in Apache OO"
Nope.
"can anyone recommend a FREE one online"
These days most scanners come with software, so I would imagine there is little need for a separate application. That said, you could try an internet search for free OCR software, which will return a few sites telling you about the "best free OCR software" which might help you.

You might consider updating your signature to reflect your current software and OS.
Cheers
David
OS - Slackware 15 64 bit
Apache OpenOffice 4.1.15
LibreOffice 24.2.1.2; SlackBuild for 24.2.1 by Eric Hameleers
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

Thanks David. I don't have my scanner (a Canon MG1500) on hand. I looked through my laptop's Program Files, etc., but with my low level of computing/programming skills I couldn't see which of the Canon files was the actual OCR app. So I asked in the hope that I could download one from Apache, upload photos taken with my digital camera (or dump them from the web), and then use the OCR app to read them.

Having used Open Office for over 11 years, and reckoning it to be f**king fantastic, I hoped to stay within the same brand for an OCR app.

And yes, I'll do the internet search, as you suggest. It's just that I'm tired of picking an app and downloading it, only to find it's below acceptable and comes with unwanted signups etc. Not to mention a crap product. That's why I asked for a recommendation. And I'll update my profile here.

Cheers.
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
RoryOF
Moderator
Posts: 34586
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Apache OO OCR software for photographed text?

Post by RoryOF »

On Linux I use Tesseract to OCR TIFF files, often using gImageReader as a front end if scanning a book. As far as I know, Tesseract is also available for Windows. I cannot say for gImageReader.
 Edit: Just now checked: gImageReader for Windows and Linux is at
https://github.com/manisandro/gImageReader

I find it best to scan at 200 or 300 dpi, in greyscale.
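
For anyone who wants to try the command line route rather than a front end, here is a minimal Python sketch of driving Tesseract on one scanned image. It assumes Tesseract is already installed and on the PATH. The function names and the file names used with them are made up for illustration and are not part of Tesseract or OpenOffice.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Build the argument list: tesseract <image> <output base> -l <language>."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def ocr_image(image_path, output_base, lang="eng"):
    """Run Tesseract on one image and return the path of the text file it writes."""
    # Fail early with a readable message if the program is not installed.
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on PATH; install it first")
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    # Tesseract adds the ".txt" extension to the output base name itself.
    return Path(str(output_base) + ".txt")
```

Calling ocr_image("page1.tif", "page1") should leave the recognised text in page1.txt, which can then be opened in Writer and saved as an ODT document.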
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

Hi RoryOF, and thanks for that Tesseract suggestion.

Just went to its page, and see that it's "last update; 2013-04-25". Should that be a concern?

Whatever, I downloaded it anyway.

Being on the outer, I don't understand the terms you use, so I guess I'll do a Google for them.

I'm not scanning a book, so the gImageReader is not necessary, yes/no?

And what's 'front end' mean?

Cheers. Wish me 'luck'!
 Edit: Additional off-topic questions beginning "On Linux, RoryOF?" moved to
Open source software: Safe? Easy to configure? -- MrProgrammer 2019-07-14 00:09 
Last edited by MrProgrammer on Sun Jul 14, 2019 6:10 am, edited 1 time in total.
Reason: Add link to split topic
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
robleyd
Moderator
Posts: 5055
Joined: Mon Aug 19, 2013 3:47 am
Location: Murbko, Australia

Re: Apache OO OCR software for photographed text?

Post by robleyd »

"And what's 'front end' mean?"
It means a GUI, or Graphical User Interface, to a command line tool.
"I'm not scanning a book, so the gImageReader is not necessary, yes/no?"
Depends - are you happy to use the command line tool?

GUIs were designed to allow the introduction of the "point and droolclick" method of using a computer :twisted:
Cheers
David
OS - Slackware 15 64 bit
Apache OpenOffice 4.1.15
LibreOffice 24.2.1.2; SlackBuild for 24.2.1 by Eric Hameleers
RoryOF
Moderator
Posts: 34586
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Apache OO OCR software for photographed text?

Post by RoryOF »

The GUI allows easy marking of an area or areas to be OCRed in the sequence of scanned images. Tesseract is well developed and reliable so the update date may relate only to minor tweaks. Its accuracy is excellent, given good quality scans. But if the application is the occasional OCR of small documents, it may be simpler and quicker to re-type these than to get to grips with setting up an OCR system. Be aware that there may always be OCR errors, so careful proofreading is required, particularly on numerical data.
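
As a rough aid to that proofreading, a short Python sketch can list the words most worth a second look in OCR output: the ones that mix digits with letters that look like digits. The set of look-alike letters and the function name are assumptions chosen for illustration, and this is no substitute for reading the text against the original.

```python
import re

# Letters that OCR commonly confuses with digits (illustrative, not exhaustive).
LOOKALIKES = set("OoIlSBZ")


def flag_suspect_tokens(text):
    """Return words that mix digits with look-alike letters, e.g. '2O19' or 'l5'."""
    suspects = []
    for token in re.findall(r"\w+", text):
        has_digit = any(ch.isdigit() for ch in token)
        has_lookalike = any(ch in LOOKALIKES for ch in token)
        if has_digit and has_lookalike:
            suspects.append(token)
    return suspects
```

For example, flag_suspect_tokens("Invoice 2O19 total l5 dollars, paid 2019") returns ['2O19', 'l5'], leaving the clean number 2019 alone.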
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

Hi and thanks again, RoryOF and David!
Most of the answers above go straight over my head, so first I need to: 1, try to fire up the Tesseract software I downloaded; 2, see if I can get it working; 3, see if I have a clue about what's in front of me.
So, I'll get back to you.......

Cheers
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
RoryOF
Moderator
Posts: 34586
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Apache OO OCR software for photographed text?

Post by RoryOF »

I suspect you will find it easier to drive Tesseract using a GUI application, such as gImageReader. This distances the user from low-level driving of Tesseract.
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

Ha ha! Yeah...? Thanks Rory! Yet to try the downloaded package. Will do RIGHT NOW!
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

You know, there's one thing about geeks which quite sickens me, and that's when they mess with, as in HACK, those of us who are not computer savvy, as if they are superior beings.

Completely juvenile behavior, come to it.

PFO?
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

So, aside from 'someone' hacking my laptop today and 'screwing with' it, thus stopping a few keys from working (which magically returned to working later on), the Tesseract software I downloaded cannot be opened by Windows.

I suspect someone has messed with that too.

Any ideas who, 'boys'?

This is why I refuse to play as my mafia family want! EDIT; What I began to write was "why I don't ask for help on these types of forums".

You're all softheaded and cowardly idiots.

"You're all ...."
is too much of a generalization, obviously. Knowing better than to expect any honesty from those who accept bribes to harass me online and off, it is pointless answering any calls for me to defend myself.

The same knowledge is increased by a large margin when such harassments are online, from those who can mask their identities and attack from the "softheaded and cowardly" cover of their advanced hacking and computer/internet skills in general.

The guaranteed reply to this will be you exhibiting your narcissistic, indeed 'superiorist' talents by countering to my 'disdain', with more insult, and banning me.

Your question to this reply, him from Mobtko SA, is that your way of 'justifying' my being banned from this Apache forum?

Why, you should both go into politics, or perhaps a big weapons corporation executive?

Lastly, I know you both message each other 'behind the screen' as it were, and have a good laugh at my WELL-KNOWN sorry fate, as per my 'mafia' comment.

But of course, you'll simply delete this reply in a lame attempt to further frustrate me into my ever-closer suicide.

And I, around 15 to 20 years ago, championed Open Source software above the mainstream systems. What a proven waste of my selflessness, and desire to improve the internet and computing.

Those who essentially are terrorizing me online are the reason why the elite (my mafia) want to decimate humanity down to some 500 million.

Of course, I submit the caveat that if it was neither of you, altho I think otherwise, who hacked, and screwed that Tesseract program to useless, I see no reason why you would be upset by my 'annoyed' words above.

But as it seems one party is asking, and thus is about to find cause to ban me, the likelihood of some 'foul play' on your behalf increases significantly.

Have a nice day, and play, but, please, TRY to be Ethical? That-is, try to be Intelligent. For the two, are the same.

Some people just cannot make life hard enough for the more Honorable.....
Last edited by justdfy on Mon Jul 15, 2019 12:24 pm, edited 1 time in total.
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
robleyd
Moderator
Posts: 5055
Joined: Mon Aug 19, 2013 3:47 am
Location: Murbko, Australia

Re: Apache OO OCR software for photographed text?

Post by robleyd »

"You're all softheaded and cowardly idiots."
Is this a reference to the people here who have been trying to help you?
Cheers
David
OS - Slackware 15 64 bit
Apache OpenOffice 4.1.15
LibreOffice 24.2.1.2; SlackBuild for 24.2.1 by Eric Hameleers
RoryOF
Moderator
Posts: 34586
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Apache OO OCR software for photographed text?

Post by RoryOF »

You may not have downloaded the correct version of Tesseract for your operating system.

I can answer for myself that I did not touch either your computer or any version of Tesseract that you may have downloaded. I am sure the same applies to robleyd.

You are the person who wishes to install an OCR package: it is up to you to learn how to do that and how to use it. If the suggestions made do not suit you, search engines are available which will return a wide variety of choices of other packages; which you choose is your decision.

The Volunteers on this forum offer their advice on a pro bono basis, and are _entitled_ to be treated with respect.

I don't know which package you will choose; I don't care, and will post no further to this thread.
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

Yep to most all of what you wrote there. But someone was messing with things 1, as the app was downloaded and later on.
I get a lot of it, being completely exiled by seriously nasty cult members.
No-one by now, online especially, can expect me to believe that I am unknown to them, what with the CIA's MK-ULTRA programming back of my very existence - think zionism's insane protocols of zion and it's plan for a global monarch.
So anyone who denies knowledge of myself, what with my GENETIC names known worldwide as being Windsor and Rothschild, and thinks they can mess with me, is a liar, short and simple.
S**t happens, for some of us. But a**holes always make it so. Especially the egocentric type who love abusing the powers on better or wealthier people.
NOTE; I've lived alone in Australia's forests and deserts for over 25 years now, due to those familial connections and the scum out to harass, terrorize or assassinate me. But no-one knows, right?

Cheers.
Oh? Do you work for AVG by chance?
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7
RusselB
Moderator
Posts: 6646
Joined: Fri Jan 03, 2014 7:31 am
Location: Sarnia, ON

Re: Apache OO OCR software for photographed text?

Post by RusselB »

No need for the profanity, and who any of us work for is irrelevant to us trying to help you with the question you posted.
Additionally, please keep your comments relevant to the topic.
OpenOffice 4.1.7, LibreOffice 7.0.1.2 on Windows 7 Pro, Ultimate & Windows 10 Home (2004)
If you believe your problem has been resolved, please go to your first post in this topic, click the Edit button and add [Solved] to the beginning of the Subject line.
justdfy
Posts: 38
Joined: Tue Apr 21, 2015 7:01 am

Re: Apache OO OCR software for photographed text?

Post by justdfy »

You had to have your say. Feel better?
Go back to sleep thanks.
OpenOffice 4.1.6 on (2006 model) Toshiba L300, using a later (2013?) L300's HDD w Windows7