Can OO import PDF and save as different format?

Discuss setup / installation issues - Add a spell checker, Language pack?
Post Reply
izavorin
Posts: 4
Joined: Tue Jun 17, 2008 9:27 pm

Can OO import PDF and save as different format?

Post by izavorin »

Welcome beginner. Please answer all of the questions below which may provide information necessary to answer your question.
-----------------------------------------------------------------------------------------------------------
Which version of OpenOffice.org are you using? 2.4
What Operating System (version) are you using? XP Pro
What is your question or comment?
Can OpenOffice import (rather than export) a PDF document? What I need is to import a PDF document that has both an image layer and a text layer and then export the text into a different format such as plain text, Word or similar, or RTF. The problem is that the documents I am working with have non-English text in them and saving as text in Adobe Reader produces garbage. Thanks.
OOo 2.4.X on Ms Windows XP
User avatar
squenson
Volunteer
Posts: 1885
Joined: Wed Jan 30, 2008 9:21 pm
Location: Lausanne, Switzerland

Re: Can OO import PDF and save as different format?

Post by squenson »

No, OOo cannot import pdf files.
LibreOffice 4.2.3.3. on Ubuntu 14.04
Caracalla
Volunteer
Posts: 474
Joined: Thu Nov 22, 2007 2:35 pm
Location: Netherlands, EU

Re: Can OO import PDF and save as different format?

Post by Caracalla »

Short answer: No.
Long answer: Not yet, you may find this interesting reading.

EDIT: just saw that you are working with a non-english text. Going by your statement i take it it is not in the latin alphabet? If so, make sure you have support for that language installed or you're not going to get anything else than garbage whatever you do.
Has your question been answered? Then please add [solved] to the title of your thread.
My pet peeve, No support for international ordinal numbering
please vote

Er is nu ook een Nederlandstalig forum!
OOo 3.0.X on Ms Windows XP + Opensuse 11.1
User avatar
ccornell
Volunteer
Posts: 611
Joined: Sun Oct 07, 2007 7:21 am

Re: Can OO import PDF and save as different format?

Post by ccornell »

If you have OpenOffice.org 3.0 Beta and install the PDF Import Extension, then yes you can import PDFs (into Draw) and even edit them. It is not perfect, but it is pretty good. You can save the PDF in the new ODF Hybrid format as well which gives you both the PDF and the ODF in the same file.

OpenOffice.org 3.0 Beta is found here: http://download.openoffice.org/680/index.html
The Sun PDF Import extension is found here: http://extensions.services.openoffice.o ... /pdfimport
openSUSE 11.4, KDE4.6 with OpenOffice.org 3.3
izavorin
Posts: 4
Joined: Tue Jun 17, 2008 9:27 pm

Re: Can OO import PDF and save as different format?

Post by izavorin »

ccornell wrote:If you have OpenOffice.org 3.0 Beta and install the PDF Import Extension, then yes you can import PDFs (into Draw) and even edit them. It is not perfect, but it is pretty good. You can save the PDF in the new ODF Hybrid format as well which gives you both the PDF and the ODF in the same file.

OpenOffice.org 3.0 Beta is found here: http://download.openoffice.org/680/index.html
The Sun PDF Import extension is found here: http://extensions.services.openoffice.o ... /pdfimport
Thanks, ccornell. have a problem, though: I installed 3.0 and launched Writer once. When I tried to install the extension, I got the following message: "the extension can't be installed as the following system dependencies are not fulfilled: OO.org 3.0". Do you know why? BTW, I also previously installed OOo 2.4 on my machine. Thanks.
OOo 2.4.X on Ms Windows XP
User avatar
ccornell
Volunteer
Posts: 611
Joined: Sun Oct 07, 2007 7:21 am

Re: Can OO import PDF and save as different format?

Post by ccornell »

Are you installing the extension from within OOo3.0Beta or from just clicking on the OXT? If you have 2.4 and 3.0 installed at the same time, it's possible that when you click the OXT it's trying to install it in OOo2.4 not 3.0. So... start OOo3.0, and go to Tools > Extension manager and install the extension there.
openSUSE 11.4, KDE4.6 with OpenOffice.org 3.3
izavorin
Posts: 4
Joined: Tue Jun 17, 2008 9:27 pm

Re: Can OO import PDF and save as different format?

Post by izavorin »

ccornell wrote:Are you installing the extension from within OOo3.0Beta or from just clicking on the OXT? If you have 2.4 and 3.0 installed at the same time, it's possible that when you click the OXT it's trying to install it in OOo2.4 not 3.0. So... start OOo3.0, and go to Tools > Extension manager and install the extension there.
Thanks, the installation worked. I tried it on a couple of PDFs with mixed results. My problem is that I have a bunch of PDFs that contain text using the Arabic script (e.g. Arabic, Farsi, etc.). All I need is to be able to extract the text layer properly and save it into something like a text file with encoding or a Word document. Several applications that I tried can't do it correctly. In the case of OOo3, I tried 3 docs. All loaded without errors, but in case of two of those, it displayed gibberish on the screen. The third it displayed coreectly and I saved it, but I am not sure how I can extract the text.
OOo 2.4.X on Ms Windows XP
User avatar
ccornell
Volunteer
Posts: 611
Joined: Sun Oct 07, 2007 7:21 am

Re: Can OO import PDF and save as different format?

Post by ccornell »

For the gibberish problem, as Caracalla said, make absolutely sure you have the fonts installed. PDFs can have embedded fonts, so can display a font type you do not have installed locally. A missing font type is a likely source of your prob.

Once the PDF is imported.. just copy the text into Writer... should be easy.
openSUSE 11.4, KDE4.6 with OpenOffice.org 3.3
Post Reply