Edit PDF file

Discuss the drawing application
Post Reply
JHolliday
Posts: 1
Joined: Wed May 10, 2017 3:34 am

Edit PDF file

Post by JHolliday »

I have a copy of a file that is only in pdf form. Is it possible to convert this file to text so I can edit it in OpenOffice?
OpenOffice 4.1.3 / Windows 10
User avatar
robleyd
Moderator
Posts: 5082
Joined: Mon Aug 19, 2013 3:47 am
Location: Murbko, Australia

Re: pdf file

Post by robleyd »

Hi, and welcome to the community forum.

If you can select the text in the PDF file, simply copy and paste into Writer. This will only work provided the PDF is not a scanned image.
Cheers
David
OS - Slackware 15 64 bit
Apache OpenOffice 4.1.15
LibreOffice 24.2.2.2; SlackBuild for 24.2.2 by Eric Hameleers
User avatar
Zizi64
Volunteer
Posts: 11359
Joined: Wed May 26, 2010 7:55 am
Location: Budapest, Hungary

Re: pdf file

Post by Zizi64 »

I have a copy of a file that is only in pdf form. Is it possible to convert this file to text so I can edit it in OpenOffice?
The PDF fileformat was developed in the past for reading and printing (and for filling today), but not for full featured editing.

The AOO and LO has a feature named Import PDF. It is a limited PDF editor capability: You can open a PDF file in the Draw application, and you can make some cosmetics on the file: you can edit the texts "as labels"; you can delete some texts and/or pictures from the PDF file; etc...


Some PDF files are protected. In this case maybe you will not able to edit it nor copy the content from the pdf.
Tibor Kovacs, Hungary; LO7.5.8 /Win7-10 x64Prof.
PortableApps/winPenPack: LO3.3.0-7.6.2;AOO4.1.14
Please, edit the initial post in the topic: add the word [Solved] at the beginning of the subject line - if your problem has been solved.
User avatar
RoryOF
Moderator
Posts: 34613
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: Edit PDF file

Post by RoryOF »

If serious editing is needed, such as re-layout of the file, or extensive extraction of the text from it, it may be necessary to pass the file through an OCR (Optical Character Recognition) program, and then edit the resulting text, both for correction of OCR errors (typically 2 - 3% of the text) and rectification of the formatting which the OCR will attempt to preserve. For a multipage document this is not a trivial task.

As Zizi64 says, it may not easily be possible, as PDF files can be locked to prevent extraction of text.
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
Post Reply