MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx)

Home made tutorials, by users, for users
Forum rules
No question in this section please
For any question related to a topic, create a new thread in the relevant section.
Post Reply
User avatar
Hagar Delest
Moderator
Posts: 32594
Joined: Sun Oct 07, 2007 9:07 pm
Location: France

MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx)

Post by Hagar Delest »

Compatibility with the MS Office 2007 new file format OOXML (docx, xslx, pptx, ppsx) is a frequent issue. Here are some links to articles about the nature of Office Open XML.
Note: if you find broken links, please report them so that we update them. Sites sometimes change their address causing this kind of trouble.

What is called OOXML?
The OOXML can be misleading because at the beginning, it could also aply to the OpenOffice XML developed by the OASIS. It has then been renamed to ODF (Open Document Format [for Office Applications]). For more information about ODF and OASIS, see: https://www.oasis-open.org/org and https://www.oasis-open.org/committees/t ... rev=office
Now (and especially in this topic), OOXML stands for the Microsoft file format Office Open XML.
Additional resources: OpenOffice 3.# and later: supports the import of .docx, .xlsx and .pptx.[/size]
First an interesting comparison of the feature of both ODF and OOXML formats: White Paper: Open Document Format and Office Open XML.

But don't forget:
And these articles spotted by ccornell here:


OpenOffice.org 2.#:
Some old links with pointers to additional resources:
- About the ability to handle MS Office 2007 files
- Unable to open MS Office 2007 Word doc(x)s sent in open xml
- Compatibility with Office 2007
huw
Volunteer
Posts: 417
Joined: Wed Nov 21, 2007 1:57 pm

Re: MS Office 2007 file format (docx, xslx, pptx, ppsx)

Post by huw »

The OOXML (Office OpenXML) container is a ZIP style archive, just like ODF - so in an emergency you can access most text content that way, without needing to install or download anything at all. Just rename the file to end with .ZIP
enderandrew
Posts: 6
Joined: Sat May 03, 2008 7:57 pm

Re: MS Office 2007 file format (docx, xslx, pptx, ppsx)

Post by enderandrew »

Color me stupid but the odf-converter-integrator for docx/xlsx/pptx tool is apparently the best means of conversion, yet it says it can't be properly integrated directly into OOo. Why is this?
User avatar
Hagar Delest
Moderator
Posts: 32594
Joined: Sun Oct 07, 2007 9:07 pm
Location: France

Re: MS Office 2007 file format (docx, xslx, pptx, ppsx)

Post by Hagar Delest »

The converter is done by Novell and their build is slightly different from the Sun's version. They don't have the same position on development of OOo (even if we can't say there is any fork), that's why the Novell build has often more features than the Sun's version.
LibreOffice 7.6.2.1 on Xubuntu 23.10 and 7.6.4.1 portable on Windows 10
jvanriper
Posts: 1
Joined: Mon Sep 15, 2008 4:15 pm

Re: MS Office 2007 file format (docx, xslx, pptx, ppsx)

Post by jvanriper »

Is there an ODF plug in for Mac OSX? I am trying to send Open Office documents done on a PC, Windows Vista to a Mac running OSX 10.5.4, using Microsoft Office 2004. thanks,
OOo 2.3.X on MS Windows Vista + MAC OSX 10.5.4
User avatar
Hagar Delest
Moderator
Posts: 32594
Joined: Sun Oct 07, 2007 9:07 pm
Location: France

Re: MS Office 2007 file format (docx, xslx, pptx, ppsx)

Post by Hagar Delest »

Why do you need a plug in since you work with OOo and MS Office 2004 (still using the binary format .doc)? Can't you save in .doc?
LibreOffice 7.6.2.1 on Xubuntu 23.10 and 7.6.4.1 portable on Windows 10
delboy4369
Posts: 2
Joined: Mon Dec 08, 2008 3:36 pm

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx)

Post by delboy4369 »

HI all,
i too am having touble with the file extensions, however i have tried the office support pack from microsoft, and i have version 3 of OO.o, and i still can't open the files, can any one help?
OOo 3.0.X on MS Windows Vista
fichy
Posts: 1
Joined: Sun May 24, 2009 9:23 pm

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx)

Post by fichy »

Hi,

I had a look at the thread and opened quite a lot of links. I also had a look at the odf-converter-integrator for docx/xlsx/pptx. However, it supports only Windows and Linux... :(

I am using Mac OS X Version 10.5.6 and I would really need to convert these stupied MS Office 2007 documents (docx, xslx, pptx, ppsx).
Is there anything, an extension or whatever that I could download for Mac OS X to open these documents.

Please, I need HELP !!

Thanks,
fichy
OOo 3.0.X on Mac OSx other
User avatar
Hagar Delest
Moderator
Posts: 32594
Joined: Sun Oct 07, 2007 9:07 pm
Location: France

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx)

Post by Hagar Delest »

3.x should open them natively. Can you give a link to such a file so that we can try?
LibreOffice 7.6.2.1 on Xubuntu 23.10 and 7.6.4.1 portable on Windows 10
thomasjk
Volunteer
Posts: 4451
Joined: Tue Dec 25, 2007 4:52 pm
Location: North Carolina

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by thomasjk »

And you point is?????????????????
Tom K.
Windows 10 Home version 1803 17134.165
LibreOffice 5.4.7.2
AnrDaemon
Posts: 23
Joined: Fri Mar 27, 2009 6:53 pm

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by AnrDaemon »

If you ask me, OOo is unable to open MS Office xml documents of certain size (several megabytes of XML code, which means 40+ pages) - Office just hangs of throw an exception that document is broken.
Knowing that the document only contains simple formatting (easily opened after re-saving in binary format), I think it's about perfect indication, that MS does not comply with their own standards.
AOo 3.4.1 on Ms Windows XP/ OOo 3.3.0 on Mac OS X 10.7
User avatar
Villeroy
Volunteer
Posts: 31264
Joined: Mon Oct 08, 2007 1:35 am
Location: Germany

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by Villeroy »

Preliminary Google reply to DIS 29500 : the consideration of ECMA-376 OOXML for ISO standardization. [PDF]

where Jeremy Allison (@Google) complains about OOXML mainly because of 2 technical reasons:
Google wrote:No useful guide to specification. Although it has made the specification available, Microsoft has made
it impossible to find the relevant information swiftly as the specification is over 6,000 pages long.
Google wrote:Continued use of binary code. In addition, the specification continues to use binary code although the
significant advantage of XML is that it is generally text based and therefore humanly readable.
[...]
In addition, many of these binary sections refer back to the Windows architecture which is unknown to
software engineers outside Microsoft, ...
As an example Jeremy Allison cites a binary font declaration and mentions
• Paragraph conditional formatting
• Table cell conditional formatting
• Table row conditional formatting
• Table style conditional formatting settings exception
• Pane format filter.

IMHO, the main conclusion on the technical side of the problem is this:
Although OOXML may formally comply with Ecma, it was clearly not designed with an “open” spirit.
Comparing the current with the future situation, interoperability is likely to become more difficult
instead of easier. The implementation of a fully compatible ODF importer (the current efforts regarding
.doc and .xls) is not an easy task, but it is dwarfed by the implementation of a fully compatible
OOXML importer, which we estimate to take something between 50 – 500 person years, or even
longer. Therefore, although it is theoretically possible to generate an OOXML document, this document
will probably only use a very small subset of the standard.
Then he states the judicial implications of what MS refers to as “Open Specification Promise”.

Hundreds of man years for an acceptable import filter (leaving aside any export filter) and no guarantee (just a promise) that you won't be sued by MS or even third parties!
Please, edit this topic's initial post and add "[Solved]" to the subject line if your problem has been solved.
Ubuntu 18.04 with LibreOffice 6.0, latest OpenOffice and LibreOffice
sampei
Posts: 1
Joined: Fri May 31, 2013 1:48 pm

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by sampei »

Thanks for this post it is useful to me.

However I thought I would report that one link is no longer working

Implementation of OOXML: What Governments Need to Know (PDF)

I am getting a Error 404 - Not Found

Has the article been moved?
OpenOffice 3.1 on Windows Vista Free hosting
User avatar
RoryOF
Moderator
Posts: 34570
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by RoryOF »

This document is (at time of this post) at
http://wiki.openoffice.org/w/images/9/9 ... ct2010.pdf
Apache OpenOffice 4.1.15 on Xubuntu 22.04.4 LTS
mdotley
Posts: 1
Joined: Sun Oct 20, 2013 11:32 pm

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by mdotley »

Both links for "Microsoft OOXML: Dead Format Walking" appear to be broken. Try this for what appears to be the original article at a new URI: http://www.eweek.com/reviews/microsoft_ ... t_walking/
MD Otley

Running Apache OpenOffice 4.0.0 on a Windows XP dinosaur
jackmira
Posts: 3
Joined: Thu Apr 28, 2016 4:22 am

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by jackmira »

The file format is really complicated. It would great if there is a dedicated tool to view/learn these formats.
OpenOffice 3.1 on Windows 7
John_Ha
Volunteer
Posts: 9583
Joined: Fri Sep 18, 2009 5:51 pm
Location: UK

Re: MS Office 2007 OOXML file format (docx, xslx, pptx, ppsx

Post by John_Ha »

See textbox.docx which has a Microsoft Textbox containing some (Latin lorem ipsum ...) text in it. Nothing is visible in AOO (though all is visible in LO as LO supports Microsoft Textboxes).

I extracted document.xml from the .docx file. The file, when pretty printed as below, is 143 lines long. From what I understand, everything between the start tag <mc:AlternateContent> on line 16, and its matching end tag </mc:AlternateContent> on line 130, 13 lines from the end, is outside the OOXML Specification, so not visible in AOO. So much for Microsoft's using its own standards.

Code: Select all

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<w:document xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing" xmlns:wps="http://schemas.microsoft.com/office/word/2010/wordprocessingShape" xmlns:wpg="http://schemas.microsoft.com/office/word/2010/wordprocessingGroup" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:wp14="http://schemas.microsoft.com/office/word/2010/wordprocessingDrawing" xmlns:w14="http://schemas.microsoft.com/office/word/2010/wordml" mc:Ignorable="w14 wp14">
	<w:body>
		<w:p>
			<w:pPr>
				<w:pStyle w:val="Normal"/>
				<w:spacing w:before="0" w:after="200"/>
				<w:rPr/>
			</w:pPr>
			<w:r>
				<w:rPr/>
				<mc:AlternateContent>
					<mc:Choice Requires="wps">
						<w:drawing>
							<wp:anchor behindDoc="0" distT="0" distB="0" distL="114300" distR="114300" simplePos="0" locked="0" layoutInCell="1" allowOverlap="1" relativeHeight="2">
								<wp:simplePos x="0" y="0"/>
								<wp:positionH relativeFrom="column">
									<wp:posOffset>394335</wp:posOffset>
								</wp:positionH>
								<wp:positionV relativeFrom="paragraph">
									<wp:posOffset>473710</wp:posOffset>
								</wp:positionV>
								<wp:extent cx="4606925" cy="1894840"/>
								<wp:effectExtent l="0" t="0" r="12700" b="21590"/>
								<wp:wrapNone/>
								<wp:docPr id="1" name="Text Box 1"/>
								<a:graphic xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main">
									<a:graphicData uri="http://schemas.microsoft.com/office/word/2010/wordprocessingShape">
										<wps:wsp>
											<wps:cNvSpPr/>
											<wps:spPr>
												<a:xfrm>
													<a:off x="0" y="0"/>
													<a:ext cx="4606200" cy="1894320"/>
												</a:xfrm>
												<a:prstGeom prst="rect">
													<a:avLst/>
												</a:prstGeom>
												<a:solidFill>
													<a:schemeClr val="lt1"/>
												</a:solidFill>
												<a:ln w="6480">
													<a:solidFill>
														<a:srgbClr val="000000"/>
													</a:solidFill>
													<a:round/>
												</a:ln>
											</wps:spPr>
											<wps:style>
												<a:lnRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:lnRef>
												<a:fillRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:fillRef>
												<a:effectRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:effectRef>
												<a:fontRef idx="minor"/>
											</wps:style>
											<wps:txbx>
												<w:txbxContent>
													<w:p>
														<w:pPr>
															<w:pStyle w:val="FrameContents"/>
															<w:spacing w:lineRule="auto" w:line="240" w:before="0" w:after="200"/>
															<w:jc w:val="left"/>
															<w:rPr>
																<w:b w:val="false"/>
																<w:b w:val="false"/>
																<w:bCs w:val="false"/>
															</w:rPr>
														</w:pPr>
														<w:r>
															<w:rPr>
																<w:b w:val="false"/>
																<w:bCs w:val="false"/>
																<w:color w:val="00000A"/>
																<w:sz w:val="28"/>
															</w:rPr>
															<w:t>Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</w:t>
														</w:r>
													</w:p>
												</w:txbxContent>
											</wps:txbx>
											<wps:bodyPr>
												<a:prstTxWarp prst="textNoShape"/>
												<a:noAutofit/>
											</wps:bodyPr>
										</wps:wsp>
									</a:graphicData>
								</a:graphic>
							</wp:anchor>
						</w:drawing>
					</mc:Choice>
					<mc:Fallback>
						<w:pict>
							<v:rect id="shape_0" ID="Text Box 1" fillcolor="white" stroked="t" style="position:absolute;margin-left:31.05pt;margin-top:37.3pt;width:362.65pt;height:149.1pt">
								<w10:wrap type="square"/>
								<v:fill o:detectmouseclick="t" type="solid" color2="black"/>
								<v:stroke color="black" weight="6480" joinstyle="round" endcap="flat"/>
								<v:textbox>
									<w:txbxContent>
										<w:p>
											<w:pPr>
												<w:pStyle w:val="FrameContents"/>
												<w:spacing w:lineRule="auto" w:line="240" w:before="0" w:after="200"/>
												<w:jc w:val="left"/>
												<w:rPr>
													<w:b w:val="false"/>
													<w:b w:val="false"/>
													<w:bCs w:val="false"/>
												</w:rPr>
											</w:pPr>
											<w:r>
												<w:rPr>
													<w:b w:val="false"/>
													<w:bCs w:val="false"/>
													<w:color w:val="00000A"/>
													<w:sz w:val="28"/>
												</w:rPr>
												<w:t>Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</w:t>
											</w:r>
										</w:p>
									</w:txbxContent>
								</v:textbox>
							</v:rect>
						</w:pict>
					</mc:Fallback>
				</mc:AlternateContent>
			</w:r>
		</w:p>
		<w:sectPr>
			<w:type w:val="nextPage"/>
			<w:pgSz w:w="11906" w:h="16838"/>
			<w:pgMar w:left="1440" w:right="1440" w:header="0" w:top="1440" w:footer="0" w:bottom="1440" w:gutter="0"/>
			<w:pgNumType w:fmt="decimal"/>
			<w:formProt w:val="false"/>
			<w:textDirection w:val="lrTb"/>
			<w:docGrid w:type="default" w:linePitch="360" w:charSpace="4096"/>
		</w:sectPr>
	</w:body>
</w:document>
Attachments
Textbox.docx
A one page document containing a Microsoft textbox with some text inside it.
(8.78 KiB) Downloaded 910 times
LO 6.4.4.2, Windows 10 Home 64 bit

See the Writer Guide, the Writer FAQ, the Writer Tutorials and Writer for students.

Remember: Always save your Writer files as .odt files. - see here for the many reasons why.
Post Reply