MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Wed Apr 09, 2008 9:42 pm
by Hagar Delest
Compatibility with the new MS Office 2007 file format, OOXML (docx, xlsx, pptx, ppsx), is a frequent issue. Here are some links to articles about the nature of Office Open XML.
Note: if you find broken links, please report them so that we can update them. Sites sometimes change their addresses, which causes this kind of trouble.

What is called OOXML?
The name OOXML can be misleading because, at the beginning, it could also apply to the OpenOffice XML format developed by OASIS. That format has since been renamed ODF (Open Document Format [for Office Applications]). For more information about ODF and OASIS, see: https://www.oasis-open.org/org and https://www.oasis-open.org/committees/t ... rev=office
Now (and especially in this topic), OOXML stands for the Microsoft file format Office Open XML.
Additional resources:
OpenOffice 3.# and later: supports the import of .docx, .xlsx and .pptx.
First, an interesting comparison of the features of the ODF and OOXML formats: White Paper: Open Document Format and Office Open XML.

But don't forget:
And these articles spotted by ccornell here:


OpenOffice.org 2.#:
Some old links with pointers to additional resources:
- About the ability to handle MS Office 2007 files
- Unable to open MS Office 2007 Word doc(x)s sent in open xml
- Compatibility with Office 2007

Re: MS Office 2007 file format (docx, xlsx, pptx, ppsx)

Posted: Thu Apr 10, 2008 9:52 am
by huw
The OOXML (Office Open XML) container is a ZIP-style archive, just like ODF, so in an emergency you can access most text content that way without needing to install or download anything at all. Just rename the file so that it ends with .zip and open it with any archive tool.
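
For anyone who would rather script this than rename files by hand, here is a minimal sketch in Python (standard library only). It assumes the file is a Word document whose main part is stored at word/document.xml, which is the usual location; the function name docx_text is made up for this example.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, as declared on <w:document>.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_text(source):
    """Return the text runs of a .docx, starting a new line at each paragraph.

    `source` may be a file name or a file-like object. Assumes the main
    document part lives at word/document.xml.
    """
    with zipfile.ZipFile(source) as package:
        root = ET.fromstring(package.read("word/document.xml"))

    parts = []
    for element in root.iter():  # document order
        if element.tag == W + "p" and parts:
            parts.append("\n")  # paragraph boundary
        elif element.tag == W + "t":
            parts.append(element.text or "")  # literal text run
    return "".join(parts)
```

Usage would be something like print(docx_text("report.docx")), where report.docx is a placeholder name. This only recovers raw text: formatting, images, and tables are ignored, and text that is stored in both branches of an mc:AlternateContent block will show up twice.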

Re: MS Office 2007 file format (docx, xlsx, pptx, ppsx)

Posted: Sat May 03, 2008 8:49 pm
by enderandrew
Color me stupid but the odf-converter-integrator for docx/xlsx/pptx tool is apparently the best means of conversion, yet it says it can't be properly integrated directly into OOo. Why is this?

Re: MS Office 2007 file format (docx, xlsx, pptx, ppsx)

Posted: Sat May 03, 2008 9:50 pm
by Hagar Delest
The converter is made by Novell, and their build is slightly different from Sun's version. The two companies don't have the same position on the development of OOo (even if we can't say there is any fork); that's why the Novell build often has more features than Sun's version.

Re: MS Office 2007 file format (docx, xlsx, pptx, ppsx)

Posted: Mon Sep 15, 2008 4:22 pm
by jvanriper
Is there an ODF plug-in for Mac OS X? I am trying to send OpenOffice documents created on a PC running Windows Vista to a Mac running OS X 10.5.4 with Microsoft Office 2004. Thanks,

Re: MS Office 2007 file format (docx, xlsx, pptx, ppsx)

Posted: Mon Sep 15, 2008 4:51 pm
by Hagar Delest
Why do you need a plug-in, since you work with OOo and MS Office 2004 (which still uses the binary .doc format)? Can't you save in .doc?

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Mon Dec 08, 2008 3:42 pm
by delboy4369
Hi all,
I too am having trouble with these file extensions. I have tried the Office support pack from Microsoft and I have version 3 of OO.o, but I still can't open the files. Can anyone help?

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Sun May 24, 2009 9:32 pm
by fichy
Hi,

I had a look at the thread and opened quite a lot of links. I also had a look at the odf-converter-integrator for docx/xlsx/pptx. However, it supports only Windows and Linux... :(

I am using Mac OS X Version 10.5.6 and I really need to convert these stupid MS Office 2007 documents (docx, xlsx, pptx, ppsx).
Is there anything, an extension or whatever, that I could download for Mac OS X to open these documents?

Please, I need HELP !!

Thanks,
fichy

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Sun May 24, 2009 9:51 pm
by Hagar Delest
3.x should open them natively. Can you give a link to such a file so that we can try?

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Thu Dec 16, 2010 4:56 pm
by thomasjk
And your point is?

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Wed Dec 21, 2011 5:39 pm
by AnrDaemon
If you ask me, OOo is unable to open MS Office XML documents above a certain size (several megabytes of XML code, which means 40+ pages): Office just hangs or throws an exception saying that the document is broken.
Knowing that the document only contains simple formatting (and opens easily after being re-saved in binary format), I think it's a near-perfect indication that MS does not comply with its own standards.

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Mon Mar 11, 2013 10:19 am
by Villeroy
Preliminary Google reply to DIS 29500 : the consideration of ECMA-376 OOXML for ISO standardization. [PDF]

where Jeremy Allison (@Google) complains about OOXML mainly for two technical reasons:
Google wrote: No useful guide to specification. Although it has made the specification available, Microsoft has made it impossible to find the relevant information swiftly as the specification is over 6,000 pages long.
Google wrote: Continued use of binary code. In addition, the specification continues to use binary code although the significant advantage of XML is that it is generally text based and therefore humanly readable.
[...]
In addition, many of these binary sections refer back to the Windows architecture which is unknown to software engineers outside Microsoft, ...
As an example, Jeremy Allison cites a binary font declaration and mentions:
• Paragraph conditional formatting
• Table cell conditional formatting
• Table row conditional formatting
• Table style conditional formatting settings exception
• Pane format filter.

IMHO, the main conclusion on the technical side of the problem is this:
Although OOXML may formally comply with Ecma, it was clearly not designed with an “open” spirit. Comparing the current with the future situation, interoperability is likely to become more difficult instead of easier. The implementation of a fully compatible ODF importer (the current efforts regarding .doc and .xls) is not an easy task, but it is dwarfed by the implementation of a fully compatible OOXML importer, which we estimate to take something between 50 – 500 person years, or even longer. Therefore, although it is theoretically possible to generate an OOXML document, this document will probably only use a very small subset of the standard.
Then he states the legal implications of what MS refers to as the “Open Specification Promise”.

Hundreds of man-years for an acceptable import filter (leaving aside any export filter) and no guarantee (just a promise) that you won't be sued by MS or even third parties!

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Fri May 31, 2013 1:53 pm
by sampei
Thanks for this post; it is useful to me.

However, I thought I would report that one link is no longer working:

Implementation of OOXML: What Governments Need to Know (PDF)

I am getting an Error 404 - Not Found.

Has the article been moved?

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Fri May 31, 2013 2:05 pm
by RoryOF
This document is (at the time of this post) at:
http://wiki.openoffice.org/w/images/9/9 ... ct2010.pdf

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Sun Oct 20, 2013 11:42 pm
by mdotley
Both links for "Microsoft OOXML: Dead Format Walking" appear to be broken. Try this for what appears to be the original article at a new URI: http://www.eweek.com/reviews/microsoft_ ... t_walking/

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Thu Apr 28, 2016 4:31 am
by jackmira
The file format is really complicated. It would be great if there were a dedicated tool to view and learn these formats.

Re: MS Office 2007 OOXML file format (docx, xlsx, pptx, ppsx)

Posted: Sun Jul 22, 2018 5:21 pm
by John_Ha
See textbox.docx, which has a Microsoft Textbox containing some (Latin lorem ipsum ...) text. Nothing is visible in AOO (though all is visible in LO, as LO supports Microsoft Textboxes).

I extracted document.xml from the .docx file. The file, when pretty printed as below, is 143 lines long. From what I understand, everything between the start tag <mc:AlternateContent> on line 12 and its matching end tag </mc:AlternateContent> on line 130, 13 lines from the end, is outside the OOXML Specification, so not visible in AOO. So much for Microsoft's using its own standards.

Code:

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<w:document xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing" xmlns:wps="http://schemas.microsoft.com/office/word/2010/wordprocessingShape" xmlns:wpg="http://schemas.microsoft.com/office/word/2010/wordprocessingGroup" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:wp14="http://schemas.microsoft.com/office/word/2010/wordprocessingDrawing" xmlns:w14="http://schemas.microsoft.com/office/word/2010/wordml" mc:Ignorable="w14 wp14">
	<w:body>
		<w:p>
			<w:pPr>
				<w:pStyle w:val="Normal"/>
				<w:spacing w:before="0" w:after="200"/>
				<w:rPr/>
			</w:pPr>
			<w:r>
				<w:rPr/>
				<mc:AlternateContent>
					<mc:Choice Requires="wps">
						<w:drawing>
							<wp:anchor behindDoc="0" distT="0" distB="0" distL="114300" distR="114300" simplePos="0" locked="0" layoutInCell="1" allowOverlap="1" relativeHeight="2">
								<wp:simplePos x="0" y="0"/>
								<wp:positionH relativeFrom="column">
									<wp:posOffset>394335</wp:posOffset>
								</wp:positionH>
								<wp:positionV relativeFrom="paragraph">
									<wp:posOffset>473710</wp:posOffset>
								</wp:positionV>
								<wp:extent cx="4606925" cy="1894840"/>
								<wp:effectExtent l="0" t="0" r="12700" b="21590"/>
								<wp:wrapNone/>
								<wp:docPr id="1" name="Text Box 1"/>
								<a:graphic xmlns:a="http://schemas.openxmlformats.org/drawingml/2006/main">
									<a:graphicData uri="http://schemas.microsoft.com/office/word/2010/wordprocessingShape">
										<wps:wsp>
											<wps:cNvSpPr/>
											<wps:spPr>
												<a:xfrm>
													<a:off x="0" y="0"/>
													<a:ext cx="4606200" cy="1894320"/>
												</a:xfrm>
												<a:prstGeom prst="rect">
													<a:avLst/>
												</a:prstGeom>
												<a:solidFill>
													<a:schemeClr val="lt1"/>
												</a:solidFill>
												<a:ln w="6480">
													<a:solidFill>
														<a:srgbClr val="000000"/>
													</a:solidFill>
													<a:round/>
												</a:ln>
											</wps:spPr>
											<wps:style>
												<a:lnRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:lnRef>
												<a:fillRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:fillRef>
												<a:effectRef idx="0">
													<a:schemeClr val="accent1"/>
												</a:effectRef>
												<a:fontRef idx="minor"/>
											</wps:style>
											<wps:txbx>
												<w:txbxContent>
													<w:p>
														<w:pPr>
															<w:pStyle w:val="FrameContents"/>
															<w:spacing w:lineRule="auto" w:line="240" w:before="0" w:after="200"/>
															<w:jc w:val="left"/>
															<w:rPr>
																<w:b w:val="false"/>
																<w:b w:val="false"/>
																<w:bCs w:val="false"/>
															</w:rPr>
														</w:pPr>
														<w:r>
															<w:rPr>
																<w:b w:val="false"/>
																<w:bCs w:val="false"/>
																<w:color w:val="00000A"/>
																<w:sz w:val="28"/>
															</w:rPr>
															<w:t>Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</w:t>
														</w:r>
													</w:p>
												</w:txbxContent>
											</wps:txbx>
											<wps:bodyPr>
												<a:prstTxWarp prst="textNoShape"/>
												<a:noAutofit/>
											</wps:bodyPr>
										</wps:wsp>
									</a:graphicData>
								</a:graphic>
							</wp:anchor>
						</w:drawing>
					</mc:Choice>
					<mc:Fallback>
						<w:pict>
							<v:rect id="shape_0" ID="Text Box 1" fillcolor="white" stroked="t" style="position:absolute;margin-left:31.05pt;margin-top:37.3pt;width:362.65pt;height:149.1pt">
								<w10:wrap type="square"/>
								<v:fill o:detectmouseclick="t" type="solid" color2="black"/>
								<v:stroke color="black" weight="6480" joinstyle="round" endcap="flat"/>
								<v:textbox>
									<w:txbxContent>
										<w:p>
											<w:pPr>
												<w:pStyle w:val="FrameContents"/>
												<w:spacing w:lineRule="auto" w:line="240" w:before="0" w:after="200"/>
												<w:jc w:val="left"/>
												<w:rPr>
													<w:b w:val="false"/>
													<w:b w:val="false"/>
													<w:bCs w:val="false"/>
												</w:rPr>
											</w:pPr>
											<w:r>
												<w:rPr>
													<w:b w:val="false"/>
													<w:bCs w:val="false"/>
													<w:color w:val="00000A"/>
													<w:sz w:val="28"/>
												</w:rPr>
												<w:t>Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</w:t>
											</w:r>
										</w:p>
									</w:txbxContent>
								</v:textbox>
							</v:rect>
						</w:pict>
					</mc:Fallback>
				</mc:AlternateContent>
			</w:r>
		</w:p>
		<w:sectPr>
			<w:type w:val="nextPage"/>
			<w:pgSz w:w="11906" w:h="16838"/>
			<w:pgMar w:left="1440" w:right="1440" w:header="0" w:top="1440" w:footer="0" w:bottom="1440" w:gutter="0"/>
			<w:pgNumType w:fmt="decimal"/>
			<w:formProt w:val="false"/>
			<w:textDirection w:val="lrTb"/>
			<w:docGrid w:type="default" w:linePitch="360" w:charSpace="4096"/>
		</w:sectPr>
	</w:body>
</w:document>
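
For anyone who wants to check their own files for the same construct, here is a minimal sketch in Python (standard library only). It takes the contents of document.xml and lists each mc:AlternateContent block, along with what its mc:Choice branches require and whether an mc:Fallback branch is present. The function name alternate_content_report and the shape of the returned dictionaries are made up for this example.

```python
import xml.etree.ElementTree as ET

# Markup-compatibility namespace, as declared by xmlns:mc in the listing above.
MC = "{http://schemas.openxmlformats.org/markup-compatibility/2006}"


def alternate_content_report(document_xml):
    """Describe every mc:AlternateContent block found in a document.xml.

    `document_xml` is the XML as bytes or str. Returns one dict per block:
    the Requires value of each mc:Choice child, and whether an mc:Fallback
    child exists.
    """
    root = ET.fromstring(document_xml)
    report = []
    for block in root.iter(MC + "AlternateContent"):
        choices = block.findall(MC + "Choice")  # direct children only
        report.append({
            "requires": [choice.get("Requires") for choice in choices],
            "has_fallback": block.find(MC + "Fallback") is not None,
        })
    return report
```

Run against the listing above, this should report a single block whose only choice requires the wps prefix and which does carry a fallback. An application that does not understand wps is expected to use the fallback branch, so a block without one would be the more worrying case.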