Office applications Toolkit
Story: Google: OOXML 'insufficient and unnecessary'
Terse markup for speed -- NOT
Albert, you're repeating a myth propagated by Microsoft. I'm not in love with either ODF or OOXML. Both are standards in name only.
But the OpenDocument Foundation's da Vinci plug-in for Excel repeatedly loaded a million-row ODF spreadsheet faster than than Excel loaded the same spreadsheet rendered in OOXML. The margin decreases with repeated loading due to caching, but ODF is still the winner in that regard.
And if you study the OOXML specification, you will see that much of the markup, e.g., the entire volume in Part 5, specifies verbose markup. If processing speed were actually the criteria, one would expect that all markup would be terse, not just some of it.
It is important to realize that the Office apps still use the binary formats in their internal processes. OOXML support is via conversions performed by the major apps' native file support APIs using plug-ins. OOXML processing speed, to the extent it is a factor at all, is only relevant to the conversion of OOXML to the binary formats and vice versa. OOXML is out of the processing picture entirely until it comes time to convert the file to OOXML when saving.
If you study the OOXML spec, it quickly becomes apparent that the drawing line between terse markup and verbose markup is between the portions Microsoft originally submitted to Ecma and the unique portions developed through the Ecma standard development process.
The reason is fairly simple. The terse markup is a dump to XML of the markup used by particular builds of the Office apps native file support APIs' intermediate formats used in the conversion processes, the builds that were the beginning point for the standards work at Ecma. They are terse because it was far easier for Microsoft engineers to simply convert that markup to XML.
But the markup created later in the Ecma standardization process is verbose, as XML markup should be. See e.g., W3C XML v.1.0 (4th ed.).
"XML documents should be human-legible and reasonably clear. ... XML documents shall be easy to create. ... Terseness in XML markup is of minimal importance."
Terse markup mattered in the days when documents were stored on floppy discs or slow hard drives and storage space and memory capacity was a barrier, making a necessity of compressed binary file formats that were dumps to file of the in-memory binary representation ("IMBR") of a document. But verbose OOXML markup gets converted to terse markup anyway once the OOXML document is converted to IMBR for internal processing.
The better question is why Microsoft inflicted terse markup on developers. One might plausibly answer that it was to save money and development time or even hypothesize something more Machiavellian.
But the bottom line is that OOXML markup terseness is a bug, not a feature. The terse markup forces developers and authors to constantly refer to reference materials to determine markup functionality and gives scant clues for the electronic archaeologists of the future. OOXML violates the principle that "XML documents shall be easy to create .. [and] be human-legible and reasonably clear."
Full Talkback thread
Story: Google: OOXML 'insufficient and unnecessary'
-
Very nice Albert lars -
Microsoft double-tongued Anonymous123 -
ODF useless for Microsoft needs Albert -
OOXML is fully open Albert -
Sorry, the comment was cut short. Here'... garyedwards -
Reasons for lack of interoperbility in ODF Albert -
ODF, The Big Picture Goldie Simmons -
Breaking the Web garyedwards -
Google has invested in competing format Albert -
Document standards 2000355890 -
Questioning Google’s objectiveness harpless -
Microsoft's Argument is Ridiculous Goldie Simmons -
insufficient and unnecessary standard, designed pu... ator1940 -
Interoperability and the binary ODF conversion di... garyedwards -
A bit of background... Anonymous123 -
Microsoft moves forward with OOXML SDK Karen Friar
-
The rest of the text in the previous tal... lars -
Google motivation Albert -
Which OOXML features in particular can't... Chris Rankin -
XML in spirit isn't going to be as effic... Anonymous123 -
But does even Microsoft Office use OOXML... Chris Rankin -
Thanks Gary, very informative Goldie Simmons -
Durusau's proposal is preposterous Marbux -
A very Interesting Take Moley -
Features not in ODF Albert -
OOXML performance explained Albert -
Office and OOXML David Meyer
-
MS Office 2007 does fully support O... Albert -
ISO Credibility garyedwards -
Thank you for an intelligent r... Anonymous123 -
Of course ODF isn't backwards... Chris Rankin -
Then why add "read"... Chris Rankin -
00o writes compliant files Goldie Simmons -
You are contradicting Rupert G... Chris Rankin -
Terse markup for speed -- NOT Marbux -
Tail end of previous comment garyedwards -
Cut to the chase garyedwards -
ODF also has backwards compati... Albert -
MS influencing ODF development... Albert -
No, OOo is not fully complient... Albert -
MS Office 2007 files fully val... Albert -
That's OK, I contradict R... Rupert Goodwins
-
Widespread support for OOXML a... Albert -
Actually OOXML is not really t... Albert -
Actually MS Office 2007 compli... Albert -
Actually, you're making all th... Chris Rankin -
It's a question of greate... Chris Rankin -
Questions for the community Goldie Simmons -
Do tell me more, Albert Marbux -
Albert, give me a single examp... Marbux -
Open Standards Moley -
If you plan for incompatiblity... Albert -
Believe is in the prove Albert -
ODF and OOXML are standards in... Marbux -
explaination Albert -
ODF better readable but less g... Albert -
Interoperability Albert -
OOXML is Open Albert -
Then why does the same spreads... Marbux -
Extensions are bugs, not featu... Marbux -
OOXML interop is abysmal Marbux -
Extensions to ODF realistic ? Albert -
Undocumented eXtensions and St... garyedwards -
Not 1,500 extensions Marbux -
You've got to be kidding,... Marbux









