ZDNet UK


Skip to Main Content

ZDNet.co.uk - Winner of Best Business Website 2007
  1. Home
  2. News
  3. Blogs
  4. Reviews
  5. Prices
  6. Resources
  7. Community
  8. My ZDNet

 

ZDNet UK RSS Feeds


IT Jobs

Office applications Toolkit

Story: Google: OOXML 'insufficient and unnecessary'

  • Previous comment

Posted by: Marbux (Thursday 28 February 2008, 10:05 PM)

  • Reply

Terse markup for speed -- NOT

Albert, you're repeating a myth propagated by Microsoft. I'm not in love with either ODF or OOXML. Both are standards in name only.

But the OpenDocument Foundation's da Vinci plug-in for Excel repeatedly loaded a million-row ODF spreadsheet faster than than Excel loaded the same spreadsheet rendered in OOXML. The margin decreases with repeated loading due to caching, but ODF is still the winner in that regard.

And if you study the OOXML specification, you will see that much of the markup, e.g., the entire volume in Part 5, specifies verbose markup. If processing speed were actually the criteria, one would expect that all markup would be terse, not just some of it.

It is important to realize that the Office apps still use the binary formats in their internal processes. OOXML support is via conversions performed by the major apps' native file support APIs using plug-ins. OOXML processing speed, to the extent it is a factor at all, is only relevant to the conversion of OOXML to the binary formats and vice versa. OOXML is out of the processing picture entirely until it comes time to convert the file to OOXML when saving.

If you study the OOXML spec, it quickly becomes apparent that the drawing line between terse markup and verbose markup is between the portions Microsoft originally submitted to Ecma and the unique portions developed through the Ecma standard development process.

The reason is fairly simple. The terse markup is a dump to XML of the markup used by particular builds of the Office apps native file support APIs' intermediate formats used in the conversion processes, the builds that were the beginning point for the standards work at Ecma. They are terse because it was far easier for Microsoft engineers to simply convert that markup to XML.

But the markup created later in the Ecma standardization process is verbose, as XML markup should be. See e.g., W3C XML v.1.0 (4th ed.).

"XML documents should be human-legible and reasonably clear. ... XML documents shall be easy to create. ... Terseness in XML markup is of minimal importance."

Terse markup mattered in the days when documents were stored on floppy discs or slow hard drives and storage space and memory capacity was a barrier, making a necessity of compressed binary file formats that were dumps to file of the in-memory binary representation ("IMBR") of a document. But verbose OOXML markup gets converted to terse markup anyway once the OOXML document is converted to IMBR for internal processing.

The better question is why Microsoft inflicted terse markup on developers. One might plausibly answer that it was to save money and development time or even hypothesize something more Machiavellian.

But the bottom line is that OOXML markup terseness is a bug, not a feature. The terse markup forces developers and authors to constantly refer to reference materials to determine markup functionality and gives scant clues for the electronic archaeologists of the future. OOXML violates the principle that "XML documents shall be easy to create .. [and] be human-legible and reasonably clear."

Private message disabled

Marbux

Marbux
n/a
Member since: February 2008

Site Activity Rating:

1

 


  • Previous comment

  • Reply to this comment
  • Return to story
  • Report this as offensive


Full Talkback thread

Story: Google: OOXML 'insufficient and unnecessary'

  1. Very nice Albert lars
  2. Microsoft double-tongued Anonymous123
  3. ODF useless for Microsoft needs Albert
  4. OOXML is fully open Albert
  5. Sorry, the comment was cut short. Here'... garyedwards
  6. Reasons for lack of interoperbility in ODF Albert
  7. ODF, The Big Picture Goldie Simmons
  8. Breaking the Web garyedwards
  9. Google has invested in competing format Albert
  10. Document standards 2000355890
  11. Questioning Google’s objectiveness harpless
  12. Microsoft's Argument is Ridiculous Goldie Simmons
  13. insufficient and unnecessary standard, designed pu... ator1940
  14. Interoperability and the binary ODF conversion di... garyedwards
  15. A bit of background... Anonymous123
  16. Microsoft moves forward with OOXML SDK Karen Friar ZD
  17. The rest of the text in the previous tal... lars
  18. Google motivation Albert
  19. Which OOXML features in particular can't... Chris Rankin
  20. XML in spirit isn't going to be as effic... Anonymous123
  21. But does even Microsoft Office use OOXML... Chris Rankin
  22. Thanks Gary, very informative Goldie Simmons
  23. Durusau's proposal is preposterous Marbux
  24. A very Interesting Take Moley
  25. Features not in ODF Albert
  26. OOXML performance explained Albert
  27. Office and OOXML David Meyer ZD
  28. MS Office 2007 does fully support O... Albert
  29. ISO Credibility garyedwards
  30. Thank you for an intelligent r... Anonymous123
  31. Of course ODF isn't backwards... Chris Rankin
  32. Then why add "read"... Chris Rankin
  33. 00o writes compliant files Goldie Simmons
  34. You are contradicting Rupert G... Chris Rankin
  35. Terse markup for speed -- NOT Marbux
  36. Tail end of previous comment garyedwards
  37. Cut to the chase garyedwards
  38. ODF also has backwards compati... Albert
  39. MS influencing ODF development... Albert
  40. No, OOo is not fully complient... Albert
  41. MS Office 2007 files fully val... Albert
  42. That's OK, I contradict R... Rupert Goodwins ZD
  43. Widespread support for OOXML a... Albert
  44. Actually OOXML is not really t... Albert
  45. Actually MS Office 2007 compli... Albert
  46. Actually, you're making all th... Chris Rankin
  47. It's a question of greate... Chris Rankin
  48. Questions for the community Goldie Simmons
  49. Do tell me more, Albert Marbux
  50. Albert, give me a single examp... Marbux
  51. Open Standards Moley
  52. If you plan for incompatiblity... Albert
  53. Believe is in the prove Albert
  54. ODF and OOXML are standards in... Marbux
  55. explaination Albert
  56. ODF better readable but less g... Albert
  57. Interoperability Albert
  58. OOXML is Open Albert
  59. Then why does the same spreads... Marbux
  60. Extensions are bugs, not featu... Marbux
  61. OOXML interop is abysmal Marbux
  62. Extensions to ODF realistic ? Albert
  63. Undocumented eXtensions and St... garyedwards
  64. Not 1,500 extensions Marbux
  65. You've got to be kidding,... Marbux

Back to: Google: OOXML 'insufficient and unnecessary'

Featured Talkback

Why do so many (virtually all) software packages think that they are so important that they have to be started automatically every time the computer boots? What is the largest number of "speed access", "update check", "camera download" and whatever other background programs you have ever seen running? Of those, how many did you really need?

By: J.A. Watson

Read full story:
Annoying software: a rogues' gallery

Vista Upgrade Blog

Microsoft's pre-modern message puts a...

Over at ZDNet.com, Ed Bott reports a first sighting of Microsoft's eagerly awaited $300 million ad campaign. Already the cause of much speculation, the consensus is that this will be... More

7 comments

A $40 CONSUMER-class router has create...

Believe it or not I don't work in IT, haven't for 7 years. Yes I work with Microsoft's Windows XP Embedded and as a result I have to know a lot about the OS, the kernal, Win API calls... More

Post a comment

Sick Puppy Redo

I generally follow a dispassionate investigative process when trying to discern what happened when a project goes bad. Although its a low priority item, it gets done simply because... More

Post a comment