XML: Whines and Battles | WebReference

XML: Whines and Battles

XML: Old whines in new battles

By Peter Flynn (pflynn@m-net.arbornet.org)

Will the Web’s new markup grammar repeat history?


In the rush to learn, implement, and profit from XML, many of the questions being asked are the same ones which came up when HTML was new. Can we learn from the lessons of HTML and make the Web a more productive and reliable resource for providers and users alike?

Since its formal launch last December, XML (see ‘XML and HTML’) has had a lot of support from software vendors and authors, content providers, service providers, and even the Sunday papers. However, the pressure to adopt XML is having some interesting side-effects, and not all of them are desirable. It’s still too early to say for sure, but it looks like some of the early misunderstandings with HTML are about to surface again. Part of the technique for staying ahead of the posse is therefore to know the answers (or at least to avoid asking the wrong questions).


XML, if you haven’t come across it yet, is a new system for defining how you can write Web files. It’s a sawn-off version of ISO 8859, the standard used to define HTML and hundreds of other business applications.

Full ISO 8859 (SGML) is quite powerful, and can be fairly heavy going at times: it has a load of optional bells and whistles. By contrast, XML leaves out all the optional bits, so it’s easier to write programs for, but it keeps most of the useful bits, so you can use it to design your own markup language instead of being restricted to HTML.

XML has some significant benefits for content providers as well as designers:

Some side-effects of XML

However, along with death and taxes there is a third certainty in life, and that is if you want to use a new technology while it’s still new, you have to learn it, usually from scratch. There aren’t a lot of books on XML out yet, but the number is growing (half a dozen at the last count). These are high quality books, although most are pretty technical. Remember how things were when the Web was still new? Pointy brackets and plain-text editors and having to learn why <H2> made more sense than <BIG><B>? Now when a new Web user wants to write that first home page, product announcement, or golf club news-sheet, there are graphical page-design packages to help with the layout, and Web-based validation services to test it all against various browsers.

The learning curve is here again: people who have been creating Web pages without really ever seeing much of raw HTML markup will need to come to terms with the stuff in pointy brackets if they want to get in on the ground floor of XML. This time around, the apparent drawback (there is no predefined HTML to help you) is the actual benefit (XML lets you roll your own markup).

Things ought to be a bit different on the software front, too. Long before the pixels were dry on the draft XML spec, there was XML software being developed and XML documents were being written. Almost a year before the final spec was published, there was a suite of XML files to test against, and even a full XML application (CML) complete with a browser (JUMBO). Right now we’re armpit-deep in software for the technology which underlies XML: every first-year computer science student is writing an XML parser in Java; every industry, trade, or business association has someone writing at least one application (the list on the SGML/XML Web pages grows almost daily); most of the major SGML vendors have XML-compliant versions of their products; and there are some wonderfully good public domain software toolsets. What’s still missing is some of the glue to hold the bits together, and the end-user software.

Comments are welcome


All Rights Reserved. Legal Notices.
Created: May 11, 1998
Revised: May 14, 1998

URL: http://www.webreference.com/authoring/languages/xml/questions/