spacer

Webref WebRef   Sitemap · Experts · Tools · Services · Newsletters · About i.com

home / experts / xml / column22

XML Parsers

The candidates

Managed Services Subject Matter Expert (CA)
Next Step Systems
US-CA-San Ramon

Justtechjobs.com Post A Job | Post A Resume
Developer News
Microsoft Shows Some Ankle With Visual Studio
Gentoo Linux Cancels Distribution
It's Official: Windows 7 at PDC, WinHEC

IBM XML4J
Apache Xerces
Sun Project XMicrosoft MSXMLOracle XML Parser for JavaJames Clark XP
well-formedness checking+++++
validating++++-
XML-Schema+----
Namespaces++++-
XSL-Twith LotusXSL-++-
Java++-++
Win32through Javathrough Javathrough Java+-
SAX 1.0+++++
SAX 2.0+-+--
DOM Level 1 1.0++++-
DOM Level 2 1.0+----
Open Source+---+
DownloadXML4J
Xerces
ProjectXMSXMLORAXP

Parsers are different not only in their support for checking and transforming documents but also in the way they read a document. Event-based parsers read the text sequentially, and whenever a start or end tag appears an event is sent to the application. The Simple API for XML (SAX) is such an API. With the second approach the parser builds a hierachical data structure from the content of the document. This is how the Document Object Model (DOM) API works, similar to the HTML document tree in a Web browser. You can find a more in-depth discussion of these programming models in column11.

The quality of a parser is largely defined by its conformance to the XML standard. A test suite has been defined by the Organization for Advancement of Structured Information Systems (OASIS). A set of over 1000 valid and invalid documents have been defined to check a parser's capabilities of accepting the valid ones and rejecting the invalid ones.

Only Sun's parser passes all of the tests. XML4J rejects some valid documents that contain UTF-16 special characters. This can create difficulties when processing foreign language documents. The Java version of XP parser is in early beta state but already passes almost all tests. The Oracle parser seems to be based on SGML code, it accepts some documents that are illegal in XML but are legal in the more flexible SGML definition. MSXML bails out sometimes too early when presented with illegal documents. Different sub-versions produce different results, of course. (See column20.)

Conclusion

An XML parser is the key building block for every XML application. Sun's parser and XP currently support the XML standard best. IBM's product is at the forefront of implementing new APIs. All of the above-mentioned parsers are good candidates to build your XML applications on, and when you use standard APIs such as SAX or DOM you should be able to swap different implementations in and out without changing a single line of code anyway. The most important thing is to start your XML project today!

http://www.internet.com



JupiterOnlineMedia

internet.comearthweb.comDevx.commediabistro.comGraphics.com

Search:

Jupitermedia Corporation has two divisions: Jupiterimages and JupiterOnlineMedia

Jupitermedia Corporate Info


Legal Notices, Licensing, Reprints, & Permissions, Privacy Policy.

Advertise | Newsletters | Tech Jobs | Shopping | E-mail Offers

Solutions
Whitepapers and eBooks
IBM Whitepaper: Innovative Collaboration to Advance Your Business
Internet.com eBook: Real Life Rails
Avaya Article: Call Control XML - Powerful, Standards-Based Call Control
Internet.com eBook: The Pros and Cons of Outsourcing
Go Parallel Article: Scalable Parallelism with Intel(R) Threading Building Blocks
Internet.com eBook: Best Practices for Developing a Web Site
IBM CXO Whitepaper: The 2008 Global CEO Study "The Enterprise of the Future"
Avaya Article: Call Control XML in Action - A CCXML Auto Attendant
Go Parallel Article: James Reinders on the Intel Parallel Studio Beta Program
IBM CXO Whitepaper: Unlocking the DNA of the Adaptable Workforce--The Global Human Capital Study 2008
Adobe Acrobat Connect Pro: Web Conferencing and eLearning Whitepapers
Go Parallel Article: Getting Started with TBB on Windows
HP eBook: Storage Networking , Part 1
MORE WHITEPAPERS, EBOOKS, AND ARTICLES
Webcasts
Go Parallel Video: Intel(R) Threading Building Blocks: A New Method for Threading in C++
HP Video: Is Your Data Center Ready for a Real World Disaster?
Microsoft Partner Portal Video: Microsoft Gold Certified Partners Build Successful Practices
HP On Demand Webcast: Virtualization in Action
Go Parallel Video: Performance and Threading Tools for Game Developers
Rackspace Hosting Center: Customer Videos
Intel vPro Developer Virtual Bootcamp
HP Disaster-Proof Solutions eSeminar
HP On Demand Webcast: Discover the Benefits of Virtualization
MORE WEBCASTS, PODCASTS, AND VIDEOS
Downloads and eKits
Microsoft Download: Silverlight 2 Software Development Kit Beta 2
30-Day Trial: SPAMfighter Exchange Module
Red Gate Download: SQL Toolbelt
Iron Speed Designer Application Generator
Microsoft Download: Silverlight 2 Beta 2 Runtime
MORE DOWNLOADS, EKITS, AND FREE TRIALS
Tutorials and Demos
IBM IT Innovation Article: Green Servers Provide a Competitive Advantage
Microsoft Article: Expression Web 2 for PHP Developers--Simplify Your PHP Applications
Featured Algorithm: Intel Threading Building Blocks - parallel_reduce
MORE TUTORIALS, DEMOS AND STEP-BY-STEP GUIDES
webref The latest from WebReference.com Browse >
Controllers: Programming Application Logic - Part 2 · How to Use JavaScript to Validate Form Data · Controllers: Programming Application Logic
Sitemap · Experts · Tools · Services · Email a Colleague · Contact FREE Newsletters 
 The latest from internet.com
Sprint Launches Mobile WiMAX Network · Albatron Downsizes with the KI780G Mini-ITX Motherboard · Can't Find a Wi-Fi Network? Make Your Own.

Produced by Michael Claßen
All Rights Reserved. Legal Notices.

URL: http://www.webreference.com/xml/column22/2.html
Created: Oct 20, 2000
Revised: Oct 20, 2000