Developing Data-Oriented XML Formats (XML in a Nutshell, 2nd Edition)

15.2. Developing Data-Oriented XML Formats

Despite the mature status of most of XML's core technologies, XML application development is only now being recognized as a distinct discipline. Many architects and XML developers are attempting to turn existing design methodologies (like UML) and design patterns to the problem of constructing markup languages, but a widely accepted design process for creating XML applications still does not exist.

TIP: The term "XML application" is often used in XML contexts to describe an XML vocabulary for a particular domain rather than the software used to process it. This may seem a little strange to developers used to creating software applications, but it makes sense if you think about integrating a software application with an XML application, for instance.

XML applications can range in scope from a proprietary vocabulary used to store a single computer program's configuration settings to an industry-wide standard for storing consumer loan applications. Although the specifics and sometimes the sequence will vary, the basic steps involved in creating a new XML application are as follows:

Determine the requirements of the application.
Look for existing applications that might meet those requirements.
Choose a validation model.
Decide on a namespace structure.
Plan for expansion.
Consider the impact of the design on application developers.
Determine how old and new versions of the application will coexist.

The following sections explore each of these steps in greater depth.

15.2.6. Maintaining Compatibility

Maintaining backward compatibility with existing documents is a primary concern for XML applications that are widely used by diverse audiences. The difficulties faced by standards organizations when dealing with the task of updating a popular application (such as HTML) are formidable. While most applications may not become as widespread as HTML, some thought should be given in advance as to how new versions of a schema or DTD will interact with existing documents.

One possible approach to maintaining backward compatibility is to create a new, distinct namespace that will be used to mark new element declarations or perhaps to change the namespace of the entire document to reflect a substantially changed version. Another possible strategy is only to extend existing applications without removing prior functionality. The most important thing is to ensure that each instance document for an application has some readily identifiable marker that associates it with a particular version of a DTD or schema. The good news is that the highly transformable nature of XML makes it very easy to migrate old documents to new document formats.

Removing functionality is possible, but frequently difficult, once a format is widely used. Deprecating functionality--marking it as a likely target for removal a version or several before it is actually removed--is one approach. While deprecated features often linger in implementations long after they've been targeted for removals, they change the expectations of developers building new applications and make it possible, if slow, to remove functionality.

15.2. Developing Data-Oriented XML Formats

15.2.1. Basic Application Requirements

15.2.1.1. Where and how will new documents be created?

15.2.1.2. How complex will the document be?

15.2.1.3. How will documents be consumed?

15.2.1.4. How widely will the resulting documents be distributed?

15.2.1.5. Will others need to incorporate this document structure into their own applications?

15.2.2. Investigating Available Options

15.2.2.1. XML vocabulary development

15.2.3. Planning for Growth

Example 15-1. extensible.dtd

Example 15-2. Document extending extensible.dtd

15.2.4. Choosing a Validation Method

15.2.5. Namespace Support

15.2.5.1. Will instance documents need to be validated using a DTD?

15.2.5.2. Will markup from this application need to be embedded in other applications?

15.2.5.3. Are there legacy documents to support?

15.2.6. Maintaining Compatibility