Apress-Visual CSharp 2010 Recipes A Problem Solution Approach_2 potx

The recipes in this chapter describe how to do the following: • Read, parse, and manipulate XML data recipes 6-1, 6-2, 6-3, and 6-7 • Search an XML document for specific nodes, either by

Trang 2

■ ■ ■

261

XML Processing

One of the most remarkable aspects of the Microsoft NET Framework is its deep integration with XML

In many NET applications, you won’t even be aware you’re using XML technologies—they’ll just be

used behind the scenes when you serialize a Microsoft ADO.NET DataSet, call a web service, or read

application settings from a Web.config configuration file In other cases, you’ll want to work directly with the System.Xml namespaces to manipulate Extensible Markup Language (XML) data Common XML

tasks don’t just include parsing an XML file, but also include validating it against a schema, applying an Extensible Stylesheet Language (XSL) transform to create a new document or Hypertext Markup

Language (HTML) page, and searching intelligently with XPath

In NET 3.5, Microsoft added LINQ to XML, which integrates XML handling into the LINQ model for

querying data sources You can use the same keywords and syntax to query XML as you would a

collection or a database

The recipes in this chapter describe how to do the following:

• Read, parse, and manipulate XML data (recipes 6-1, 6-2, 6-3, and 6-7)

• Search an XML document for specific nodes, either by name (recipe 6-4), by

namespace (recipe 6-5), or by using XPath (recipe 6-6)

• Validate an XML document with an XML schema (recipe 6-8)

• Serialize an object to XML (recipe 9), create an XML schema for a class (recipe

10), and generate the source code for a class based on an XML schema (recipe

Trang 3

The NET Framework provides several different ways to process XML documents The one you use

depends in part upon your programming task One of the most fully featured classes is XmlDocument,

which provides an in-memory representation of an XML document that conforms to the W3C Document

Object Model (DOM) The XmlDocument class allows you to browse through the nodes in any direction,

insert and remove nodes, and change the structure on the fly For details of the DOM specification, go to

www.w3c.org

■ Note The XmlDocument class is not scalable for very large XML documents, because it holds the entire XML content in memory at once If you want a more memory-efficient alternative, and you can afford to read and process the XML piece by piece, consider the XmlReader and XmlWriter classes described in recipe 6-7

To use the XmlDocument class, simply create a new instance of the class and call the Load method with

a file name, a Stream, a TextReader, or an XmlReader object It is also possible to read the XML from a simple string with the LoadXML method You can even supply a string with a URL that points to an XML document on the Web using the Load method The XmlDocument instance will be populated with the tree

of elements, or nodes, from the source document The entry point for accessing these nodes is the root

element, which is provided through the XmlDocument.DocumentElement property DocumentElement is an XmlElement object that can contain one or more nested XmlNode objects, which in turn can contain more XmlNode objects, and so on An XmlNode is the basic ingredient of an XML file Common XML nodes

include elements, attributes, comments, and contained text

When dealing with an XmlNode or a class that derives from it (such as XmlElement or XmlAttribute),

you can use the following basic properties:

• ChildNodes is an XmlNodeList collection that contains the first level of nested

nodes

• Name is the name of the node

• NodeType returns a member of the System.Xml.XmlNodeType enumeration that

indicates the type of the node (element, attribute, text, and so on)

• Value is the content of the node, if it’s a text or CDATA node

• Attributes provides a collection of node objects representing the attributes

applied to the element

• InnerText retrieves a string with the concatenated value of the node and all nested

nodes

Trang 4

263

• InnerXml retrieves a string with the concatenated XML markup for all nested

nodes

• OuterXml retrieves a string with the concatenated XML markup for the current

node and all nested nodes

The Code

The following example walks through every element of an XmlDocument using the ChildNodes property

and a recursive method Each node is displayed in a TreeView control, with descriptive text that either

identifies it or shows its content

// Default the file name to the sample document

private void Recipe06_01_Load(object sender, EventArgs e)

// Load the XML document

XmlDocument doc = new XmlDocument();

Trang 5

// Add a TreeNode node that represents this XmlNode

TreeNode newTreeNode = treeNodes.Add(xmlNode.Name);

// Customize the TreeNode text based on the XmlNode

// type and content

}

// Call this routine recursively for each attribute

// (XmlAttribute is a subclass of XmlNode.)

}

Trang 6

265

// Call this routine recursively for each child node

// Typically, this child node represents a nested element

<productName>Blue China Tea Pot</productName>

<description>A trendy update for tea drinkers.</description>

Trang 7

266

Figure 6-1 The displayed structure of an XML document

6-2 Insert Nodes in an XML Document

How It Works

Inserting a node into the XmlDocument class is a two-step process You must first create the node, and then you insert it at the appropriate location You can then call XmlDocument.Save to persist changes

Trang 8

267

To create a node, you use one of the XmlDocument methods starting with the word Create, depending

on the type of node This ensures that the node will have the same namespace as the rest of the

document (Alternatively, you can supply a namespace as an additional string argument.) Next, you

must find a suitable related node and use one of its insertion methods to add the new node to the tree

// Create a new, empty document

XmlNode docNode = doc.CreateXmlDeclaration("1.0", "UTF-8", null);

doc.AppendChild(docNode);

// Create and insert a new element

XmlNode productsNode = doc.CreateElement("products");

doc.AppendChild(productsNode);

// Create a nested element (with an attribute)

XmlNode productNode = doc.CreateElement("product");

XmlAttribute productAttribute = doc.CreateAttribute("id");

productAttribute.Value = "1001";

productNode.Attributes.Append(productAttribute);

productsNode.AppendChild(productNode);

// Create and add the subelements for this product node

// (with contained text data)

XmlNode nameNode = doc.CreateElement("productName");

Trang 10

269

Solution

Create a helper function that accepts a tag name and content, and can generate the entire element at

once Alternatively, use the XmlDocument.CloneNode method to copy branches of an XmlDocument

How It Works

Inserting a single element into an XmlDocument requires several lines of code You can shorten this code

in several ways One approach is to create a dedicated helper class with higher-level methods for adding

elements and attributes For example, you could create an AddElement method that generates a new

element, inserts it, and adds any contained text—the three operations needed to insert most elements

public static XmlNode AddElement(string tagName,

string textContent, XmlNode parent)

public static XmlNode AddAttribute(string attributeName,

string textContent, XmlNode parent)

Trang 11

// Create the basic document

XmlNode docNode = doc.CreateXmlDeclaration("1.0", "UTF-8", null);

doc.AppendChild(docNode);

XmlNode products = doc.CreateElement("products");

doc.AppendChild(products);

// Add two products

XmlNode product = XmlHelper.AddElement("product", null, products);

entire branch, with all nested nodes

Here is an example that creates a new product node by copying the first node:

// (Add first product node.)

// Create a new element based on an existing product

product = product.CloneNode(true);

Trang 12

The XmlDocument class provides a convenient GetElementsByTagName method that searches an entire

document for nodes that have the indicated element name It returns the results as a collection of

Trang 13

272

// Load the document

doc.Load(@" \ \ProductCatalog.xml");

// Retrieve all prices

XmlNodeList prices = doc.GetElementsByTagName("productPrice");

You can also search portions of an XML document by using the XmlElement.GetElementsByTagName

method It searches all the descendant nodes looking for matches To use this method, first retrieve an

XmlNode that corresponds to an element Then cast this object to an XmlElement The following example demonstrates how to find the price node under the first product element:

// Retrieve a reference to the first product

XmlNode product = doc.GetElementsByTagName("products")[0];

// Find the price under this product

Trang 14

273

Solution

Use the overload of the XmlDocument.GetElementsByTagName method that requires a namespace name as

a string argument Additionally, supply an asterisk (*) for the element name if you want to match all

Tiêu đề	XML Processing
Trường học	University of Technology
Chuyên ngành	Computer Science
Thể loại	Bài luận
Năm xuất bản	2010
Thành phố	Hanoi

Định dạng
Số trang	95
Dung lượng	2,05 MB