This XML file contains HTML code with escaped markup: tags are written like <body/> with escaped < and > signs. This is often the case with RSS feeds.

In this example, the HTMLTransformer is configured to parse the contents of the <description> and <escaped-html>, elements, creating well-formed content from this escaped markup.

A downstream transform is used to filter the result, as the HTMLTransformer writes full HTML documents as <html> elements, for each input element that it parses.

For more info search "HTMLTransformer" in the sitemap that drives this sample.

This was parsed by the HTMLTransformer:

Here's some escaped HTML

This test document contains escaped HTML code in the description elements, as is customary in RSS documents for example.

This description element contains escaped html markup, which should be converted to proper html by the HTMLTransformer. If the Cocoon logo is shown properly, it means that the parsing worked.

This should be a blockquote and here's some bold.

Note that the HTMLTransformer generates a complete HTML document for each element that is parsed - here this is filtered downstream to keep only the contents of the <body> of the parsed documents.

This was parsed by the HTMLTransformer:

More escaped HTML

Another description element with escaped html markup, including a Cocoon logo.

Here's some italic and a link to the Cocoon site.

HTMLTransformer output

Here's some escaped HTML

More escaped HTML