out |
CSV |
The transformed CSV file is a table of the tags that records their
hierarchy in the XML document, their content, and their parameters.
The file contains the following columns:
- #TID# - The unique ID for a tag (generated by the component)
- #PID# - The unique ID of its parent
- #PPID# - The dot-separated sequence of all of the tag's parents in the hierarchy
- #NAME# - The name of the tag (for instance, <a href="http://www.google.com">
has the name "a")
- #CONTENT# - The content of the tag (for instance, <a href="http://www.google.com">
bla-bla-bla</a> has content "bla-bla-bla"). Warning: quotes in the
tag content are not handled correctly!
- <---a-parameter---> - A parameter of the tag (for instance, <a href="http://www.google.com">
bla-bla-bla</a> has the parameter "href" with the value "http://www.google.com").
The CSV table contains one column for each tag parameter found in the XML document.
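
The column layout above can be illustrated with a minimal Python sketch. It is not the component's implementation. The function names `xml_to_rows` and `rows_to_csv` are hypothetical, and the sketch assumes that IDs are sequential integers in document order, that the root tag has an empty #PID# and #PPID#, and that #PPID# lists ancestor IDs from the root down. The component itself may differ on each of these points.

```python
import csv
import io
import xml.etree.ElementTree as ET

# The five fixed columns described in the list above.
FIXED_COLUMNS = ["#TID#", "#PID#", "#PPID#", "#NAME#", "#CONTENT#"]


def xml_to_rows(xml_text):
    """Flatten an XML document into one dict per tag (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    rows = []
    counter = 0

    def visit(elem, parent_id, ancestor_ids):
        nonlocal counter
        counter += 1
        tid = counter  # assumption: sequential integer IDs in document order
        row = {
            "#TID#": str(tid),
            "#PID#": "" if parent_id is None else str(parent_id),
            # assumption: ancestor IDs joined with dots, root first
            "#PPID#": ".".join(str(a) for a in ancestor_ids),
            "#NAME#": elem.tag,
            "#CONTENT#": (elem.text or "").strip(),
        }
        row.update(elem.attrib)  # each tag parameter becomes its own column
        rows.append(row)
        for child in elem:
            visit(child, tid, ancestor_ids + [tid])

    visit(root, None, [])
    return rows


def rows_to_csv(rows):
    """Write the rows as CSV text, adding one column per parameter seen."""
    params = sorted({key for row in rows for key in row} - set(FIXED_COLUMNS))
    out = io.StringIO()
    # restval="" leaves the cell empty when a tag lacks a given parameter
    writer = csv.DictWriter(out, fieldnames=FIXED_COLUMNS + params, restval="")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()
```

Calling `rows_to_csv(xml_to_rows('<p><a href="http://www.google.com">bla-bla-bla</a></p>'))` yields a header row with the five fixed columns plus an `href` column, followed by one row per tag. Python's `csv` module escapes quotes inside fields, so this sketch does not reproduce the quote-handling limitation mentioned in the warning above.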
|