<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic tFileInputMSXML - Document in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tFileInputMSXML-Document/m-p/2313842#M84660</link>
    <description>Hi, 
&lt;BR /&gt;I am trying to parse an XML file using the tFileInputMSXML component. 
&lt;BR /&gt;Sample XML layout: 
&lt;BR /&gt; 
&lt;PRE&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&lt;BR /&gt;&amp;lt;employee id="1"&amp;gt;&lt;BR /&gt;	&amp;lt;epno type="1"&amp;gt;1&amp;lt;/epno&amp;gt;&lt;BR /&gt;	...&lt;BR /&gt;	&amp;lt;years&amp;gt;&lt;BR /&gt;		&amp;lt;year id="2011" update="2011-07-01"&amp;gt;&lt;BR /&gt;			&amp;lt;assessment by="1"&amp;gt;10&amp;lt;/assessment&amp;gt;&lt;BR /&gt;			...&lt;BR /&gt;			&amp;lt;title xml:lang="en"&amp;gt;ABC&amp;lt;/title&amp;gt;&lt;BR /&gt;			&amp;lt;title xml:lang="fr"&amp;gt;ABC&amp;lt;/title&amp;gt;&lt;BR /&gt;		&amp;lt;/year&amp;gt;&lt;BR /&gt;	&amp;lt;/years&amp;gt;&lt;BR /&gt;&amp;lt;/employee&amp;gt;&lt;/PRE&gt; 
&lt;BR /&gt;I set the Root XPath Query to "/employee". 
&lt;BR /&gt;In the subschema that causes the problem, the XPath Query is "/years/year". 
&lt;BR /&gt;In the schema for years, I have the assessment field and a column named XML_Doc of type "Document". 
&lt;BR /&gt;Without the XML_Doc column, everything works as expected. 
&lt;BR /&gt;I want to populate the XML_Doc column with the XML of each year element so that I can use the tExtractXMLField component to read the titles as well as the year. To capture the whole element, I set the column's XPath Query to ".", but as soon as I add the column I get the following error: 
&lt;BR /&gt; 
&lt;PRE&gt;Error on line 1 of document &amp;nbsp;: Content is not allowed in prolog. Nested exception content is not allowed in prolog.&lt;BR /&gt;&lt;/PRE&gt; 
&lt;BR /&gt;Version: Talend Open Studio (TOS) 5.1.1.r118616 
&lt;BR /&gt;Thank you in advance.</description>
    <pubDate>Tue, 30 Sep 2014 20:21:39 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2014-09-30T20:21:39Z</dc:date>
    <item>
      <title>tFileInputMSXML - Document</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tFileInputMSXML-Document/m-p/2313842#M84660</link>
      <description>Hi, 
&lt;BR /&gt;I am trying to parse an XML file using the tFileInputMSXML component. 
&lt;BR /&gt;Sample XML layout: 
&lt;BR /&gt; 
&lt;PRE&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&lt;BR /&gt;&amp;lt;employee id="1"&amp;gt;&lt;BR /&gt;	&amp;lt;epno type="1"&amp;gt;1&amp;lt;/epno&amp;gt;&lt;BR /&gt;	...&lt;BR /&gt;	&amp;lt;years&amp;gt;&lt;BR /&gt;		&amp;lt;year id="2011" update="2011-07-01"&amp;gt;&lt;BR /&gt;			&amp;lt;assessment by="1"&amp;gt;10&amp;lt;/assessment&amp;gt;&lt;BR /&gt;			...&lt;BR /&gt;			&amp;lt;title xml:lang="en"&amp;gt;ABC&amp;lt;/title&amp;gt;&lt;BR /&gt;			&amp;lt;title xml:lang="fr"&amp;gt;ABC&amp;lt;/title&amp;gt;&lt;BR /&gt;		&amp;lt;/year&amp;gt;&lt;BR /&gt;	&amp;lt;/years&amp;gt;&lt;BR /&gt;&amp;lt;/employee&amp;gt;&lt;/PRE&gt; 
&lt;BR /&gt;I set the Root XPath Query to "/employee". 
&lt;BR /&gt;In the subschema that causes the problem, the XPath Query is "/years/year". 
&lt;BR /&gt;In the schema for years, I have the assessment field and a column named XML_Doc of type "Document". 
&lt;BR /&gt;Without the XML_Doc column, everything works as expected. 
&lt;BR /&gt;I want to populate the XML_Doc column with the XML of each year element so that I can use the tExtractXMLField component to read the titles as well as the year. To capture the whole element, I set the column's XPath Query to ".", but as soon as I add the column I get the following error: 
&lt;BR /&gt; 
&lt;PRE&gt;Error on line 1 of document &amp;nbsp;: Content is not allowed in prolog. Nested exception content is not allowed in prolog.&lt;BR /&gt;&lt;/PRE&gt; 
&lt;BR /&gt;Version: Talend Open Studio (TOS) 5.1.1.r118616 
&lt;BR /&gt;Thank you in advance.</description>
      <pubDate>Tue, 30 Sep 2014 20:21:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tFileInputMSXML-Document/m-p/2313842#M84660</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-09-30T20:21:39Z</dc:date>
    </item>
    <item>
      <title>Re: tFileInputMSXML - Document</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tFileInputMSXML-Document/m-p/2313843#M84661</link>
      <description>Hi
&lt;BR /&gt;tFileInputMSXML cannot extract a Document from an XML file. Use tFileInputXML instead and check the 'Get Nodes' box for the column.
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
      <pubDate>Sun, 12 Oct 2014 10:57:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tFileInputMSXML-Document/m-p/2313843#M84661</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-12T10:57:10Z</dc:date>
    </item>
  </channel>
</rss>

