<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: parse flux xml in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245980#M31646</link>
    <description>Hi phil,&lt;BR /&gt;you must have the whole xml document in one row. Depending on your data flow you could use tDenormalize for example.&lt;BR /&gt;Bye&lt;BR /&gt;Volker</description>
    <pubDate>Tue, 27 Jan 2009 09:09:31 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2009-01-27T09:09:31Z</dc:date>
    <item>
      <title>parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245976#M31642</link>
      <description>i have a file 
&lt;BR /&gt;i transform this file in xml file
&lt;BR /&gt;with tFileinputFullRow + tJavaRow
&lt;BR /&gt;it's ok 
&lt;BR /&gt;now i would like extract a attribut of a element 
&lt;BR /&gt;i want to use tparseXMLrow 
&lt;BR /&gt;but i have a exception in component tParseXMLRow_1
&lt;BR /&gt;org.dom4j.DocumentException: Error on line 1 of document : XML document structures must start and end within the same entity. Nested exception: XML document structures must start and end within the same entity.
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:482)
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:365)
&lt;BR /&gt; at talenddemosjava.xmltovtg_0_1.xmltovtg.tMysqlInput_1Process(xmltovtg.java:575)
&lt;BR /&gt;how do i make ?
&lt;BR /&gt;thank a lot for your help !
&lt;BR /&gt;xml structure is
&lt;BR /&gt;&amp;lt;A&amp;gt;
&lt;BR /&gt;&amp;lt;B&amp;gt;
&lt;BR /&gt;&amp;lt;/B&amp;gt;
&lt;BR /&gt;&amp;lt;C&amp;gt;
&lt;BR /&gt;&amp;lt;data ct="hhh" &amp;gt;&amp;lt;/data&amp;gt;
&lt;BR /&gt;&amp;lt;data ct="hhh" &amp;gt;&amp;lt;/data&amp;gt;
&lt;BR /&gt;&amp;lt;/C&amp;gt;
&lt;BR /&gt;&amp;lt;/A&amp;gt;</description>
      <pubDate>Sat, 16 Nov 2024 14:05:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245976#M31642</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T14:05:32Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245977#M31643</link>
      <description>Hi,
&lt;BR /&gt;if you take a look at the error message you could find the solution: Your xml must start end end with the same tag. So you should add a opening tag at the start "&amp;lt;data&amp;gt;" for example and a closing one at the end "&amp;lt;/data&amp;gt;" in this case.
&lt;BR /&gt;Additional you should add a header in the first line like the following:
&lt;BR /&gt;&amp;lt;?xml version="1.0" encoding="iso-8859-1"?&amp;gt;
&lt;BR /&gt;You could find more information for example on: 
&lt;A href="http://en.wikipedia.org/wiki/Xml" target="_blank" rel="nofollow noopener noreferrer"&gt;Wikipedia&lt;/A&gt;.
&lt;BR /&gt;Bye
&lt;BR /&gt;Volker</description>
      <pubDate>Tue, 27 Jan 2009 06:32:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245977#M31643</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T06:32:58Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245978#M31644</link>
      <description>hi,
&lt;BR /&gt;i verif structure of my xml message
&lt;BR /&gt;it is good
&lt;BR /&gt;but i have always Exception in component tParseXMLRow_1
&lt;BR /&gt;org.dom4j.DocumentException: Error on line 1 of document : XML document structures must start and end within the same entity. Nested exception: XML document structures must start and end within the same entity.
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:482)
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:365)
&lt;BR /&gt; at talenddemosjava.v1toxml_0_1.v1toxml.tFileInputFullRow_1Process(v1toxml.java:1356)
&lt;BR /&gt;i think that the flux xml it isn't in one row but in several rows
&lt;BR /&gt;when tParseXMLRow_1 parse the first row i see the start entity but i don't see the end entity because the end entity is on another row 
&lt;BR /&gt;do i make concat each row in one row ?
&lt;BR /&gt;or is it a mistake to think that ?
&lt;BR /&gt;thank for your help !</description>
      <pubDate>Tue, 27 Jan 2009 08:46:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245978#M31644</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T08:46:56Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245979#M31645</link>
      <description>hi,&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;TABLE border="1"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;XML document structures must start and end within the same entity&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;looks like a error xml structure ... Did your root Element are closed "at the end" !?&lt;BR /&gt;&lt;PRE&gt;&amp;lt;myroot&amp;gt;&lt;BR /&gt;  &amp;lt;other&amp;gt;....&amp;lt;/other&amp;gt;&lt;BR /&gt;&amp;lt;/myroot&amp;gt;&lt;/PRE&gt;&lt;BR /&gt;You can check your file by open it in Firefox to point the error !&lt;BR /&gt;++</description>
      <pubDate>Tue, 27 Jan 2009 08:53:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245979#M31645</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T08:53:59Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245980#M31646</link>
      <description>Hi phil,&lt;BR /&gt;you must have the whole xml document in one row. Depending on your data flow you could use tDenormalize for example.&lt;BR /&gt;Bye&lt;BR /&gt;Volker</description>
      <pubDate>Tue, 27 Jan 2009 09:09:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245980#M31646</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T09:09:31Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245981#M31647</link>
      <description>hi,
&lt;BR /&gt;when i read my structure xml with tInputFileXML in logout i can extract the attritbut of element
&lt;BR /&gt;it 's good
&lt;BR /&gt;but when i parse the stream with component tParseXMLRow_1
&lt;BR /&gt;i have always Exception 
&lt;BR /&gt;org.dom4j.DocumentException: Error on line 1 of document : XML document structures must start and end within the same entity. Nested exception: XML document structures must start and end within the same entity.
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:482)
&lt;BR /&gt; at org.dom4j.io.SAXReader.read(SAXReader.java:365)
&lt;BR /&gt; at talenddemosjava.v1toxml_0_1.v1toxml.tFileInputFullRow_1Process(v1toxml.java:1356)
&lt;BR /&gt;why i can't use component tParseXMLRow_1
&lt;BR /&gt;+</description>
      <pubDate>Tue, 27 Jan 2009 09:30:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245981#M31647</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T09:30:22Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245982#M31648</link>
      <description>I think you will find the solution in my answer. tParseXmlRow is row-based. If you have spread your document over multiple rows this wouldn't work.</description>
      <pubDate>Tue, 27 Jan 2009 09:34:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245982#M31648</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T09:34:36Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245983#M31649</link>
      <description>hi,&lt;BR /&gt;how do i parse a stream with xml flux ?&lt;BR /&gt;i have only tInputFileXML , it is for file&lt;BR /&gt;but for stream , what do i use ?&lt;BR /&gt;++</description>
      <pubDate>Tue, 27 Jan 2009 09:43:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245983#M31649</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T09:43:00Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245984#M31650</link>
      <description>You could use tFileInputXml for an xml file and tParseXmlRow for XML data in an attribute of your flow. What is your input?</description>
      <pubDate>Tue, 27 Jan 2009 10:01:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245984#M31650</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T10:01:46Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245985#M31651</link>
      <description>my input is in first   a file&lt;BR /&gt;i transform this file in xml file&lt;BR /&gt;with tFileinputFullRow + tJavaRow&lt;BR /&gt;after i use tparseXMLrow for extract XML data in an attribute of flow&lt;BR /&gt;it doesn't work because xml message spread over multiple rows&lt;BR /&gt;how can i do to extract XML data in an attribute ?&lt;BR /&gt;++</description>
      <pubDate>Tue, 27 Jan 2009 10:13:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245985#M31651</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T10:13:35Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245986#M31652</link>
      <description>You could use tDenormalize no concatenate multiple rows together. Use "\n" as delimiter. you need a unique key over all rows for the whole xml document. If you do not have one (or the file has only one xml document) you could a fix value in one additional attribute inside of your tJavaRow.</description>
      <pubDate>Tue, 27 Jan 2009 10:23:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245986#M31652</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T10:23:24Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245987#M31653</link>
      <description>i don't understand how i use tDenormalize&lt;BR /&gt;can you give me a example, please ?&lt;BR /&gt;++</description>
      <pubDate>Tue, 27 Jan 2009 10:36:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245987#M31653</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T10:36:47Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245988#M31654</link>
      <description>the file has only one xml document , the xml message spread over multiple rows&lt;BR /&gt;i have only a column where is  spreaded the xml message&lt;BR /&gt;how i use tDenormalize &lt;BR /&gt;++</description>
      <pubDate>Tue, 27 Jan 2009 11:05:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245988#M31654</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T11:05:43Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245989#M31655</link>
      <description>I made an example. Take a look at the pictures. There are two jobs. The difference is in the input row delimiter. The upper one uses "---" (which is never used in the xml) and loads the whole file in one row. The second one uses "\n" as delimiter and need to merge the rows together with a tDenormalize.
&lt;BR /&gt;I modified the input:
&lt;BR /&gt;
&lt;PRE&gt;&amp;lt;?xml version="1.0" encoding="iso-8859-1"?&amp;gt;&lt;BR /&gt;&amp;lt;document&amp;gt;&lt;BR /&gt;&amp;lt;A&amp;gt;&lt;BR /&gt;&amp;lt;B&amp;gt;&lt;BR /&gt;&amp;lt;/B&amp;gt;&lt;BR /&gt;&amp;lt;C&amp;gt;&lt;BR /&gt;&amp;lt;data ct="hhh" &amp;gt;First&amp;lt;/data&amp;gt;&lt;BR /&gt;&amp;lt;data ct="iii" &amp;gt;Second&amp;lt;/data&amp;gt;&lt;BR /&gt;&amp;lt;/C&amp;gt;&lt;BR /&gt;&amp;lt;/A&amp;gt;&lt;BR /&gt;&amp;lt;/document&amp;gt;&lt;/PRE&gt;
&lt;BR /&gt;And this is the output (both the same one):
&lt;BR /&gt;
&lt;PRE&gt;ct;value&lt;BR /&gt;hhh;First&lt;BR /&gt;iii;Second&lt;/PRE&gt;
&lt;BR /&gt;Bye
&lt;BR /&gt;Volker</description>
      <pubDate>Tue, 27 Jan 2009 20:04:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245989#M31655</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-27T20:04:55Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245990#M31656</link>
      <description>thanks a lot of for your help&lt;BR /&gt;++</description>
      <pubDate>Wed, 28 Jan 2009 23:40:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245990#M31656</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-28T23:40:37Z</dc:date>
    </item>
    <item>
      <title>Re: parse flux xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245991#M31657</link>
      <description>You are welcome!</description>
      <pubDate>Thu, 29 Jan 2009 06:15:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/parse-flux-xml/m-p/2245991#M31657</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-01-29T06:15:33Z</dc:date>
    </item>
  </channel>
</rss>

