<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic tFileInput XML and xml parsing settings in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247143#M32424</link>
    <description>Hello, 
&lt;BR /&gt;I have some problems with tFileInputXML and XML parsing options settings. 
&lt;BR /&gt;I need to extract few specific data from several big pdml files (tcp capture packets xml files) to put them in a csv file and treat them afterwards. 
&lt;BR /&gt;I succeeded in performing the extraction on a "little" sample file (19ko) with the DOM parsing option (see DOM_execution.jpg, but got an "Java Heap Space" error on a bigger file (several Mo) 
&lt;BR /&gt;I tried to turn to SAX parsing option but can't get my project work even on my "little" sample file (the output is empty see SAX_execution.jpg) 
&lt;BR /&gt;Is this a possible known bug of the tFileInputXML component with SAX parsing option and corresponding code generation or am I perhaps missing something somewhere in my project configuration ? 
&lt;BR /&gt;(I'm using TOS 3.1.2) 
&lt;BR /&gt;Thanks for your help, 
&lt;BR /&gt;Damien,</description>
    <pubDate>Sat, 16 Nov 2024 13:12:12 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T13:12:12Z</dc:date>
    <item>
      <title>tFileInput XML and xml parsing settings</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247143#M32424</link>
      <description>Hello, 
&lt;BR /&gt;I have some problems with tFileInputXML and XML parsing options settings. 
&lt;BR /&gt;I need to extract few specific data from several big pdml files (tcp capture packets xml files) to put them in a csv file and treat them afterwards. 
&lt;BR /&gt;I succeeded in performing the extraction on a "little" sample file (19ko) with the DOM parsing option (see DOM_execution.jpg, but got an "Java Heap Space" error on a bigger file (several Mo) 
&lt;BR /&gt;I tried to turn to SAX parsing option but can't get my project work even on my "little" sample file (the output is empty see SAX_execution.jpg) 
&lt;BR /&gt;Is this a possible known bug of the tFileInputXML component with SAX parsing option and corresponding code generation or am I perhaps missing something somewhere in my project configuration ? 
&lt;BR /&gt;(I'm using TOS 3.1.2) 
&lt;BR /&gt;Thanks for your help, 
&lt;BR /&gt;Damien,</description>
      <pubDate>Sat, 16 Nov 2024 13:12:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247143#M32424</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T13:12:12Z</dc:date>
    </item>
    <item>
      <title>Re: tFileInput XML and xml parsing settings</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247144#M32425</link>
      <description>for information I tried to upgrade to TOS 4.1.1 but I got exactly the same behavior... 
&lt;BR /&gt;Information to reproduce is below : 
&lt;BR /&gt;- Loop XPath query : "/pdml/packet" 
&lt;BR /&gt;- mapping information : 
&lt;BR /&gt;frame : "proto/field/@value" 
&lt;BR /&gt;time : "proto/field/@show" 
&lt;BR /&gt;id : "proto/field/@value" 
&lt;BR /&gt;and the input xml sample file (this one is not a valid pdml file but is sufficient to reproduce the parsing problem I meet using SAX) : 
&lt;BR /&gt;&amp;lt;?xml version="1.0"?&amp;gt; 
&lt;BR /&gt;&amp;lt;pdml version="0" creator="wireshark/1.2.6"&amp;gt; 
&lt;BR /&gt;&amp;lt;packet&amp;gt; 
&lt;BR /&gt; &amp;lt;proto name="geninfo" pos="0" showname="General information" size="1308"&amp;gt; 
&lt;BR /&gt; &amp;lt;field name="num" pos="0" show="3" showname="Number" value="3" size="1308"/&amp;gt; 
&lt;BR /&gt; &amp;lt;/proto&amp;gt; 
&lt;BR /&gt; &amp;lt;proto name="frame" showname="Frame 3 (1308 bytes on wire, 1308 bytes captured)" size="1308" pos="0"&amp;gt; 
&lt;BR /&gt; &amp;lt;field name="frame.time_relative" showname="Time since reference or first frame: 0.007766000 seconds" size="0" pos="0" show="0.007766000"/&amp;gt; 
&lt;BR /&gt; &amp;lt;/proto&amp;gt; 
&lt;BR /&gt; &amp;lt;proto name="fake-field-wrapper"&amp;gt; 
&lt;BR /&gt; &amp;lt;field name="data.data" showname="xxx" size="1254" pos="54" show="xxx" value="xxx"/&amp;gt; 
&lt;BR /&gt; &amp;lt;/proto&amp;gt; 
&lt;BR /&gt;&amp;lt;/packet&amp;gt; 
&lt;BR /&gt;&amp;lt;/pdml&amp;gt;</description>
      <pubDate>Wed, 17 Nov 2010 17:49:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247144#M32425</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-11-17T17:49:08Z</dc:date>
    </item>
    <item>
      <title>Re: tFileInput XML and xml parsing settings</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247145#M32426</link>
      <description>It seems I'm running into the same problem as in topic : 
&lt;BR /&gt; 
&lt;A href="https://community.talend.com/t5/Design-and-Development/Problem-with-an-Xpath-expression-with-Sax-parser/td-p/112369" rel="nofollow noopener noreferrer"&gt;https://community.talend.com/t5/Design-and-Development/Problem-with-an-Xpath-expression-with-Sax-parser/td-p/112369&lt;/A&gt; 
&lt;BR /&gt;...I find that a warning could be raised to inform that the xpath expression used would not be taken into account with sax...it was not obvious to me...</description>
      <pubDate>Thu, 18 Nov 2010 09:57:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tFileInput-XML-and-xml-parsing-settings/m-p/2247145#M32426</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-11-18T09:57:32Z</dc:date>
    </item>
  </channel>
</rss>

