<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: XML file import error in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336313#M104773</link>
    <description>&lt;P&gt;Thanks. But how can I test whether the SAX parser works with my XPath query and mapping?&lt;/P&gt;&lt;P&gt;Whenever I create a generic schema by defining the XML through metadata, it uses the Dom4J parser by default. So it works there, but fails with SAX.&lt;/P&gt;&lt;P&gt;Please suggest...&lt;/P&gt;</description>
    <pubDate>Wed, 07 Dec 2022 06:24:44 GMT</pubDate>
    <dc:creator>SunitS</dc:creator>
    <dc:date>2022-12-07T06:24:44Z</dc:date>
    <item>
      <title>XML file import error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336311#M104771</link>
      <description>&lt;P&gt;Hi Team,&lt;/P&gt;&lt;P&gt;I am trying to import an XML file of 2.0 GB using Talend version 7.3.1.&lt;/P&gt;&lt;P&gt;Below is a sample of the XML format:&lt;/P&gt;&lt;P&gt;&amp;lt;xml&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;lt;local namespace = "Talend" name = "Community"&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "FName"&amp;gt;Sunit&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "LName"&amp;gt;S&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "MobNo"&amp;gt;9999888877&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "Area"&amp;gt;XYSX&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "State"&amp;gt;Maharashtra&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;field name = "Pincode"&amp;gt;421302&amp;lt;/field&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;lt;/local&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;/xml&amp;gt;&lt;/P&gt;&lt;P&gt;I am using a simple tFileInputXML to read the XML input and tDBOutput to export to the database.&lt;/P&gt;&lt;P&gt;Refer to image 1.&lt;/P&gt;&lt;P&gt;My XPath query and mapping work perfectly fine with input files from 200 MB up to a maximum of 500 MB.&lt;/P&gt;&lt;P&gt;XPath:&amp;nbsp;"/xml/local" and mapping:&amp;nbsp;"*[contains(@name,'FName')]" ......&lt;/P&gt;&lt;P&gt;with the settings below (refer to image 2).&lt;/P&gt;&lt;P&gt;I also configured the advanced runtime settings to handle huge input.&lt;/P&gt;&lt;P&gt;Refer to image 3.&lt;/P&gt;&lt;P&gt;With the above configuration, we can only load small files, up to 500 MB.&lt;/P&gt;&lt;P&gt;If I try any larger file, it throws errors such as:&lt;/P&gt;&lt;P&gt;OpenJDK 64-Bit Server VM warning: INFO: os::commit_memory(0x0000000555380000, 3223322624, 0) failed; error='Cannot allocate memory' (errno=12)&lt;/P&gt;&lt;P&gt;or&lt;/P&gt;&lt;P&gt;Exception in thread "main" java.lang.OutOfMemoryError: GC overhead limit exceeded&lt;/P&gt;&lt;P&gt;or&lt;/P&gt;&lt;P&gt;Exception in thread "main" java.lang.OutOfMemoryError: Java heap space&lt;/P&gt;&lt;P&gt;etc.&lt;/P&gt;&lt;P&gt;Currently the code is deployed on a system with 16 GB RAM running Linux.&lt;/P&gt;&lt;P&gt;Kindly check and help me solve this issue as soon as possible.&lt;/P&gt;&lt;P&gt;I would also like to use the SAX parser, but it does not work with the current XPath.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 22:18:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336311#M104771</guid>
      <dc:creator>SunitS</dc:creator>
      <dc:date>2024-11-15T22:18:17Z</dc:date>
    </item>
    <item>
      <title>Re: XML file import error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336312#M104772</link>
      <description>&lt;P&gt;Dom4J won't work well unless you have A LOT of memory. It's great for processing smaller files, but terrible for huge files. Maybe try Xerces....although I suspect that this will also have issues. The best parser to use is SAX....but you have the least control with that. But, a way to consider this is "divide and conquer". Use SAX to split the full XML into loop sections, then process each of those sections independently. Select each loop section as a "NODE" and then process the nodes with another component using Dom4J.&lt;/P&gt;</description>
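      <!--
Editorial sketch, not part of the original reply. The "divide and conquer" idea above (stream the file, isolate one loop section at a time, then treat each section as a small tree) can be illustrated outside Talend. This minimal Python sketch assumes the sample layout from the question (/xml/local with field children). The helper name iter_local_records is hypothetical, and xml.etree.ElementTree.iterparse stands in for Talend's SAX mode; it is not how tFileInputXML is implemented.

```python
import io
import xml.etree.ElementTree as ET


def iter_local_records(source):
    """Yield one dict per <local> element without building the whole tree."""
    # Ask for "start" events too, so the root element can be captured first.
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)  # first event is the start of the root <xml> element
    for event, elem in context:
        if event == "end" and elem.tag == "local":
            # Rough equivalent of mappings like *[contains(@name,'FName')]:
            # key every <field> by the value of its name attribute.
            yield {f.get("name"): (f.text or "") for f in elem.iter("field")}
            # Drop the processed section so memory does not grow with file size.
            root.clear()


# Tiny stand-in for the 2 GB input, following the sample in the question.
SAMPLE = b"""<xml>
 <local namespace="Talend" name="Community">
  <field name="FName">Sunit</field>
  <field name="LName">S</field>
  <field name="Pincode">421302</field>
 </local>
</xml>"""

records = list(iter_local_records(io.BytesIO(SAMPLE)))
```

Under these assumptions, memory use is driven by the size of a single local section rather than by the size of the file, which is the property the reply is after.
      -->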
      <pubDate>Tue, 06 Dec 2022 00:43:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336312#M104772</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-06T00:43:05Z</dc:date>
    </item>
    <item>
      <title>Re: XML file import error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336313#M104773</link>
      <description>&lt;P&gt;Thanks. But how can I test whether the SAX parser works with my XPath query and mapping?&lt;/P&gt;&lt;P&gt;Whenever I create a generic schema by defining the XML through metadata, it uses the Dom4J parser by default. So it works there, but fails with SAX.&lt;/P&gt;&lt;P&gt;Please suggest...&lt;/P&gt;</description>
      <pubDate>Wed, 07 Dec 2022 06:24:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336313#M104773</guid>
      <dc:creator>SunitS</dc:creator>
      <dc:date>2022-12-07T06:24:44Z</dc:date>
    </item>
    <item>
      <title>Re: XML file import error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336314#M104774</link>
      <description>&lt;P&gt;It is a terrible idea to have such large XML files. This is for sure a big design flaw. The first thing you MUST do is cut the huge file into smaller pieces! It does not matter which tool you use to process these large files, it is always a nightmare.&lt;/P&gt;&lt;P&gt;I would try to change the generation process to build more than one file (e.g. 100 instead of one), and if this is not possible I would use a SAX parser to cut the large file into smaller files without trying to use complex XPath expressions. Simply cut the file at one of the tags near the root - there must be a lot of them!&lt;/P&gt;&lt;P&gt;You will see that the performance is far better, and you have the option to process these smaller files in parallel!&lt;/P&gt;</description>
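      <!--
Editorial sketch, not part of the original reply. Cutting the large file at a tag near the root, as suggested above, could look roughly like this in Python. The function name split_xml, the part_NNNNN.xml naming scheme and the records_per_file parameter are all hypothetical, and the record tag "local" is taken from the sample in the question. The streaming is done with xml.etree.ElementTree.iterparse rather than a Talend component.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def split_xml(source_path, out_dir, records_per_file=1000, record_tag="local"):
    """Write each group of `records_per_file` record elements to its own file."""
    paths, batch = [], []

    def flush():
        # Wrap the buffered records in the same root tag as the source file.
        path = os.path.join(out_dir, f"part_{len(paths):05d}.xml")
        with open(path, "wb") as out:
            out.write(f"<{root.tag}>\n".encode())
            out.writelines(batch)
            out.write(f"</{root.tag}>\n".encode())
        paths.append(path)
        batch.clear()

    context = ET.iterparse(source_path, events=("start", "end"))
    _, root = next(context)  # capture the root element from its start event
    for event, elem in context:
        if event == "end" and elem.tag == record_tag:
            elem.tail = "\n"  # one record per line in the output
            batch.append(ET.tostring(elem))  # serialized bytes of this record
            root.clear()  # free the parsed record before reading the next one
            if len(batch) == records_per_file:
                flush()
    if batch:
        flush()  # write the final, possibly smaller, part
    return paths


# Small demonstration: 5 records split into files of 2 records each.
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "big.xml")
    with open(src, "w", encoding="utf-8") as f:
        body = "".join(
            f'<local name="r{i}"><field name="FName">n{i}</field></local>'
            for i in range(5)
        )
        f.write("<xml>" + body + "</xml>")
    parts = split_xml(src, tmp, records_per_file=2)
    part_count = len(parts)
    first_part_records = len(ET.parse(parts[0]).getroot().findall("local"))
    last_part_records = len(ET.parse(parts[-1]).getroot().findall("local"))
```

Each resulting part is a small, well-formed document with the same root tag, so the existing XPath query and mapping should apply to it unchanged, and the parts could be processed in parallel.
      -->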
      <pubDate>Wed, 07 Dec 2022 08:41:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336314#M104774</guid>
      <dc:creator>jlolling</dc:creator>
      <dc:date>2022-12-07T08:41:06Z</dc:date>
    </item>
    <item>
      <title>Re: XML file import error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336315#M104775</link>
      <description>&lt;P&gt;Creating XML and reading XML are completely different. I explained how you might go about this in my original post and @Jan Lolling has essentially said the same.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Dec 2022 08:57:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/XML-file-import-error/m-p/2336315#M104775</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-07T08:57:05Z</dc:date>
    </item>
  </channel>
</rss>

