<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [resolved] Split a large XML file into small files with talend in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363591#M127453</link>
    <description>Hi  Willm, I have chcnaged my XML parser to SAX in tFileInput , and I incresased the heap size for the job , but I still have the same problem.&lt;BR /&gt;thanks for your precious suggestion</description>
    <pubDate>Mon, 02 Jun 2014 16:12:50 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2014-06-02T16:12:50Z</dc:date>
    <item>
      <title>[resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363585#M127447</link>
      <description>Hi, 
&lt;BR /&gt;I am trying to integrate data from a large XML file (300 Mo). Is there a way to do it with talend ?</description>
      <pubDate>Mon, 02 Jun 2014 13:59:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363585#M127447</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T13:59:24Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363586#M127448</link>
      <description>What is the problem that you are facing in doing this?&lt;BR /&gt;Vaibhav</description>
      <pubDate>Mon, 02 Jun 2014 14:27:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363586#M127448</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T14:27:12Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363587#M127449</link>
      <description>The problem is that I can't load the XML File (300Mo ) to the medatadata XML.
&lt;BR /&gt;Every time I try to do this talend craches</description>
      <pubDate>Mon, 02 Jun 2014 14:42:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363587#M127449</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T14:42:13Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363588#M127450</link>
      <description>I have done this using the Perl library TWIG and just used a tSystem to call perl/twig and split the XML.</description>
      <pubDate>Mon, 02 Jun 2014 15:18:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363588#M127450</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T15:18:49Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363589#M127451</link>
      <description>Jholman , Coud you give me more details about this please</description>
      <pubDate>Mon, 02 Jun 2014 15:34:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363589#M127451</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T15:34:30Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363590#M127452</link>
      <description>&lt;P&gt;Hi Seiif - Before suggesting alternatives (below), have you changed your XML parser to SAX in tFileInput, increased your heap size for the job and tried it? DOM parser is very memory intensive whereas SAX is not...&lt;BR /&gt;&lt;BR /&gt;Like jholman, I've done this using sed utility in a shell script (.sh) on the filesystem, called from a tSystem. Using sed, I looked for a particular tag (open tag for the XML), and wherever I found it, I extracted the text between.&lt;BR /&gt;Another cruder method I did recently was reading the file as plain text (tFullRow), looking for these markers in the XML, marking them with an increment counter (sequence), and then split the file using tMap. This was for queue data that needed to be processed for each 'row'.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Jun 2014 15:54:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363590#M127452</guid>
      <dc:creator>willm1</dc:creator>
      <dc:date>2014-06-02T15:54:52Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363591#M127453</link>
      <description>Hi  Willm, I have chcnaged my XML parser to SAX in tFileInput , and I incresased the heap size for the job , but I still have the same problem.&lt;BR /&gt;thanks for your precious suggestion</description>
      <pubDate>Mon, 02 Jun 2014 16:12:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363591#M127453</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T16:12:50Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363592#M127454</link>
      <description>Seiif - Can you do a simple job where you use a tFileInputFullRow to read the XML file and spit out to a tLogRow? If that works - which means your job will run, you can parse it using the 'cruder' Talend-specific solution that I mentioned above. &lt;BR /&gt;Let me know if you can do this...</description>
      <pubDate>Mon, 02 Jun 2014 17:15:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363592#M127454</guid>
      <dc:creator>willm1</dc:creator>
      <dc:date>2014-06-02T17:15:45Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363593#M127455</link>
      <description>It works with tFileInputFullRow. I will try the cruder and tell you about the results. Thanks Willm 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MECs.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/153459i5252D745DFB0BC81/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MECs.jpg" alt="0683p000009MECs.jpg" /&gt;&lt;/span&gt;</description>
      <pubDate>Mon, 02 Jun 2014 17:21:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363593#M127455</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T17:21:44Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363594#M127456</link>
      <description>Please see the relevant documentation for Twig here : 
&lt;A href="http://search.cpan.org/dist/XML-Twig/tools/xml_split/xml_split" rel="nofollow noopener noreferrer"&gt;http://search.cpan.org/dist/XML-Twig/tools/xml_split/xml_split&lt;/A&gt; 
&lt;BR /&gt;It also provides a mechanism for merging them back together again.</description>
      <pubDate>Mon, 02 Jun 2014 19:08:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363594#M127456</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-02T19:08:54Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363595#M127457</link>
      <description>Josh - would the execution server for your job (using Twig) need to have Perl installed?&lt;BR /&gt;Thanks.&lt;BR /&gt;Will</description>
      <pubDate>Mon, 02 Jun 2014 20:14:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363595#M127457</guid>
      <dc:creator>willm1</dc:creator>
      <dc:date>2014-06-02T20:14:43Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363596#M127458</link>
      <description>Yes, you would need a Perl install, you can install twig with CPAN. If you are on Windows, ActivePerl should work fine.</description>
      <pubDate>Tue, 03 Jun 2014 15:14:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363596#M127458</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-03T15:14:13Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Split a large XML file into small files with talend</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363597#M127459</link>
      <description>Hi, 
&lt;BR /&gt;I found another solution for the heap space error.The approach is reading the big XML as CSV file with tFileInputDelimited componenet and then passing data to tFileOutputXML or tAdvancedFileOutputXML which split it into small XML files. 
&lt;BR /&gt;The next step is to integrate data in this files in a database. My problem now is to find a way to schedule treatment for all this files applying theorems process management. 
&lt;BR /&gt;Thanks for your help 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Seif</description>
      <pubDate>Thu, 05 Jun 2014 16:11:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Split-a-large-XML-file-into-small-files-with-talend/m-p/2363597#M127459</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-05T16:11:59Z</dc:date>
    </item>
  </channel>
</rss>

