<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [JSON files] Parsing and splitting huge json file in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348702#M115783</link>
    <description>&lt;P&gt;Please share the sample code for parsing and splitting the JSON file in the Java component. @Michal Swiniarski&lt;/P&gt;</description>
    <pubDate>Wed, 08 Jun 2022 03:17:49 GMT</pubDate>
    <dc:creator>Anjali3</dc:creator>
    <dc:date>2022-06-08T03:17:49Z</dc:date>
    <item>
      <title>[JSON files] Parsing and splitting huge json file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348700#M115781</link>
      <description>&lt;P&gt;Hi.&lt;/P&gt;&lt;P&gt;I recently ran into a lot of issues parsing and splitting big JSON files. By big I mean, say, an array of 100,000 documents.&lt;/P&gt;&lt;P&gt;The requirement is to split one big file into many small files (one file per document), plus some uncomplicated transformations.&lt;/P&gt;&lt;P&gt;When I tried to implement it with the tFileInputJSON/tFileOutputJSON components, it took many hours because an iterate link was involved (the split files need to be named after a key value (id)).&lt;/P&gt;&lt;P&gt;This is why I changed my approach and implemented the whole parsing and splitting process in Java code (tJavaRow component).&lt;/P&gt;&lt;P&gt;I know this is not elegant, but it takes much less time (roughly 3 minutes).&lt;/P&gt;&lt;P&gt;Here is the issue: 3 minutes is still too long for the stakeholders.&lt;/P&gt;&lt;P&gt;I use the org.json library.&lt;/P&gt;&lt;P&gt;Can anyone propose a different approach or a different Java library to improve the performance?&lt;/P&gt;&lt;P&gt;I'm not a Java developer, so I'm afraid very complicated Java libraries could be challenging for me.&lt;/P&gt;&lt;P&gt;Regards.&lt;/P&gt;</description>
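      <!--
      Editorial sketch, not part of the original post. The splitting requirement described above
      (one big JSON array in, one small file per document out) could be approached by streaming
      the input and cutting it at element boundaries instead of parsing the whole array first.
      The code below is a minimal sketch under these assumptions: the input is a single top-level
      JSON array whose elements are objects, and it is valid JSON. The class name
      JsonArraySplitter, its methods, and the idOf naming hook are hypothetical, not Talend or
      org.json APIs. Whether this is faster than the org.json approach has to be measured;
      writing 100,000 small files may dominate the run time either way.

      ```java
      import java.io.IOException;
      import java.io.Reader;
      import java.io.StringReader;
      import java.io.UncheckedIOException;
      import java.nio.charset.StandardCharsets;
      import java.nio.file.Files;
      import java.nio.file.Path;
      import java.util.function.Consumer;
      import java.util.function.Function;

      public class JsonArraySplitter {

          /** Hands the raw text of each element (object or array) of a top-level JSON array to sink. */
          public static void split(Reader in, Consumer<String> sink) {
              StringBuilder doc = new StringBuilder();
              int depth = 0;            // 1 = inside the outer array, 2 or more = inside an element
              boolean inString = false; // brackets inside string literals must not change depth
              boolean escaped = false;  // previous character was a backslash inside a string
              try {
                  int c;
                  while ((c = in.read()) != -1) {
                      char ch = (char) c;
                      boolean opens = !inString && (ch == '{' || ch == '[');
                      boolean closes = !inString && (ch == '}' || ch == ']');
                      if (opens) depth += 1;
                      if (depth >= 2) doc.append(ch);
                      if (closes) {
                          depth -= 1;
                          if (depth == 1) {          // an element of the outer array just ended
                              sink.accept(doc.toString());
                              doc.setLength(0);
                          }
                      } else if (inString) {
                          if (escaped) escaped = false;
                          else if (ch == '\\') escaped = true;
                          else if (ch == '"') inString = false;
                      } else if (ch == '"') {
                          inString = true;
                      }
                  }
              } catch (IOException e) {
                  throw new UncheckedIOException(e);
              }
          }

          /** Writes each element to outDir/NAME.json, where idOf (a hypothetical hook) derives NAME. */
          public static int splitToFiles(Reader in, Path outDir, Function<String, String> idOf) {
              int[] written = {0};
              split(in, doc -> {
                  try {
                      Files.write(outDir.resolve(idOf.apply(doc) + ".json"),
                              doc.getBytes(StandardCharsets.UTF_8));
                      written[0] += 1;
                  } catch (IOException e) {
                      throw new UncheckedIOException(e);
                  }
              });
              return written[0];
          }

          public static void main(String[] args) {
              String sample = "[{\"id\":1,\"name\":\"a\"},{\"id\":2,\"tags\":[\"x\",\"y\"]}]";
              split(new StringReader(sample), System.out::println);
          }
      }
      ```

      For a real file, the Reader would typically come from Files.newBufferedReader so that
      single-character reads stay cheap. Since each emitted document is small, idOf could parse
      just that document with org.json, for example new JSONObject(doc).get("id").toString(),
      and the same per-document step is a natural place for the light transformations.
      -->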
      <pubDate>Sat, 16 Nov 2024 00:03:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348700#M115781</guid>
      <dc:creator>JackStrong</dc:creator>
      <dc:date>2024-11-16T00:03:12Z</dc:date>
    </item>
    <item>
      <title>Re: [JSON files] Parsing and splitting huge json file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348701#M115782</link>
      <description>&lt;P&gt;Hiya&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hopefully it won't be too complex, but consider using the tJavaFlex component to split the single big JSON into smaller, more manageable chunks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.talendbyexample.com/talend-tjavaflex-component-reference.html" alt="https://www.talendbyexample.com/talend-tjavaflex-component-reference.html" target="_blank"&gt;This link&lt;/A&gt; might help you.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;David&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jun 2021 10:19:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348701#M115782</guid>
      <dc:creator>David_Beaty</dc:creator>
      <dc:date>2021-06-29T10:19:17Z</dc:date>
    </item>
    <item>
      <title>Re: [JSON files] Parsing and splitting huge json file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348702#M115783</link>
      <description>&lt;P&gt;Please share the sample code for parsing and splitting the JSON file in the Java component. @Michal Swiniarski&lt;/P&gt;</description>
      <pubDate>Wed, 08 Jun 2022 03:17:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/JSON-files-Parsing-and-splitting-huge-json-file/m-p/2348702#M115783</guid>
      <dc:creator>Anjali3</dc:creator>
      <dc:date>2022-06-08T03:17:49Z</dc:date>
    </item>
  </channel>
</rss>

