<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic downloading JSON in structured format in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286328#M59950</link>
    <description>&lt;P&gt;Hi All,&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I'm working on web services and I need to download the JSON response using http get requests and then read those JSON files and parse them.&lt;/P&gt; 
&lt;P&gt;This works out well for smaller files , the issue here is the JSON document which I get is downloaded in a single row ! , so I have 200 mb of data downloaded in a single row .&lt;/P&gt; 
&lt;P&gt;This file is then impossible to open using an editor and then if I try to parse it in talend I get out of memory exception.&lt;/P&gt; 
&lt;P&gt;Is there any work around for this issue ?&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Does JSON response from https get request always have to be in Single row ? can we break it down and do some formatting before saving it on disk ?&lt;/P&gt; 
&lt;P&gt;What is the best way to parse a large JSON file size greater than 200 MB ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 06:30:18 GMT</pubDate>
    <dc:creator>karandama2006</dc:creator>
    <dc:date>2024-11-16T06:30:18Z</dc:date>
    <item>
      <title>downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286328#M59950</link>
      <description>&lt;P&gt;Hi All,&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I'm working on web services and I need to download the JSON response using http get requests and then read those JSON files and parse them.&lt;/P&gt; 
&lt;P&gt;This works out well for smaller files , the issue here is the JSON document which I get is downloaded in a single row ! , so I have 200 mb of data downloaded in a single row .&lt;/P&gt; 
&lt;P&gt;This file is then impossible to open using an editor and then if I try to parse it in talend I get out of memory exception.&lt;/P&gt; 
&lt;P&gt;Is there any work around for this issue ?&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Does JSON response from https get request always have to be in Single row ? can we break it down and do some formatting before saving it on disk ?&lt;/P&gt; 
&lt;P&gt;What is the best way to parse a large JSON file size greater than 200 MB ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 06:30:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286328#M59950</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2024-11-16T06:30:18Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286329#M59951</link>
      <description>&lt;P&gt;The problem here is NOT the formatting. If anything the lack of formatting will save on space. The problem here is the size of data. You may want to find a text editor which handles large files better than the one you are using. I use UEStudio which would easily handle 200MB.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;To read it in Talend, you may want to play with your job and Studio memory settings.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;This link will help you increase the memory for Studio (&lt;A href="https://help.talend.com/reader/pd~gJPOP3y0tu8jedjNfIA/aNyC~EW3eVxmjwAUWgA~pw" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/reader/pd~gJPOP3y0tu8jedjNfIA/aNyC~EW3eVxmjwAUWgA~pw&lt;/A&gt;)&lt;/P&gt; 
&lt;P&gt;This link will help with increasing the memory for your job (&lt;A href="https://community.qlik.com/s/article/ka03p0000006EZuAAM" target="_blank"&gt;https://community.talend.com/t5/Migration-Configuration-and/OutOfMemory-Exception/ta-p/21669&lt;/A&gt;)&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 24 Feb 2019 20:44:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286329#M59951</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-24T20:44:56Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286330#M59952</link>
      <description>&lt;P&gt;+1 to all from&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/005390000069RuGAAU"&gt;@rhall&lt;/A&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;also good to check your API endpoint documentation, many API contain additional settings to reduce response size - split it to chunks&lt;/P&gt; 
&lt;P&gt;example:&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;receive all customers - is very huge JSON&lt;/LI&gt; 
 &lt;LI&gt;receive 100 customers, with offset XXX, it is 100 customers, but many iterations&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;additional + for UltraEdit, it could edit docs of any size, personally, I use it for 15-20Gb files&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2019 00:56:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286330#M59952</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2019-02-25T00:56:03Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286331#M59953</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/005390000069RuGAAU"&gt;@rhall&lt;/A&gt;&amp;nbsp;and&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/005390000067LRbAAM"&gt;@vapukov&lt;/A&gt;&amp;nbsp;for your valuable suggestion.&lt;/P&gt; 
&lt;P&gt;I tried increasing memory of the talend studio and Job with below configurations&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;-vm&lt;BR /&gt;C:\Program Files (x86)\Talend-Studio\jre1.8.0_131\bin\server\jvm.dll&lt;BR /&gt;-vmargs&lt;BR /&gt;-Xms1024m&lt;BR /&gt;-Xmx40966m&lt;BR /&gt;-Dfile.encoding=UTF-8&lt;BR /&gt;-Dosgi.requiredJavaVersion=1.8&lt;BR /&gt;-XX:+UseG1GC&lt;BR /&gt;-XX:+UseStringDeduplication&lt;/PRE&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture1.PNG" style="width: 826px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2f5.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/155628iA76D6B31D0A5115B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2f5.png" alt="0683p000009M2f5.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Gave temporary location in tMap as well&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture3.PNG" style="width: 530px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2lv.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/153516i494B155C6BDD26E9/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2lv.png" alt="0683p000009M2lv.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;But this did not help , now instead of getting out of memory exception , Talend is in a stuck state for 3 hours.&lt;BR /&gt;I'm working on a system which has 8GB RAM&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My Job works&amp;nbsp; for other files which are less than or equal to 10mb and share a similar JSON structure to 208 mb file&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture2.png" style="width: 953px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2m0.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/140473i4D2B2E2965EB2BF9/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2m0.png" alt="0683p000009M2m0.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;T-MAP is not complicated, it is one-to-one mapping with addition of timestamp.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Is there any other way we can get this to work ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2019 12:03:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286331#M59953</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-25T12:03:58Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286332#M59954</link>
      <description>&lt;P&gt;I think I see your problem. You are using XPath instead of JsonPath. What happens when you do this is that the JSON is converted to XML in memory and then the XML is parsed. With a 200MB&amp;nbsp; json file, this will be a massive overhead.&amp;nbsp;If you are working with json, you should try to use JsonPath.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2019 13:13:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286332#M59954</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-25T13:13:14Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286333#M59955</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/005390000069RuGAAU"&gt;@rhall&lt;/A&gt;&amp;nbsp; Thanks for your reply&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I had tried using the other options before but I did not get expected output so I switched to Xpath.&lt;BR /&gt;I tried to parse the JSON again with below configuration with a smaller file, this config is similar to the one I used in Xpath&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture1.JPG" style="width: 868px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2mA.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157195iBB54A1217C376EB3/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2mA.jpg" alt="0683p000009M2mA.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;But I get an Error and Unexpected output&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;The Json resource datas maybe have some problems, please make sure the data structure with the same fields.&lt;BR /&gt;[WARN ]: integration_demo.j_08_transfer_transactional_tr_alloc_fy_0_1.j_08_transfer_transactional_tr_alloc_fy - tFileInputJSON_2 - The Json resource datas maybe have some problems, please make sure the data structure with the same fields.&lt;/PRE&gt; 
&lt;P&gt;Output file does not loop the outermost element of JSON , export id in this case .&lt;BR /&gt;In below screenshot we can see it prints exportid only once , but I need it in all Lines&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture2.JPG" style="width: 653px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2g8.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/152836i7C6E645440868CA2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2g8.jpg" alt="0683p000009M2g8.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;This is small JSON Snippet&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;{
   "ExportSetGuid":"a7480ae9-5045-4e33-b6f5-875cd17e1711",
   "Actions":[
      {
         "ActionId":410766,
         "Allocations":[
            {
               "AllocationTargetOrgUnitId":82373,
               "FiscalYears":[
                  {
                     "FiscalYear":2019,
                     "PercentValue":3.0200
                  }
               ]
            }
         ]
      }
   ]
}&lt;/PRE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2019 15:46:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286333#M59955</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-25T15:46:08Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286334#M59956</link>
      <description>&lt;P&gt;Why are you using JsonPath without loop? Surely you need the loops?&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2019 19:00:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286334#M59956</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-25T19:00:41Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286335#M59957</link>
      <description>&lt;P&gt;JSON path with loops only returns the data which are inside a&amp;nbsp; list (This is what I've found)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I need data from the list as well as the elements which are outside .&lt;/P&gt;</description>
      <pubDate>Tue, 26 Feb 2019 02:32:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286335#M59957</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-26T02:32:26Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286336#M59958</link>
      <description>&lt;P&gt;Are you saying that your 200MB json file does not have any loops in it? If you are extracting looped data, you will need to use JsonPath and not JsonPath without loops.&lt;/P&gt;</description>
      <pubDate>Tue, 26 Feb 2019 09:52:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286336#M59958</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-26T09:52:35Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286337#M59959</link>
      <description>&lt;P&gt;All my file have loops , but if I use JSON Path and then specify a loop , I am able to retrieve only specific data from a Single Loop.&lt;/P&gt;&lt;P&gt;In my JSON there are many loops and I want to be able to retrieve data from all the loops at once in a single line (Xpath allows me to do that as we can see in earlier screenshot)&amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If below is the Json&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;{
   "ExportSetGuid":"a7480ae9-5045-4e33-b6f5-875cd17e1711",
   "Actions":[
      {
         "ActionId":410766,
         "Allocations":[
            {
               "AllocationTargetOrgUnitId":82373,
               "FiscalYears":[
                  {
                     "FiscalYear":2019,
                     "PercentValue":3.0200
                  },
                  {
                     "FiscalYear":2019,
                     "PercentValue":5.0200
                  }

               ]
            }
         ]
      }&lt;BR /&gt;      { "ActionId":410768, &lt;BR /&gt;         "Allocations":[ &lt;BR /&gt;             { "AllocationTargetOrgUnitId":82373,&lt;BR /&gt;               "FiscalYears":&lt;BR /&gt;                [ &lt;BR /&gt;                { "FiscalYear":2019, "PercentValue":3.0200 },&lt;BR /&gt;                { "FiscalYear":2019, "PercentValue":5.0200 }&lt;BR /&gt;                ]&lt;BR /&gt;             }&lt;BR /&gt;          ]&lt;BR /&gt;       }
   ]
}&lt;/PRE&gt;&lt;P&gt;I need output as below&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;a7480ae9-5045-4e33-b6f5-875cd17e1711;410766;82373;2019;3.0200&lt;BR /&gt;a7480ae9-5045-4e33-b6f5-875cd17e1711;410766;82373;2019;5.0200&lt;/P&gt;&lt;P&gt;a7480ae9-5045-4e33-b6f5-875cd17e1711;410768;82373;2019;3.0200&lt;BR /&gt;a7480ae9-5045-4e33-b6f5-875cd17e1711;410768;82373;2019;5.0200&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Tried it many times , unable achieve above result with Json path or Json path without loop , it works only with Xpath&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 26 Feb 2019 10:12:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286337#M59959</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-26T10:12:02Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286338#M59960</link>
      <description>&lt;P&gt;I've looked at your file and the example you gave was not valid JSON. I suspect that is because of how you transferred it and cut it to here though. Before this code....&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;{ "ActionId":410768,&lt;/PRE&gt; 
&lt;P&gt;....you needed a comma. A small thing, but if your source does not provide that, it will not work.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Now, I changed the values of your file to make it easier for me to see whether it was working. I changed your file to this....&lt;/P&gt; 
&lt;PRE&gt;{
   "ExportSetGuid":"a7480ae9-5045-4e33-b6f5-875cd17e1711",
   "Actions":[
      {
         "ActionId":410766,
         "Allocations":[
            {
               "AllocationTargetOrgUnitId":82373,
               "FiscalYears":[
                  {
                     "FiscalYear":2017,
                     "PercentValue":1.0300
                  },
                  {
                     "FiscalYear":2018,
                     "PercentValue":2.0300
                  }

               ]
            }
         ]
      },
      { "ActionId":410768, 
         "Allocations":[ 
             { "AllocationTargetOrgUnitId":82377,
               "FiscalYears":
                [ 
                { "FiscalYear":2020, "PercentValue":3.0600 },
                { "FiscalYear":2021, "PercentValue":5.0900 }
                ]
             }
          ]
       }
   ]
}
&lt;/PRE&gt; 
&lt;P&gt;I then built a job like this.....&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2019-02-26 at 11.32.04.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2dJ.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150089i7041C7778006C473/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2dJ.png" alt="0683p000009M2dJ.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Ignore the deactivated tLogRows. I added those to see what I was getting after each component. This is a really good way of checking you are on the right track. The following screenshots show each of the components from left to right in order (ignoring the tLogRows....&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2019-02-26 at 11.32.21.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2nX.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/155661i5CB57562CBE4748F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2nX.png" alt="0683p000009M2nX.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;First I extract the outermost loop information. The Actions data in the above screen shot is essentially pulling out a json snippet from the bigger json file and sending it forward to be processed.&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2019-02-26 at 11.32.31.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2nc.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/130604iB7ECFFD83E15EEF3/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2nc.png" alt="0683p000009M2nc.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;The above tExtractJsonFields component is passing through the ExportSetGuid value (hence it is left blank in the Json query) and is extracting the ActionId value and the Allocations array json snippet. Notice the Json Field value is set to Actions. Also notice the loop. It is imply looping over the array.&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2019-02-26 at 11.32.41.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2ft.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/129794iB1D1FE1650566762/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2ft.png" alt="0683p000009M2ft.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;The above tExtractJsonFields component is passing through the ExportSetGuid and ActionId values and is extracting the AllocationTargetOrgUnitId value and the FiscalYears array json snippet. Notice the Json Field value is set to Allocations. Also notice the loop. It is imply looping over the array.&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2019-02-26 at 11.32.51.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2Vy.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/130364iC268DC89B563985C/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2Vy.png" alt="0683p000009M2Vy.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;The above tExtractJsonFields component is passing through the ExportSetGuid, ActionId and AllocationTargetOrgUnitId values and is extracting the FiscalYear and PercentValue values. This is the lowest leaf on the tree. Notice the Json Field value is set to FiscalYears. Also notice the loop. It is imply looping over the array.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The output from this was.....&lt;/P&gt; 
&lt;PRE&gt;Starting job ExampleFilteringtXML at 11:28 26/02/2019.

[statistics] connecting to socket on port 3677
[statistics] connected
a7480ae9-5045-4e33-b6f5-875cd17e1711|410766|82373|2017|1.03
a7480ae9-5045-4e33-b6f5-875cd17e1711|410766|82373|2018|2.03
a7480ae9-5045-4e33-b6f5-875cd17e1711|410768|82377|2020|3.06
a7480ae9-5045-4e33-b6f5-875cd17e1711|410768|82377|2021|5.09
[statistics] disconnected

Job ExampleFilteringtXML ended at 11:28 26/02/2019. [exit code=0]&lt;/PRE&gt; 
&lt;P&gt;This is not necessarily a quick and easy way to achieve this, but it is a methodical way to work.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 26 Feb 2019 11:42:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286338#M59960</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-26T11:42:48Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286339#M59961</link>
      <description>&lt;P&gt;Thank you very much&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/005390000069RuGAAU"&gt;@rhall&lt;/A&gt;&amp;nbsp; for such detailed explanation.&lt;/P&gt; 
&lt;P&gt;The approach you suggested is considerably fast for parsing JSON than the Xpath I was using .&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;It also solved another issue of invalid XML unicode character which I was getting in some of the files due to the Xpath approach&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;For the 208 mb file at first I got the heap-space issue&amp;nbsp;(That was immediate didn't have to wait 4 hours to get this error) .&lt;/P&gt; 
&lt;P&gt;I increased the memory of the job to 12 GB (Ran it on a system which actually had more RAM) and then I got the GC overhead issue , found a way to fix that in some other Talend community post which is to add&amp;nbsp; JVM parameter to disable GC overhead limit , and after that everything worked &lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt;&lt;/P&gt; 
&lt;P&gt;File was processed within 2 minutes !!&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture3.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2bI.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/139976iDEFC8C51004CEDBB/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2bI.png" alt="0683p000009M2bI.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks again , really appreciate your help .&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can't imagine how much time I invested to parse this JSON the wrong way , I think this approach can be&amp;nbsp; added as a &lt;STRONG&gt;Scenario&lt;/STRONG&gt; under tfilejsoninput&amp;nbsp; in help.talend.com (This is the first page you land on when you google Talend + JSON)&lt;/P&gt; 
&lt;P&gt;The existing one's there are too basic (If this is not already present somewhere else , as I couldn't find it)&lt;/P&gt;</description>
      <pubDate>Wed, 27 Feb 2019 04:51:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286339#M59961</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-27T04:51:30Z</dc:date>
    </item>
    <item>
      <title>Re: downloading JSON in structured format</title>
      <link>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286340#M59962</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LPDjAAO"&gt;@karandama2006&lt;/A&gt;, I am pleased that this worked for you. I will see if we can get this post reworded and promoted to an article.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Feb 2019 10:01:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/downloading-JSON-in-structured-format/m-p/2286340#M59962</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-28T10:01:56Z</dc:date>
    </item>
  </channel>
</rss>

