<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Merging files from tfilelist with different set schema (generate a new set of file) in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Merging-files-from-tfilelist-with-different-set-schema-generate/m-p/2349901#M116818</link>
    <description>&lt;P&gt;Hi @Vinoth Kumar K​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Assuming you have a key column in all those files that could be used for joining them, you can map their schemas to a set of tFileInput* components, connect them to a tMap and configure the join there.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then you can use a set of tFileList components, configure its file name pattern according to each schema and connect them to the corresponding tFileInput* using the Iterate connection.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;By doing this, each lookup input will load into memory all the files matching the name pattern before the main input runs, so when the main input starts it will actually lookup over all the lookup files, not only one of the matching files.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, please keep in mind that this approach might consume a high amount of memory depending on the size and number of files listed on each lookup. One way to optimize this a little bit is keeping the file with larger schema on the main input. Another way is to break this process in smaller ones.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Another thing to consider is that your job design might get poluted if there are many lookups. To avoid this you might consider listing and loading the lookup files to memory first using tHash components.&lt;/P&gt;</description>
    <pubDate>Mon, 19 Jun 2023 01:31:01 GMT</pubDate>
    <dc:creator>anselmopeixoto</dc:creator>
    <dc:date>2023-06-19T01:31:01Z</dc:date>
    <item>
      <title>Merging files from tfilelist with different set schema (generate a new set of file)</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Merging-files-from-tfilelist-with-different-set-schema-generate/m-p/2349900#M116817</link>
      <description>&lt;P&gt;Hi Team , &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a folder with 150 files , it is a mix of different schemas , 2 files will have one schema and another 8 files will have one schema ,and another 1 file will have one schema. to be very clear i have attached a screenshot&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000huycOAAQ.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/134610iD524B83E7EE52486/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000huycOAAQ.png" alt="0695b00000huycOAAQ.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;i need to merge TalendSHPR10WarningsHitosvsSuelosSemanal_0 and &lt;/P&gt;&lt;P&gt;TalendSHPR10WarningsHitosvsSuelosSemanal_1 and finally generate a file TalendSHPR10WarningsHitosvsSuelosSemanal.csv &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;similiar &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;i need to merge&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_0&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_1&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_2&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_3&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_4&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_5&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_6&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TalendOREPRO25BalanceSPCarteras_7&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;﻿TalendOREPRO25BalanceSPCarteras_8&lt;/P&gt;&lt;P&gt;and i need to generate a file &lt;/P&gt;&lt;P&gt;﻿i need to merge TalendOREPRO25BalanceSPCarteras.csv&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;and _0 are invidivual file not to be merge , it has to e generate has "filename".csv&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'm planning to use tfileList , how can i achieve it  &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Vinoth Kumar K.&lt;/P&gt;</description>
      <pubDate>Sat, 17 Jun 2023 12:35:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Merging-files-from-tfilelist-with-different-set-schema-generate/m-p/2349900#M116817</guid>
      <dc:creator>kvinoth19991</dc:creator>
      <dc:date>2023-06-17T12:35:36Z</dc:date>
    </item>
    <item>
      <title>Re: Merging files from tfilelist with different set schema (generate a new set of file)</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Merging-files-from-tfilelist-with-different-set-schema-generate/m-p/2349901#M116818</link>
      <description>&lt;P&gt;Hi @Vinoth Kumar K​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Assuming you have a key column in all those files that could be used for joining them, you can map their schemas to a set of tFileInput* components, connect them to a tMap and configure the join there.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then you can use a set of tFileList components, configure its file name pattern according to each schema and connect them to the corresponding tFileInput* using the Iterate connection.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;By doing this, each lookup input will load into memory all the files matching the name pattern before the main input runs, so when the main input starts it will actually lookup over all the lookup files, not only one of the matching files.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, please keep in mind that this approach might consume a high amount of memory depending on the size and number of files listed on each lookup. One way to optimize this a little bit is keeping the file with larger schema on the main input. Another way is to break this process in smaller ones.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Another thing to consider is that your job design might get poluted if there are many lookups. To avoid this you might consider listing and loading the lookup files to memory first using tHash components.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Jun 2023 01:31:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Merging-files-from-tfilelist-with-different-set-schema-generate/m-p/2349901#M116818</guid>
      <dc:creator>anselmopeixoto</dc:creator>
      <dc:date>2023-06-19T01:31:01Z</dc:date>
    </item>
  </channel>
</rss>

