<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Talend data integration for File operation in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322407#M92355</link>
    <description>I need to use the Talend data integration tool for the following scenario. 
&lt;BR /&gt;There will be a flat file containing 10 million records. We will take each record, apply some business logic, and store the result in a separate file, Data.txt. 
&lt;BR /&gt;Data.txt then contains 10 million records. 
&lt;BR /&gt;There is another file, newdata.txt, containing 10K records. We want to check whether each of these records exists in Data.txt. 
&lt;BR /&gt;Case 1: If the record exists, update it in Data.txt. 
&lt;BR /&gt;Case 2: If the record doesn't exist in Data.txt, insert it as a new record in Data.txt. 
&lt;BR /&gt;I would like help working out how this can be done using Talend data integration. 
&lt;BR /&gt;Please suggest a way to proceed.</description>
    <pubDate>Wed, 04 Sep 2013 10:01:18 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-09-04T10:01:18Z</dc:date>
    <item>
      <title>Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322407#M92355</link>
      <description>I need to use the Talend data integration tool for the following scenario. 
&lt;BR /&gt;There will be a flat file containing 10 million records. We will take each record, apply some business logic, and store the result in a separate file, Data.txt. 
&lt;BR /&gt;Data.txt then contains 10 million records. 
&lt;BR /&gt;There is another file, newdata.txt, containing 10K records. We want to check whether each of these records exists in Data.txt. 
&lt;BR /&gt;Case 1: If the record exists, update it in Data.txt. 
&lt;BR /&gt;Case 2: If the record doesn't exist in Data.txt, insert it as a new record in Data.txt. 
&lt;BR /&gt;I would like help working out how this can be done using Talend data integration. 
&lt;BR /&gt;Please suggest a way to proceed.</description>
      <pubDate>Wed, 04 Sep 2013 10:01:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322407#M92355</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-04T10:01:18Z</dc:date>
    </item>
    <item>
      <title>Re: Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322408#M92356</link>
      <description>The components you'll want to look at are:&lt;BR /&gt;tFileInputDelimited (assuming your files are delimited; there are other variants)&lt;BR /&gt;tFileOutputDelimited&lt;BR /&gt;tMap, to join your data and perform your transformations</description>
      <pubDate>Wed, 04 Sep 2013 14:16:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322408#M92356</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-04T14:16:18Z</dc:date>
    </item>
    <item>
      <title>Re: Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322409#M92357</link>
      <description>Yes, I know I can use these components, but the problem is that my lookup file will be large, so it will be loaded completely into memory when I use the tMap component and will consume a lot of resources. Is there any other way to do this?</description>
      <pubDate>Fri, 06 Sep 2013 12:50:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322409#M92357</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-06T12:50:16Z</dc:date>
    </item>
    <item>
      <title>Re: Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322410#M92358</link>
      <description>&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;Yes, I know I can use these components, but the problem is that my lookup file will be large, so it will be loaded completely into memory when I use the tMap component and will consume a lot of resources. Is there any other way to do this?&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;Ok. But that's not what you asked.
&lt;BR /&gt;If you've got enough memory, then increase heap and do just that.
&lt;BR /&gt;tMap has join options so that you can "Store temp data" to disk. It will be slower but will conserve memory.</description>
      <pubDate>Fri, 06 Sep 2013 14:31:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322410#M92358</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-06T14:31:22Z</dc:date>
    </item>
    <item>
      <title>Re: Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322411#M92359</link>
      <description>Hi all,
&lt;BR /&gt;I think using a text file is the most optimistic way to search data, because there are no indexes.
&lt;BR /&gt;How about a table?
&lt;BR /&gt;Regards,
&lt;BR /&gt;Laurent</description>
      <pubDate>Fri, 06 Sep 2013 15:10:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322411#M92359</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-06T15:10:35Z</dc:date>
    </item>
    <item>
      <title>Re: Talend data integration for File operation</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322412#M92360</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Hi all,&lt;BR /&gt;I think using a text file is the most optimistic way to search data, because there are no indexes.&lt;BR /&gt;How about a table?&lt;BR /&gt;Regards,&lt;BR /&gt;Laurent&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;I think it depends on the use case. 
&lt;BR /&gt;If you have two text files and you want to join them once, then I think it is perfectly acceptable, although in the case of Talend, it can be memory hungry. 
&lt;BR /&gt;There may be no indexes, but there also aren't all the other overheads of an RDBMS. 
&lt;BR /&gt;In this particular case, I can see that performing a lookup against 10M rows requires some thought. 
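&lt;BR /&gt;To make the update-or-insert logic concrete, here is a rough sketch in plain Python rather than Talend. It keeps only the 10K-record file in memory and streams the 10M-record file row by row. Everything specific in it is an assumption for illustration: the function name upsert_file, the ";" delimiter, and the idea that the first column is the record key. None of that comes from the original question.

```python
import csv
import os
import tempfile


def upsert_file(data_path, new_path, key_index=0, delimiter=";"):
    """Merge new_path into data_path; return (updated, inserted) counts.

    Hypothetical helper: the key column and delimiter are assumptions.
    """
    # Load only the small file into memory, keyed on the assumed key column.
    # If a key appears twice in the small file, the last row wins.
    with open(new_path, newline="") as new_file:
        reader = csv.reader(new_file, delimiter=delimiter)
        pending = {row[key_index]: row for row in reader if row}

    updated = 0
    inserted = 0

    # Write to a temp file next to the target, then swap it in at the end.
    out_dir = os.path.dirname(os.path.abspath(data_path))
    fd, tmp_path = tempfile.mkstemp(dir=out_dir, suffix=".tmp")
    with os.fdopen(fd, "w", newline="") as out_file, \
            open(data_path, newline="") as data_file:
        writer = csv.writer(out_file, delimiter=delimiter)

        # Stream the large file one row at a time; skip blank lines.
        for row in csv.reader(data_file, delimiter=delimiter):
            if not row:
                continue
            replacement = pending.pop(row[key_index], None)
            if replacement is not None:
                # Case 1: the key exists, so write the new version.
                writer.writerow(replacement)
                updated += 1
            else:
                writer.writerow(row)

        # Case 2: keys that never matched are appended as new records.
        for row in pending.values():
            writer.writerow(row)
            inserted += 1

    os.replace(tmp_path, data_path)
    return updated, inserted
```

Calling upsert_file("Data.txt", "newdata.txt") would rewrite Data.txt in place and report how many records were updated and inserted. Memory use is driven by the small file, at the cost of rewriting the whole large file on every run.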
&lt;BR /&gt;Without knowing the source of these files, how often they change, and how often the Job runs, it's difficult to recommend loading them into a database.</description>
      <pubDate>Sat, 07 Sep 2013 09:35:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-data-integration-for-File-operation/m-p/2322412#M92360</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-09-07T09:35:41Z</dc:date>
    </item>
  </channel>
</rss>

