<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Optimize data loading into Database in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Optimize-data-loading-into-Database/m-p/2287568#M61141</link>
    <description>&lt;P&gt;I would like to optimize my data loading strategy with Talend.  My scenario is as follows.&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; I am doing extraction and Transformation of data using Talend and generating files with suffix as time( yyyy-MM-dd_HH:mm:ss) because my collection and transformation frequency is in range of minutes ( 5, 10, 20, 30) for different flows of data. Currently i am having same frequency of data loading as it is for data extraction and transformation which is generating small size files and DB remains loaded. So i prefer to do loading every 3 hours or may be different but i am out of ideas how to play with small files generated time based by Extraction &amp;amp; Transformation. &lt;/P&gt;&lt;P&gt;For example &lt;/P&gt;&lt;P&gt;I could use append with intFileoutputDelimited component with suffix as (yyyy-MM-dd-HH) so that one file gets generated every hour but i am not sure how to ensure that  :&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;My Talend loader which should run every 3 hours should only process past all hours files which have been completed and Not process the current hour file.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I also thought of renaming the files currently generating or to keep them to another directory but not sure how to do it fetch them on time basis?&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; I would appreciate if anyone can help on this.&lt;/P&gt;</description>
    <pubDate>Mon, 24 Aug 2020 18:23:46 GMT</pubDate>
    <dc:creator>MKAPOOR1596038160</dc:creator>
    <dc:date>2020-08-24T18:23:46Z</dc:date>
    <item>
      <title>Optimize data loading into Database</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Optimize-data-loading-into-Database/m-p/2287568#M61141</link>
      <description>&lt;P&gt;I would like to optimize my data loading strategy with Talend.  My scenario is as follows.&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; I am doing extraction and Transformation of data using Talend and generating files with suffix as time( yyyy-MM-dd_HH:mm:ss) because my collection and transformation frequency is in range of minutes ( 5, 10, 20, 30) for different flows of data. Currently i am having same frequency of data loading as it is for data extraction and transformation which is generating small size files and DB remains loaded. So i prefer to do loading every 3 hours or may be different but i am out of ideas how to play with small files generated time based by Extraction &amp;amp; Transformation. &lt;/P&gt;&lt;P&gt;For example &lt;/P&gt;&lt;P&gt;I could use append with intFileoutputDelimited component with suffix as (yyyy-MM-dd-HH) so that one file gets generated every hour but i am not sure how to ensure that  :&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;My Talend loader which should run every 3 hours should only process past all hours files which have been completed and Not process the current hour file.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I also thought of renaming the files currently generating or to keep them to another directory but not sure how to do it fetch them on time basis?&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; I would appreciate if anyone can help on this.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Aug 2020 18:23:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Optimize-data-loading-into-Database/m-p/2287568#M61141</guid>
      <dc:creator>MKAPOOR1596038160</dc:creator>
      <dc:date>2020-08-24T18:23:46Z</dc:date>
    </item>
  </channel>
</rss>

