<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Loading data from mysql db to vertica db in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274326#M51038</link>
    <description>If you get data continuously in a MySQL database then the task is a bit difficult to get only the updates. 
&lt;BR /&gt;At first you have to take care the runtime of your job should be short. This can be done by limiting the data what will be processed within one iteration. If you detect the new data with a timestamp - call it last_modified - avoid a design in which you read from the last timestamp until now. Always limit the timestamp range to a reasonable value - say 1hour. 
&lt;BR /&gt;After finishing the time range proceed with the next time range until the time range not ending with now! This way you can be sure you will not miss any new dataset. 
&lt;BR /&gt;Of course you can implement CDC also in the TOS. CDC for MySQL is based on trigger procedures and you can create them for your self. These triggers must write the IDs of the new or updated datasets into a table and you can use this table for a join to get only whats changed.</description>
    <pubDate>Thu, 26 Dec 2013 10:04:44 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-12-26T10:04:44Z</dc:date>
    <item>
      <title>Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274321#M51033</link>
      <description>Hi &lt;BR /&gt;&lt;BR /&gt;Im trying to load data from mysql db which acts as staging db in data warehouse to vertica(data warehouse) db .&lt;BR /&gt;The flow is as follows:&lt;BR /&gt;tmysqlinput----&amp;gt;main----&amp;gt;tmap----&amp;gt;main----&amp;gt;tverticaoutput&lt;BR /&gt;And fetching only new data from mysql table by using left outer join on tables in both db and i have scheduled the job for each one hour.&lt;BR /&gt;I want suggestions or logic's on how to do the following:&lt;BR /&gt;What if there is no new data available what will be done? &lt;BR /&gt;And what if data is coming continuously even while reading? &lt;BR /&gt;How we check in ftp if the file is completely transferred we should have such check on data&lt;BR /&gt;What if job gets stuck ?</description>
      <pubDate>Thu, 26 Dec 2013 06:51:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274321#M51033</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T06:51:41Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274322#M51034</link>
      <description>Hi, &lt;BR /&gt;For your requirement, are you looking for &lt;A href="https://help.talend.com/search/all?query=CDC+architectural+overview&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:Talend CDC architectural overview&lt;/A&gt;&lt;BR /&gt;&lt;A href="https://www.youtube.com/watch?v=6SRNz83Wyo0&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:Change Data Capture CDC is easy with Talend Studio&lt;/A&gt;.&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Thu, 26 Dec 2013 07:28:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274322#M51034</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T07:28:38Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274323#M51035</link>
      <description>Hi &lt;BR /&gt;Im using TOS, how to achieve CDC in this</description>
      <pubDate>Thu, 26 Dec 2013 08:50:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274323#M51035</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T08:50:56Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274324#M51036</link>
      <description>Hi, 
&lt;BR /&gt;So far, CDC is only available in Talend Enterprise Subscription Version. 
&lt;BR /&gt;For community version, actually you can compare two tables to get updated data. 
&lt;BR /&gt;tmysqlinput(target)----&amp;gt;main----&amp;gt;tmap----&amp;gt;main----&amp;gt;tverticaoutput 
&lt;BR /&gt;tmysqlinput(source)----&amp;gt;loopup----&amp;gt; 
&lt;BR /&gt;Make inner join in tMap and set the "Catch lookup inner join reject " as true. The processed data should be updated data. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Thu, 26 Dec 2013 08:59:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274324#M51036</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T08:59:55Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274325#M51037</link>
      <description>hi &lt;BR /&gt;Okay that is one way and is there any triggering options in TOS if new data arrives in any DB so at that time we can run a job to read those data.</description>
      <pubDate>Thu, 26 Dec 2013 09:26:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274325#M51037</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T09:26:58Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274326#M51038</link>
      <description>If you get data continuously in a MySQL database then the task is a bit difficult to get only the updates. 
&lt;BR /&gt;At first you have to take care the runtime of your job should be short. This can be done by limiting the data what will be processed within one iteration. If you detect the new data with a timestamp - call it last_modified - avoid a design in which you read from the last timestamp until now. Always limit the timestamp range to a reasonable value - say 1hour. 
&lt;BR /&gt;After finishing the time range proceed with the next time range until the time range not ending with now! This way you can be sure you will not miss any new dataset. 
&lt;BR /&gt;Of course you can implement CDC also in the TOS. CDC for MySQL is based on trigger procedures and you can create them for your self. These triggers must write the IDs of the new or updated datasets into a table and you can use this table for a join to get only whats changed.</description>
      <pubDate>Thu, 26 Dec 2013 10:04:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274326#M51038</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-26T10:04:44Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data from mysql db to vertica db</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274327#M51039</link>
      <description>hi&lt;BR /&gt;Relating to fetching data from table ,can we make talend job run multiple times on same table to fetch data simultaneously and condition is each job should pick unique data from table and write into file. How can we achieve this in talend tool</description>
      <pubDate>Tue, 31 Dec 2013 06:21:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-from-mysql-db-to-vertica-db/m-p/2274327#M51039</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-12-31T06:21:18Z</dc:date>
    </item>
  </channel>
</rss>

