<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Change Data Capturing PostgreSQL and Synchronizing in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Change-Data-Capturing-PostgreSQL-and-Synchronizing/m-p/2212135#M9877</link>
    <description>&lt;P&gt;Hello guys,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;my scenario is to have multiple databases with (maybe) different schema and all of the data should be synchronized to one main-database without any big latencies. I would not call it a multi-master-database-architecture since only specific tables are mutable by specific applications in their &lt;EM&gt;master-database&lt;/EM&gt; but it is something into that direction. Now I have some general questions regarding the CDC capabilities of Talend and hope you can give me some answers.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;OL&gt; 
 &lt;LI&gt;Is it true that Talend cannot subscribe to the WAL of PostgreSQL and would instead use the trigger mechanism? I read that in the documentation and kind of hope that the documentation is outdated. 
  &lt;OL&gt; 
   &lt;LI&gt;&lt;A href="https://help.talend.com/reader/O~7WPF1NkGXETRxkSxA~uw/Cb8aTDjcDucIMPHrDFMUHQ" target="_blank" rel="noopener nofollow noopener noreferrer"&gt;https://help.talend.com/reader/O~7WPF1NkGXETRxkSxA~uw/Cb8aTDjcDucIMPHrDFMUHQ&lt;/A&gt;&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
 &lt;LI&gt;The documentation states the following sentence: „When setting up a CDC environment, make sure that the database connection for CDC is on the same server with the source data to which changes are to be captured.“ 
  &lt;OL&gt; 
   &lt;LI&gt;What does "database connection" mean in this context? Can I or can I not host Talend and PostgreSQL on different physical machines and even locations? 
    &lt;OL&gt; 
     &lt;LI&gt;If they can be hosted completely separately: what happens if Talend looses the connection to the source-database? The triggers will fire and no one is there to capture and the data-change-event would be lost?&lt;/LI&gt; 
     &lt;LI&gt;If Talend can make use of the WAL this should not be a problem, since the WAL would wait for the Talend-Listener to reconnect, am I correct with this assumption?&lt;/LI&gt; 
    &lt;/OL&gt; &lt;/LI&gt; 
   &lt;LI&gt;What happens if Talend looses the connection to the &lt;STRONG&gt;target&lt;/STRONG&gt;-database? Will Talend enqueue all changes and wait for a reconnection?&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
 &lt;LI&gt;Can we horizontally scale Talend so every source-database has its own Talend-Instance with the specific CDC mechanism enabled? 
  &lt;OL&gt; 
   &lt;LI&gt;If that is possible I don't see a problem regarding point 2.&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
&lt;/OL&gt; 
&lt;P&gt;I hope someone can answer my questions and I did not post this in the wrong section.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks in advance and have a nice day &lt;span class="lia-unicode-emoji" title=":grinning_face_with_big_eyes:"&gt;😃&lt;/span&gt;&lt;/P&gt; 
&lt;P&gt;Malachi&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 23 Jan 2019 09:55:51 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-01-23T09:55:51Z</dc:date>
    <item>
      <title>Change Data Capturing PostgreSQL and Synchronizing</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Change-Data-Capturing-PostgreSQL-and-Synchronizing/m-p/2212135#M9877</link>
      <description>&lt;P&gt;Hello guys,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;my scenario is to have multiple databases with (maybe) different schema and all of the data should be synchronized to one main-database without any big latencies. I would not call it a multi-master-database-architecture since only specific tables are mutable by specific applications in their &lt;EM&gt;master-database&lt;/EM&gt; but it is something into that direction. Now I have some general questions regarding the CDC capabilities of Talend and hope you can give me some answers.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;OL&gt; 
 &lt;LI&gt;Is it true that Talend cannot subscribe to the WAL of PostgreSQL and would instead use the trigger mechanism? I read that in the documentation and kind of hope that the documentation is outdated. 
  &lt;OL&gt; 
   &lt;LI&gt;&lt;A href="https://help.talend.com/reader/O~7WPF1NkGXETRxkSxA~uw/Cb8aTDjcDucIMPHrDFMUHQ" target="_blank" rel="noopener nofollow noopener noreferrer"&gt;https://help.talend.com/reader/O~7WPF1NkGXETRxkSxA~uw/Cb8aTDjcDucIMPHrDFMUHQ&lt;/A&gt;&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
 &lt;LI&gt;The documentation states the following sentence: „When setting up a CDC environment, make sure that the database connection for CDC is on the same server with the source data to which changes are to be captured.“ 
  &lt;OL&gt; 
   &lt;LI&gt;What does "database connection" mean in this context? Can I or can I not host Talend and PostgreSQL on different physical machines and even locations? 
    &lt;OL&gt; 
     &lt;LI&gt;If they can be hosted completely separately: what happens if Talend looses the connection to the source-database? The triggers will fire and no one is there to capture and the data-change-event would be lost?&lt;/LI&gt; 
     &lt;LI&gt;If Talend can make use of the WAL this should not be a problem, since the WAL would wait for the Talend-Listener to reconnect, am I correct with this assumption?&lt;/LI&gt; 
    &lt;/OL&gt; &lt;/LI&gt; 
   &lt;LI&gt;What happens if Talend looses the connection to the &lt;STRONG&gt;target&lt;/STRONG&gt;-database? Will Talend enqueue all changes and wait for a reconnection?&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
 &lt;LI&gt;Can we horizontally scale Talend so every source-database has its own Talend-Instance with the specific CDC mechanism enabled? 
  &lt;OL&gt; 
   &lt;LI&gt;If that is possible I don't see a problem regarding point 2.&lt;/LI&gt; 
  &lt;/OL&gt; &lt;/LI&gt; 
&lt;/OL&gt; 
&lt;P&gt;I hope someone can answer my questions and I did not post this in the wrong section.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks in advance and have a nice day &lt;span class="lia-unicode-emoji" title=":grinning_face_with_big_eyes:"&gt;😃&lt;/span&gt;&lt;/P&gt; 
&lt;P&gt;Malachi&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Jan 2019 09:55:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Change-Data-Capturing-PostgreSQL-and-Synchronizing/m-p/2212135#M9877</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-23T09:55:51Z</dc:date>
    </item>
    <item>
      <title>Re: Change Data Capturing PostgreSQL and Synchronizing</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Change-Data-Capturing-PostgreSQL-and-Synchronizing/m-p/2212136#M9878</link>
      <description>&lt;P&gt;1. yes, it true - only trigger based replication&lt;/P&gt;&lt;P&gt;2. followed by 1 - triggers for collect changes must be installed on the source database server&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;2.2 what happens if - nothing, it just a tables where triggers store subscribers and data, so reconnect and continue&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;there are several alternative solutions for send PostgreSQL (and not only) CDC to Kafka, then Talend could be used for parse Kafka&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 25 Jan 2019 02:40:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Change-Data-Capturing-PostgreSQL-and-Synchronizing/m-p/2212136#M9878</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2019-01-25T02:40:16Z</dc:date>
    </item>
  </channel>
</rss>

