<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: 26 billion full load in Qlik Replicate</title>
    <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2485043#M12873</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/307123"&gt;@sreaney89&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;In the past, we had a feature that allowed us to resume loading from a specific record. However, we found this feature to be impractical. &lt;BR /&gt;&lt;BR /&gt;To resume from a processed record, we need to query the records in order, such as by primary key or unique index.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;If a table contains many records, using "ORDER BY" can place a significant load on the system due to the need to sort the records. Therefore, as&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/121014"&gt;@Dana_Baldwin&lt;/a&gt;&amp;nbsp;mentioned, we avoid using "ORDER BY" for performance reasons.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Desmond&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 03 Oct 2024 07:21:07 GMT</pubDate>
    <dc:creator>DesmondWOO</dc:creator>
    <dc:date>2024-10-03T07:21:07Z</dc:date>
    <item>
      <title>26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2482695#M12792</link>
      <description>&lt;P&gt;Hi all -&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am wondering if I could gather some recommendations for a full load. I appear to be getting disconnected from the DB, which restarts the full load each time. I haven't set the logs to verbose yet to see what's happening, but I do know the tables being loaded contain tens of billions of records.&lt;BR /&gt;&lt;BR /&gt;Given that record count, is there something obvious I need to set to stop this from happening? Note that the load is taking about 5 hours per billion records.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2024 08:57:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2482695#M12792</guid>
      <dc:creator>sreaney89</dc:creator>
      <dc:date>2024-09-20T08:57:17Z</dc:date>
    </item>
    <item>
      <title>Re: 26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2482754#M12794</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/307123"&gt;@sreaney89&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to Qlik Community!&lt;/P&gt;
&lt;P&gt;There are several issues with the task that need to be addressed:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Improper Task Settings&lt;/STRONG&gt;:&lt;BR /&gt;Please disable &lt;EM&gt;Apply Changes Processing&lt;/EM&gt; and keep only &lt;EM&gt;Store Changes Processing&lt;/EM&gt; enabled. This may resolve some configuration-related errors.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="john_wang_1-1726829368082.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/171969iBA4FF1AF722F969F/image-size/large?v=v2&amp;amp;px=999" role="button" title="john_wang_1-1726829368082.png" alt="john_wang_1-1726829368082.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Error: "WAL reader terminated with broken connection / recoverable error. WAL stream loop ended abnormally"&lt;/STRONG&gt;:&lt;BR /&gt;This error is causing the task to stop and attempt auto-recovery. The root cause is likely network-related—potential issues include an unstable connection, connection timeout, firewall rules closing inactive connections, server settings, or resource constraints on the server. To mitigate this, please enable the &lt;EM&gt;WAL heartbeat&lt;/EM&gt; on the PostgreSQL source endpoint to check if it improves stability.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="john_wang_2-1726829849415.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/171975i7E8B6C5A097767B2/image-size/large?v=v2&amp;amp;px=999" role="button" title="john_wang_2-1726829849415.png" alt="john_wang_2-1726829849415.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3&gt;Impact on Full Load Performance:&lt;/H3&gt;
&lt;P&gt;These errors are negatively affecting the full load performance and may lead to the full load stopping and restarting during recovery.&lt;/P&gt;
&lt;P&gt;Current load time: &lt;STRONG&gt;5 hours per billion records&lt;/STRONG&gt;.&lt;/P&gt;
&lt;P&gt;To improve performance:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Enabling &lt;A title="Parallel Load" href="https://help.qlik.com/en-US/replicate/May2024/Content/Global_Common/Content/SharedEMReplicate/Customize%20Tasks/Parallel_Load.htm#:~:text=Table%20Settings-,Parallel%20Load,be%20segmented%20by%20data%20ranges%2C%20by%20partitions%2C%20or%20by%20sub%2Dpartitions.,-Supported%20endpoints" target="_blank" rel="noopener"&gt;&lt;EM&gt;Parallel Load&lt;/EM&gt;&lt;/A&gt; can significantly reduce full load times. However, if the Replicate server and the source/target databases are not on the same intranet, bandwidth limitations could become a bottleneck.&lt;/LI&gt;
&lt;LI&gt;Engage &lt;A title="PS engaged" href="https://community.qlik.com/t5/Official-Support-Articles/How-and-when-to-contact-Qlik-s-Professional-Services-and/tac-p/2481770#M14579" target="_blank" rel="noopener"&gt;Professional Services (PS)&lt;/A&gt; if you need help.&lt;/LI&gt;
&lt;/UL&gt;
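&lt;P&gt;As a rough illustration of the numbers above, the sketch below estimates the total load time at the reported rate and computes evenly spaced boundaries for splitting a table into data ranges. It assumes the thread title's 26 billion is the total record count, that the table has a numeric key with roughly uniform values, and that throughput scales linearly with the number of segments, which is an idealization that bandwidth or source limits may break. The function names are hypothetical and are not part of Replicate.&lt;/P&gt;

```python
import math

HOURS_PER_BILLION = 5.0          # rate reported in this thread
TOTAL_RECORDS = 26_000_000_000   # assumption: total taken from the thread title


def estimated_hours(records, hours_per_billion=HOURS_PER_BILLION, segments=1):
    """Idealized estimate: assumes throughput scales linearly with segment count."""
    return records / 1_000_000_000 * hours_per_billion / segments


def range_boundaries(min_key, max_key, segments):
    """Upper boundary values splitting [min_key, max_key] into equal-width ranges.

    Returns segments - 1 values; the last range holds every key above the final boundary.
    """
    width = math.ceil((max_key - min_key + 1) / segments)
    return [min_key + width * i - 1 for i in range(1, segments)]


if __name__ == "__main__":
    print("single stream:", estimated_hours(TOTAL_RECORDS), "hours")
    print("8 segments (ideal):", estimated_hours(TOTAL_RECORDS, segments=8), "hours")
    print("boundaries:", range_boundaries(1, TOTAL_RECORDS, 8))
```

&lt;P&gt;With a skewed key distribution, equal-width ranges would hold very different row counts, so boundaries based on actual key statistics may work better.&lt;/P&gt;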
&lt;P&gt;Hope this helps.&lt;/P&gt;
&lt;P&gt;John.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Sep 2024 11:10:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2482754#M12794</guid>
      <dc:creator>john_wang</dc:creator>
      <dc:date>2024-09-20T11:10:38Z</dc:date>
    </item>
    <item>
      <title>Re: 26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2483257#M12818</link>
      <description>&lt;P&gt;Just another quick question - if I start and stop the full load task does it restart from the beginning and attempt to load the entire table? Something I've noticed as well is that some of the estimates its generating are way off.. in the millions rather than billions of records.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Sep 2024 13:52:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2483257#M12818</guid>
      <dc:creator>sreaney89</dc:creator>
      <dc:date>2024-09-24T13:52:25Z</dc:date>
    </item>
    <item>
      <title>Re: 26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2483260#M12819</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/307123"&gt;@sreaney89&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you stop a task while the full load is in progress, stopping and starting it will initiate a fresh reload.&lt;/P&gt;
&lt;P&gt;Regards&lt;/P&gt;
&lt;P&gt;Arun&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 24 Sep 2024 13:52:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2483260#M12819</guid>
      <dc:creator>aarun_arasu</dc:creator>
      <dc:date>2024-09-24T13:52:56Z</dc:date>
    </item>
    <item>
      <title>Re: 26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2484952#M12870</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/307123"&gt;@sreaney89&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To add to&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/165259"&gt;@aarun_arasu&lt;/a&gt;&amp;nbsp;'s post, we run a simple "select *" query against the source rather than a "select * order by" for performance reasons. Because the rows arrive in no guaranteed order, there is no way to track where a full load left off in order to resume it. Resuming could also leave out records if some were inserted after the task stopped and before it was resumed.&lt;/P&gt;
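&lt;P&gt;To illustrate why resuming depends on ordering, here is a minimal sketch of a checkpointed load that reads rows in primary-key order. This is not how Replicate performs a full load; it only shows the kind of ordered query a resume would need. It uses an in-memory SQLite table, and the names "src", "id", "payload" and both functions are hypothetical.&lt;/P&gt;

```python
import sqlite3

# Hypothetical source table; "src", "id" and "payload" are illustration names.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE src (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany(
    "INSERT INTO src VALUES (?, ?)", [(i, f"row-{i}") for i in range(1, 11)]
)


def load_batch(conn, last_key, batch_size):
    """Fetch the next batch strictly after last_key, in primary-key order."""
    return conn.execute(
        "SELECT id, payload FROM src WHERE id > ? ORDER BY id LIMIT ?",
        (last_key, batch_size),
    ).fetchall()


def resumable_load(conn, start_after=0, batch_size=4, fail_after_batches=None):
    """Copy rows in key order; return (rows_loaded, checkpoint)."""
    loaded, checkpoint, batches = [], start_after, 0
    while True:
        if fail_after_batches is not None and batches == fail_after_batches:
            return loaded, checkpoint  # simulated disconnect
        batch = load_batch(conn, checkpoint, batch_size)
        if not batch:
            return loaded, checkpoint  # no rows left
        loaded.extend(batch)
        checkpoint = batch[-1][0]  # highest key seen so far
        batches += 1


# The first run "disconnects" after one batch; the second resumes from the checkpoint.
first, checkpoint = resumable_load(conn, fail_after_batches=1)
rest, _ = resumable_load(conn, start_after=checkpoint)
```

&lt;P&gt;The checkpoint only has meaning because every batch is ordered by the key. With a plain "select *" there is no "last key" to resume from, and on a very large table the ordered query is the cost the thread describes.&lt;/P&gt;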
&lt;P&gt;I hope this helps!&lt;/P&gt;
&lt;P&gt;Dana&lt;/P&gt;</description>
      <pubDate>Wed, 02 Oct 2024 17:21:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2484952#M12870</guid>
      <dc:creator>Dana_Baldwin</dc:creator>
      <dc:date>2024-10-02T17:21:17Z</dc:date>
    </item>
    <item>
      <title>Re: 26 billion full load</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2485043#M12873</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/307123"&gt;@sreaney89&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;In the past, we had a feature that allowed us to resume loading from a specific record. However, we found this feature to be impractical. &lt;BR /&gt;&lt;BR /&gt;To resume from a processed record, we need to query the records in order, such as by primary key or unique index.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;If a table contains many records, using "ORDER BY" can place a significant load on the system due to the need to sort the records. Therefore, as&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/121014"&gt;@Dana_Baldwin&lt;/a&gt;&amp;nbsp;mentioned, we avoid using "ORDER BY" for performance reasons.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Desmond&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 03 Oct 2024 07:21:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/26-billion-full-load/m-p/2485043#M12873</guid>
      <dc:creator>DesmondWOO</dc:creator>
      <dc:date>2024-10-03T07:21:07Z</dc:date>
    </item>
  </channel>
</rss>

