<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks cluster resizing causes 503 error in Replicate in Qlik Replicate</title>
    <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454993#M11171</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/144275"&gt;@NakulanR&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Thanks for contacting Qlik community forum.&lt;BR /&gt;&lt;BR /&gt;Based on the provided error message "A 503 response was returned but no Retry-After header was provided" means that the target server was temporarily unavailable. This could be due to a number of reasons, such as the server being overloaded or under maintenance.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Can you validate that the there is no connection related issues to your Databricks? Like, uploading csv from another server.&lt;/P&gt;
&lt;P&gt;Can you try pinging the databricks server from the Replicate server and see if anything gets to it?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Here is the explanation for the error. This were returned by the Databricks cluster, This needs to be verified by the Databricks team.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/how-to-fix-intermittent-503-errors-in-10-4-lts/td-p/29667" target="_blank"&gt;https://community.databricks.com/t5/data-engineering/how-to-fix-intermittent-503-errors-in-10-4-lts/td-p/29667&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;/P&gt;
&lt;P&gt;Sachin B&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 22 May 2024 06:01:49 GMT</pubDate>
    <dc:creator>SachinB</dc:creator>
    <dc:date>2024-05-22T06:01:49Z</dc:date>
    <item>
      <title>Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454968#M11168</link>
      <description>&lt;P&gt;Hi Support,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We are seeing an issue where Replicate reports an error with the following message when writing to Databricks (Delta): "RetCode: SQL_ERROR SqlState: 08S01 NativeError: 124 Message: [Simba][Hardy] (124) A 503 response was returned but no Retry-After header was provided. Original error: Unknown".&lt;/P&gt;
&lt;P&gt;The timestamps of these errors match up to timestamps on the Databricks side when the Databricks cluster was being resized as a result of auto-scaling. However, the Databricks (Delta) endpoint limitations don't have any mention of auto-scaling/cluster resizing being unsupported.&lt;/P&gt;
&lt;P&gt;Is this a known issue when using the Databricks (Delta) endpoint with auto-scaling enabled? If so, is there a workaround that can be implemented in Replicate to prevent the error occurring when the Databricks cluster is being resized?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Nak&lt;/P&gt;</description>
      <pubDate>Wed, 22 May 2024 04:33:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454968#M11168</guid>
      <dc:creator>NakulanR</dc:creator>
      <dc:date>2024-05-22T04:33:07Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454985#M11170</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/144275"&gt;@NakulanR&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope below link may help .&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/api/workspace/clusters/resize" target="_blank"&gt;https://docs.databricks.com/api/workspace/clusters/resize&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;BR /&gt;Sushil Kumar&lt;/P&gt;</description>
      <pubDate>Wed, 22 May 2024 05:11:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454985#M11170</guid>
      <dc:creator>SushilKumar</dc:creator>
      <dc:date>2024-05-22T05:11:23Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454993#M11171</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/144275"&gt;@NakulanR&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Thanks for contacting Qlik community forum.&lt;BR /&gt;&lt;BR /&gt;Based on the provided error message "A 503 response was returned but no Retry-After header was provided" means that the target server was temporarily unavailable. This could be due to a number of reasons, such as the server being overloaded or under maintenance.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Can you validate that the there is no connection related issues to your Databricks? Like, uploading csv from another server.&lt;/P&gt;
&lt;P&gt;Can you try pinging the databricks server from the Replicate server and see if anything gets to it?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Here is the explanation for the error. This were returned by the Databricks cluster, This needs to be verified by the Databricks team.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/how-to-fix-intermittent-503-errors-in-10-4-lts/td-p/29667" target="_blank"&gt;https://community.databricks.com/t5/data-engineering/how-to-fix-intermittent-503-errors-in-10-4-lts/td-p/29667&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;/P&gt;
&lt;P&gt;Sachin B&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 22 May 2024 06:01:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2454993#M11171</guid>
      <dc:creator>SachinB</dc:creator>
      <dc:date>2024-05-22T06:01:49Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2455008#M11173</link>
      <description>&lt;P&gt;Hi Sachin,&lt;/P&gt;
&lt;P&gt;The error appears and as a result the endpoint gets disconnected. A few minutes later the endpoint gets reconnected on its own, and the task is back up and running. We are able to determine that this occurs when the auto-scaling resizes the cluster. Testing the connection normally to Databricks yields a successful test connection.&lt;/P&gt;
&lt;P&gt;If this is occurring as a result of some sort of timeout disconnect on Databricks whilst the auto-scaling is happening, would using the loadTimeout or executeTimeout internal parameters be of any use? Or is there a Databricks specific internal parameter that can be used?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Nak&lt;/P&gt;</description>
      <pubDate>Wed, 22 May 2024 06:34:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2455008#M11173</guid>
      <dc:creator>NakulanR</dc:creator>
      <dc:date>2024-05-22T06:34:31Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2455015#M11174</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/144275"&gt;@NakulanR&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;If the connection issues in Databricks are due to auto-scaling, you can increase the wait period for executions by setting the internal parameters &lt;FONT face="Menlo, Monaco, Consolas, Courier New, monospace" color="#c7254e"&gt;&lt;SPAN&gt;loadTimeout&lt;/SPAN&gt;&lt;/FONT&gt;, &lt;CODE&gt;executeTimeout/&lt;/CODE&gt;&lt;CODE&gt;CDCTimeout&lt;/CODE&gt; to 10 times their current values. This adjustment helps prevent timeouts during scaling operations.&lt;BR /&gt;&lt;BR /&gt;Hope this helps.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Sachin B&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 22 May 2024 07:02:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2455015#M11174</guid>
      <dc:creator>SachinB</dc:creator>
      <dc:date>2024-05-22T07:02:00Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks cluster resizing causes 503 error in Replicate</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2504498#M13655</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/141158"&gt;@SachinB&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We're still seeing the same 503 error when using Databricks as a target. The error occurs and then Replicate recovers by itself a few minutes later. This is happening during the full load so the full load needs to be started from the beginning each time.&lt;/P&gt;
&lt;P&gt;On the Databricks end the compute loses a node due to spot instance termination, however this disconnect is temporary. The loadTimeout and executeTimeout values have been increased to 10x their original value as suggested.&lt;/P&gt;
&lt;P&gt;Is there another parameter that can be used to ensure that the full load isn't terminated and doesn't need to be started again?&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;Regards,&lt;/P&gt;
&lt;P&gt;Nak&lt;/P&gt;</description>
      <pubDate>Wed, 05 Feb 2025 01:20:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Databricks-cluster-resizing-causes-503-error-in-Replicate/m-p/2504498#M13655</guid>
      <dc:creator>NakulanR</dc:creator>
      <dc:date>2025-02-05T01:20:19Z</dc:date>
    </item>
  </channel>
</rss>

