<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Best possible ways to read Delta format Files and insert into a synapse table in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Best-possible-ways-to-read-Delta-format-Files-and-insert-into-a/m-p/2344590#M112227</link>
    <description>&lt;P&gt;Hello there,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My objective is to read data from Azure Delta (tables/files) and INSERT the date into a Azure synapse table through Talend (using Dynamic Schema). I'm trying to explore all the possible options available.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've tried multiple options but had different issues.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;a. Tried to read data using DeltaLake component (jdbc) and INSERT data using synapseOutput component.&lt;/P&gt;&lt;P&gt; synapseOutput component doesn't support dynamic datatype and if I select standard schema, the INSERTS are really slow (5 rows/sec)&lt;/P&gt;&lt;P&gt; Tried to use synapseBulk, which also doesn't support dynamic.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;b. Tried to read data using Deltalake component (jdbc) and INSERT data using tDBOutput (Synapse via jdbc)&lt;/P&gt;&lt;P&gt; tDBOutput &lt;/P&gt;&lt;P&gt;(Synapse via jdbc)&lt;/P&gt;&lt;P&gt; component INSERT speeds are really slow (same a synapse component - 5 rows/sec)&lt;/P&gt;&lt;P&gt; Tried using BULK components, they are failing with weird errors!! (for which I have raise a different topic)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;c. Current temporary working approach ; Reading &lt;/P&gt;&lt;P&gt;data using Deltalake component (jdbc) and write a csv file on gen2 storage &lt;/P&gt;&lt;P&gt; account, using COPY INTO read that csv back and INSERT into synapse table.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Though option C is working, it isn't viable as option A and B.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to know if there any other options available, which I may have overlooked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Tickets I've raised :&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.talend.com/s/question/0D55b000065cA4TCAU/issue-with-tazuresynapseoutput-and-tdboutput" alt="https://community.talend.com/s/question/0D55b000065cA4TCAU/issue-with-tazuresynapseoutput-and-tdboutput" target="_blank"&gt;Issue with tAzureSynapseOutput and tDBOutput (talend.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.talend.com/s/question/0D55b000065cAWXCA2/issue-with-tdboutputbulk-tdbbulkexec-tdboutputbulkexec" alt="https://community.talend.com/s/question/0D55b000065cAWXCA2/issue-with-tdboutputbulk-tdbbulkexec-tdboutputbulkexec" target="_blank"&gt;Issue with tDBOutputBulk, tDBBulkExec &amp;amp; tDBOutputBulkExec (talend.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind Regards&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; &lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 00:07:42 GMT</pubDate>
    <dc:creator>vbadri</dc:creator>
    <dc:date>2024-11-16T00:07:42Z</dc:date>
    <item>
      <title>Best possible ways to read Delta format Files and insert into a synapse table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Best-possible-ways-to-read-Delta-format-Files-and-insert-into-a/m-p/2344590#M112227</link>
      <description>&lt;P&gt;Hello there,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My objective is to read data from Azure Delta (tables/files) and INSERT the date into a Azure synapse table through Talend (using Dynamic Schema). I'm trying to explore all the possible options available.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've tried multiple options but had different issues.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;a. Tried to read data using DeltaLake component (jdbc) and INSERT data using synapseOutput component.&lt;/P&gt;&lt;P&gt; synapseOutput component doesn't support dynamic datatype and if I select standard schema, the INSERTS are really slow (5 rows/sec)&lt;/P&gt;&lt;P&gt; Tried to use synapseBulk, which also doesn't support dynamic.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;b. Tried to read data using Deltalake component (jdbc) and INSERT data using tDBOutput (Synapse via jdbc)&lt;/P&gt;&lt;P&gt; tDBOutput &lt;/P&gt;&lt;P&gt;(Synapse via jdbc)&lt;/P&gt;&lt;P&gt; component INSERT speeds are really slow (same a synapse component - 5 rows/sec)&lt;/P&gt;&lt;P&gt; Tried using BULK components, they are failing with weird errors!! (for which I have raise a different topic)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;c. Current temporary working approach ; Reading &lt;/P&gt;&lt;P&gt;data using Deltalake component (jdbc) and write a csv file on gen2 storage &lt;/P&gt;&lt;P&gt; account, using COPY INTO read that csv back and INSERT into synapse table.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Though option C is working, it isn't viable as option A and B.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to know if there any other options available, which I may have overlooked.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Tickets I've raised :&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.talend.com/s/question/0D55b000065cA4TCAU/issue-with-tazuresynapseoutput-and-tdboutput" alt="https://community.talend.com/s/question/0D55b000065cA4TCAU/issue-with-tazuresynapseoutput-and-tdboutput" target="_blank"&gt;Issue with tAzureSynapseOutput and tDBOutput (talend.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.talend.com/s/question/0D55b000065cAWXCA2/issue-with-tdboutputbulk-tdbbulkexec-tdboutputbulkexec" alt="https://community.talend.com/s/question/0D55b000065cAWXCA2/issue-with-tdboutputbulk-tdbbulkexec-tdboutputbulkexec" target="_blank"&gt;Issue with tDBOutputBulk, tDBBulkExec &amp;amp; tDBOutputBulkExec (talend.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind Regards&lt;/P&gt;&lt;P&gt; &lt;/P&gt;&lt;P&gt; &lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 00:07:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Best-possible-ways-to-read-Delta-format-Files-and-insert-into-a/m-p/2344590#M112227</guid>
      <dc:creator>vbadri</dc:creator>
      <dc:date>2024-11-16T00:07:42Z</dc:date>
    </item>
    <item>
      <title>Re: Best possible ways to read Delta format Files and insert into a synapse table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Best-possible-ways-to-read-Delta-format-Files-and-insert-into-a/m-p/2344591#M112228</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;When the customer wants to load large, we suggest they use tAzureSynapseBulkExec component.&lt;/P&gt;&lt;P&gt;It supports 2 ways to load large data into Azure synapse:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Ploybase : &lt;A href="https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/data-loading-best-practices" alt="https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/data-loading-best-practices" target="_blank"&gt;https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/data-loading-best-practices&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;Copy: &lt;A href="https://docs.microsoft.com/en-us/sql/t-sql/statements/copy-into-transact-sql?view=azure-sqldw-latest" alt="https://docs.microsoft.com/en-us/sql/t-sql/statements/copy-into-transact-sql?view=azure-sqldw-latest" target="_blank"&gt;https://docs.microsoft.com/en-us/sql/t-sql/statements/copy-into-transact-sql?view=azure-sqldw-latest&lt;/A&gt; &lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2022 03:02:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Best-possible-ways-to-read-Delta-format-Files-and-insert-into-a/m-p/2344591#M112228</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-04-27T03:02:03Z</dc:date>
    </item>
  </channel>
</rss>

