<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Integration with Databricks and leveraging Delta Lake in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Integration-with-Databricks-and-leveraging-Delta-Lake/m-p/2270447#M48343</link>
    <description>&lt;P&gt;Hi Luis,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regarding the driver, please download it from the link below and create a tDeltaLakeOutput component.&lt;/P&gt;&lt;P&gt;https://databricks.com/spark/jdbc-drivers-download&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It is also worth reading the article below:&lt;/P&gt;&lt;P&gt;https://help.talend.com/r/en-US/7.3/delta-lake/linking-components-to-design-flow-of-delta-lake-data-in&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you have any further questions, please write here.&lt;/P&gt;</description>
    <pubDate>Sun, 13 Feb 2022 12:08:45 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2022-02-13T12:08:45Z</dc:date>
    <item>
      <title>Integration with Databricks and leveraging Delta Lake</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Integration-with-Databricks-and-leveraging-Delta-Lake/m-p/2270446#M48342</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I am currently trying to understand how to use Talend to ingest CSV files from Azure Data Lake (Gen2) into a staging Delta Lake table on Databricks.&lt;/P&gt;
&lt;P&gt;I am new to Talend and trying to understand all the components that can be used, but to be honest I am overwhelmed by the number of options available.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;From what I have found so far, I think the first step is to configure a tJDBCConfiguration component to connect to the Databricks cluster, but I don't know which driver to install or select, that is, what to configure in the "drivers" option. I tried downloading the Databricks Simba 4.1 driver, but it does not seem to work.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Second, I would like to know whether anyone has already worked with Delta Lake and how an incremental approach could be implemented. Are there any components for that, or should it be scripted?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Any help would be greatly appreciated.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thank you&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Regards&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 03:27:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Integration-with-Databricks-and-leveraging-Delta-Lake/m-p/2270446#M48342</guid>
      <dc:creator>Luis_Simoes</dc:creator>
      <dc:date>2024-11-16T03:27:08Z</dc:date>
    </item>
    <item>
      <title>Re: Integration with Databricks and leveraging Delta Lake</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Integration-with-Databricks-and-leveraging-Delta-Lake/m-p/2270447#M48343</link>
      <description>&lt;P&gt;Hi Luis,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regarding the driver, please download it from the link below and create a tDeltaLakeOutput component.&lt;/P&gt;&lt;P&gt;https://databricks.com/spark/jdbc-drivers-download&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It is also worth reading the article below:&lt;/P&gt;&lt;P&gt;https://help.talend.com/r/en-US/7.3/delta-lake/linking-components-to-design-flow-of-delta-lake-data-in&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you have any further questions, please write here.&lt;/P&gt;</description>
      <pubDate>Sun, 13 Feb 2022 12:08:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Integration-with-Databricks-and-leveraging-Delta-Lake/m-p/2270447#M48343</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-02-13T12:08:45Z</dc:date>
    </item>
  </channel>
</rss>

