<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Performing join in Spark job using JDBC in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230417#M21049</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am trying to read from and write to an SSO-enabled Redshift cluster in a Spark job. I am using the JDBC component because the native Redshift component in the Spark framework does not support SSO Redshift clusters.&lt;/P&gt;
&lt;P&gt;However, I am having trouble performing a join: the job does not recognize the table names when the query in the JDBC input component references more than one table.&lt;/P&gt;
&lt;P&gt;Is there a parameter I am missing? Please help.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Bhushan&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 07:47:21 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T07:47:21Z</dc:date>
    <item>
      <title>Performing join in Spark job using JDBC</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230417#M21049</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am trying to read from and write to an SSO-enabled Redshift cluster in a Spark job. I am using the JDBC component because the native Redshift component in the Spark framework does not support SSO Redshift clusters.&lt;/P&gt;
&lt;P&gt;However, I am having trouble performing a join: the job does not recognize the table names when the query in the JDBC input component references more than one table.&lt;/P&gt;
&lt;P&gt;Is there a parameter I am missing? Please help.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Bhushan&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 07:47:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230417#M21049</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T07:47:21Z</dc:date>
    </item>
    <item>
      <title>Re: Performing join in Spark job using JDBC</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230418#M21050</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;We would appreciate it if you could post a screenshot of your tJDBCInput component settings on the forum.&lt;/P&gt;
&lt;P&gt;Please mask your sensitive data.&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Fri, 24 Aug 2018 10:19:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230418#M21050</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-08-24T10:19:26Z</dc:date>
    </item>
    <item>
      <title>Re: Performing join in Spark job using JDBC</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230419#M21051</link>
      <description>&lt;P&gt;Hi Sabrina,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please find the tJDBCInput component screenshot attached.&lt;/P&gt;
&lt;P&gt;In the SQL query section shown in the screenshot, if we reference more than one table (which a join requires), the job fails with the error "table or view doesn't exist". So in effect we can only query the single table named in the "table_name" field. If we leave the table name field empty, the job also fails.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BR /&gt;&lt;A href="https://community.qlik.com/legacyfs/online/tlnd_dw_files/0683p000009LtTm"&gt;tJDBCInput.PNG&lt;/A&gt;</description>
      <pubDate>Tue, 28 Aug 2018 11:08:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230419#M21051</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-08-28T11:08:06Z</dc:date>
    </item>
    <item>
      <title>Re: Performing join in Spark job using JDBC</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230420#M21052</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;Are you performing a left join on multiple tables? Spark SQL requires every table referenced in a query to be declared beforehand as a DataFrame and registered under a table name before the query can be executed.&lt;/P&gt; 
&lt;P&gt;Something like:&lt;/P&gt; 
&lt;DIV class="code panel" style="border-width: 1px;"&gt; 
 &lt;DIV class="codeContent panelContent"&gt; 
  &lt;PRE class="code-java"&gt;myOrdersTableDataframe.createOrReplaceTempView(&lt;SPAN class="code-quote"&gt;"Orders"&lt;/SPAN&gt;)&lt;/PRE&gt; 
 &lt;/DIV&gt; 
&lt;/DIV&gt; 
&lt;P&gt;Could you please try using two tJDBCInput components, each loading one of the tables involved in the join (Customers and Orders here), followed by a tSqlRow component that performs the left outer join?&lt;BR /&gt; Spark will merge the three steps into one stage at execution time.&lt;/P&gt; 
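&lt;P&gt;For readers who want to see the same pattern outside the Studio, here is a minimal sketch using the PySpark API. It is only an illustration and not the code Talend generates: the table names come from the example above, while the column names (customer_id, name, order_id), the helper names, and the connection parameters are assumptions. Each table is loaded through Spark's generic JDBC source, registered as a temporary view (createOrReplaceTempView, available in Spark 2.0 and later), and only then joined with Spark SQL.&lt;/P&gt; 

```python
# Sketch only: `spark` is expected to be an existing SparkSession.
# Column names and helper names below are hypothetical.

JOIN_SQL = (
    "SELECT c.customer_id, c.name, o.order_id "
    "FROM Customers c "
    "LEFT OUTER JOIN Orders o ON c.customer_id = o.customer_id"
)


def load_table(spark, jdbc_url, table, user, password, driver):
    """Load exactly one table through Spark's generic JDBC data source."""
    return (
        spark.read.format("jdbc")
        .option("url", jdbc_url)
        .option("dbtable", table)  # one table per reader, no join here
        .option("user", user)
        .option("password", password)
        .option("driver", driver)
        .load()
    )


def join_customers_orders(spark, jdbc_url, user, password, driver):
    """Register both tables as temp views, then join them in Spark SQL."""
    customers = load_table(spark, jdbc_url, "customers", user, password, driver)
    orders = load_table(spark, jdbc_url, "orders", user, password, driver)

    # Spark SQL can only resolve names that were registered beforehand.
    customers.createOrReplaceTempView("Customers")
    orders.createOrReplaceTempView("Orders")

    # The join is evaluated by Spark, not sent to the database as one query.
    return spark.sql(JOIN_SQL)
```

&lt;P&gt;With a real SparkSession this would be called as join_customers_orders(spark, url, user, password, driver), where the JDBC URL and driver class are those of your own cluster. The two loads correspond to the two tJDBCInput components and the final spark.sql call corresponds to the tSqlRow step.&lt;/P&gt; 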
&lt;P&gt;There is a Jira work item for this issue:&lt;/P&gt; 
&lt;P&gt;&lt;A title="https://jira.talendforge.org/browse/TBD-5707" href="https://jira.talendforge.org/browse/TBD-5707" target="_self" rel="nofollow noopener noreferrer"&gt;https://jira.talendforge.org/browse/TBD-5707&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;Let us know if this works for you.&lt;/P&gt; 
&lt;P&gt;Best regards&lt;/P&gt; 
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Thu, 06 Sep 2018 10:22:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performing-join-in-Spark-job-using-JDBC/m-p/2230420#M21052</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-09-06T10:22:35Z</dc:date>
    </item>
  </channel>
</rss>

