<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Oracle Dynamic flow to handle 400 tables in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366467#M129952</link>
    <description>I guess you can eliminate a couple of these steps by using Sqoop. Talend Big Data does support Sqoop (I am not sure which flavor of Talend you are using).
&lt;BR /&gt;I have not used dynamic schema, so I am not sure about it. Can you please share a sample job for it?
&lt;BR /&gt;For loading data from file to Hive, I guess your approach is correct. You can also load multiple tables in parallel. Since the structure of the tables is similar to what is present in the files, you do not need any validation either. Your approach is pretty much on target here.</description>
    <pubDate>Thu, 27 Oct 2016 05:01:28 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-10-27T05:01:28Z</dc:date>
    <item>
      <title>Oracle Dynamic flow to handle 400 tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366464#M129949</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;I am struggling to design a job and am looking for your expertise and for how you have handled similar work in your projects.&lt;BR /&gt;Source: Oracle DB (405 tables)&lt;BR /&gt;Target: Hive&lt;BR /&gt;I need a dynamic flow to handle 400 tables with a minimum number of jobs.&lt;BR /&gt;Transformations involved:&lt;BR /&gt;1. Around 300 tables are full load&lt;BR /&gt;2. 100 tables are incremental load&lt;BR /&gt;- 12 tables need data masking functionality&lt;BR /&gt;I really do not want to create 400+ jobs to tackle this situation.&lt;BR /&gt;&lt;BR /&gt;Any ideas?&lt;BR /&gt;Thanks,&lt;BR /&gt;SK&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 10:17:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366464#M129949</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:17:56Z</dc:date>
    </item>
    <item>
      <title>Re: Oracle Dynamic flow to handle 400 tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366465#M129950</link>
      <description>I am in the same boat, but I need to load flat files into Hive.&lt;BR /&gt;For Oracle to Hive, you can use the Oracle information tables and the Hive information tables to create the Sqoop commands dynamically. This should help you.</description>
      <pubDate>Wed, 26 Oct 2016 06:08:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366465#M129950</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-10-26T06:08:56Z</dc:date>
    </item>
    <item>
      <title>Re: Oracle Dynamic flow to handle 400 tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366466#M129951</link>
      <description>&lt;B&gt;I was thinking of doing it this way:&lt;/B&gt;
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;
&lt;B&gt;Flow&lt;/B&gt; :
&lt;BR /&gt;Oracle Table &amp;gt; &amp;nbsp;Local File &amp;gt; &amp;nbsp;HDFS &amp;nbsp;&amp;gt; Hive
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;
&lt;B&gt;1. &lt;U&gt;Oracle DB &amp;gt; Local File&lt;/U&gt;&lt;/B&gt;
&lt;BR /&gt;
&lt;B&gt;Tables that need no transformation:&lt;/B&gt; The &lt;U&gt;Dynamic Schema&lt;/U&gt; feature can be used.
&lt;BR /&gt;
&lt;B&gt;Tables that need data masking:&lt;/B&gt; These need a one-to-one flow. No dynamic approach is possible, because the particular columns to mask need to be specified.
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;
&lt;B&gt;2. &lt;U&gt;Local File &amp;gt; HDFS&lt;/U&gt;&lt;/B&gt;
&lt;BR /&gt;The path of the local file can be passed dynamically by storing it in a context variable.
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;
&lt;B&gt;3. &lt;U&gt;HDFS &amp;gt; Hive&lt;/U&gt;&lt;/B&gt;
&lt;BR /&gt;a. A master table containing the column details, or the SQL/DDL query itself, needs to be created.
&lt;BR /&gt;b. Talend will read this master table.
&lt;BR /&gt;c. A Hive table will be created for each table listed in the master table, with the help of the DDL query.
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;From there, Talend iterates over the master table one row at a time, each row containing a table name and the SQL query.
&lt;BR /&gt;The tHiveCreateTable component will read the table names and the queries one by one and execute them.
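&lt;BR /&gt;As a rough sketch of that iteration (plain Python, not a Talend job), the DDL for each master-table row could be built as shown here. The table names, column types, field delimiter and the /data/landing base path are made-up assumptions for illustration only.

```python
# Hypothetical master-table rows: each row names a table and its columns.
# In the flow above these rows would come from the master table that Talend reads.
MASTER_ROWS = [
    {"table": "customers", "columns": [("id", "BIGINT"), ("name", "STRING")]},
    {"table": "orders", "columns": [("order_id", "BIGINT"), ("amount", "DECIMAL(18,2)")]},
]


def build_hive_ddl(row, hdfs_base="/data/landing"):
    """Return a CREATE EXTERNAL TABLE statement for one master-table row.

    hdfs_base is an assumed HDFS directory that holds one sub-directory per table.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in row["columns"])
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {row['table']} ({cols}) "
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' "
        f"STORED AS TEXTFILE LOCATION '{hdfs_base}/{row['table']}'"
    )


def build_all(rows):
    """Build one DDL statement per row, keyed by table name."""
    return {row["table"]: build_hive_ddl(row) for row in rows}


if __name__ == "__main__":
    # Print each statement; a real job would hand it to Hive instead.
    for table, ddl in build_all(MASTER_ROWS).items():
        print(table, "->", ddl)
```

&lt;BR /&gt;Each generated statement is what would be handed to Hive for one row of the master table.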
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;
&lt;B&gt;Challenges of this strategy:&lt;/B&gt;
&lt;BR /&gt;Talend's dynamic feature works only up to the creation of the file from the DB. For the Hive table creation, a rather static master table has to be referenced, which cannot accommodate changes on the fly the way Talend's dynamic feature does.
&lt;BR /&gt;&amp;nbsp;
&lt;BR /&gt;Do you have a better idea for handling our situation?
&lt;BR /&gt;BR,
&lt;BR /&gt;SK
&lt;BR /&gt;&amp;nbsp;</description>
      <pubDate>Wed, 26 Oct 2016 23:32:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366466#M129951</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-10-26T23:32:51Z</dc:date>
    </item>
    <item>
      <title>Re: Oracle Dynamic flow to handle 400 tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366467#M129952</link>
      <description>I guess you can eliminate a couple of these steps by using Sqoop. Talend Big Data does support Sqoop (I am not sure which flavor of Talend you are using).
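&lt;BR /&gt;As a rough illustration of the Sqoop route, here is a minimal Python sketch that builds one sqoop import argument list per table. The function name, the JDBC URL, the user, the password-file path and the table and column names are all hypothetical. Whether --incremental can be combined with --hive-import depends on the Sqoop version, so please check that before relying on it.

```python
def build_sqoop_import(table, jdbc_url, username, password_file,
                       hive_db="default", check_column=None,
                       last_value=None, mappers=4):
    """Build the argument list for one 'sqoop import' run (illustrative only)."""
    args = [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,
        "--hive-import",
        "--hive-table", f"{hive_db}.{table.lower()}",
        "--num-mappers", str(mappers),
    ]
    # Incremental tables get the extra arguments; full-load tables do not.
    if check_column is not None:
        args += [
            "--incremental", "append",
            "--check-column", check_column,
            "--last-value", str(last_value),
        ]
    return args


if __name__ == "__main__":
    # Hypothetical connection details, for illustration only.
    cmd = build_sqoop_import(
        "EMP", "jdbc:oracle:thin:@//dbhost:1521/ORCL", "etl", "/user/etl/.pw",
        check_column="EMP_ID", last_value=1000,
    )
    print(" ".join(cmd))
```

&lt;BR /&gt;The list of tables and the incremental columns could come from a small control table, so one generic job covers both the full-load and the incremental tables.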
&lt;BR /&gt;I have not used dynamic schema, so I am not sure about it. Can you please share a sample job for it?
&lt;BR /&gt;For loading data from file to Hive, I guess your approach is correct. You can also load multiple tables in parallel. Since the structure of the tables is similar to what is present in the files, you do not need any validation either. Your approach is pretty much on target here.</description>
      <pubDate>Thu, 27 Oct 2016 05:01:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-Dynamic-flow-to-handle-400-tables/m-p/2366467#M129952</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-10-27T05:01:28Z</dc:date>
    </item>
  </channel>
</rss>

