<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Oracle to Hadoop Data Migration  - Best/Efficient Way to do Onetime/Delta(Ongoing) Load - Please guide ? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Oracle-to-Hadoop-Data-Migration-Best-Efficient-Way-to-do-Onetime/m-p/2200227#M2778</link>
    <description>&lt;P&gt;How to build a Data Warehouse from current Operational Data Systems stored in Oracle and move the data&amp;nbsp; to hadoop cluster and&lt;/P&gt; 
&lt;P&gt;then implementing processes that allow to update the Data Warehouse with source systems daily (or periodically).&lt;/P&gt; 
&lt;P&gt;-&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please guide using Talend how one can&amp;nbsp; do the following in a best/efficient way please - ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;1. Initial load into Data Warehouse ( Source - Oracle Target - Hadoop cluster - hdfs ) ?&amp;nbsp; please tell the detailed steps ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;2. Periodical delta loads into Data Warehouse&amp;nbsp;( Source - Oracle Target - Hadoop cluster - hdfs ) ?&amp;nbsp;please tell the detailed steps ?&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 04:23:20 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T04:23:20Z</dc:date>
    <item>
      <title>Oracle to Hadoop Data Migration  - Best/Efficient Way to do Onetime/Delta(Ongoing) Load - Please guide ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-to-Hadoop-Data-Migration-Best-Efficient-Way-to-do-Onetime/m-p/2200227#M2778</link>
      <description>&lt;P&gt;How to build a Data Warehouse from current Operational Data Systems stored in Oracle and move the data&amp;nbsp; to hadoop cluster and&lt;/P&gt; 
&lt;P&gt;then implementing processes that allow to update the Data Warehouse with source systems daily (or periodically).&lt;/P&gt; 
&lt;P&gt;-&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please guide using Talend how one can&amp;nbsp; do the following in a best/efficient way please - ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;1. Initial load into Data Warehouse ( Source - Oracle Target - Hadoop cluster - hdfs ) ?&amp;nbsp; please tell the detailed steps ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;2. Periodical delta loads into Data Warehouse&amp;nbsp;( Source - Oracle Target - Hadoop cluster - hdfs ) ?&amp;nbsp;please tell the detailed steps ?&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 04:23:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-to-Hadoop-Data-Migration-Best-Efficient-Way-to-do-Onetime/m-p/2200227#M2778</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T04:23:20Z</dc:date>
    </item>
    <item>
      <title>Re: Oracle to Hadoop Data Migration  - Best/Efficient Way to do Onetime/Delta(Ongoing) Load - Please guide ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Oracle-to-Hadoop-Data-Migration-Best-Efficient-Way-to-do-Onetime/m-p/2200228#M2779</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; If you have to extract data from a source table without any joins or sub queries, you can directly do it in a Bigdata Batch job as shown below.&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="image.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M7nt.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/151365iB27AA88A9C0D7ECE/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M7nt.png" alt="0683p000009M7nt.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;But, if you have join conditions or complex queries, I would recommend to use a Talend standard job to extract the data and use HDFS components in Standard job to load the data to target Hadoop cluster.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The difference between one off and daily will be the difference in data volume or filter condition. You need to also make sure that you are having adequate memory allocated for the job by changing the memory parameters of the job.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt; 
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Oct 2019 13:43:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Oracle-to-Hadoop-Data-Migration-Best-Efficient-Way-to-do-Onetime/m-p/2200228#M2779</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-10-15T13:43:05Z</dc:date>
    </item>
  </channel>
</rss>

