<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unite two datasets Automatically by name in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210980#M9192</link>
    <description>&lt;P&gt;1/ create table in DB that's the super-set of all columns in all sources (final table).&lt;/P&gt;
&lt;P&gt;2/ create table in DB that's got X columns, where X is a number which is the largest number of columns you'd expect in any incoming data-set, each column will be varchar(8000), just call the columns col1, col2, col3.....colX (work table)&lt;/P&gt;
&lt;P&gt;3/ load incoming data into DB work table&lt;/P&gt;
&lt;P&gt;4/ parse the header row of the incoming data, to ascertain the position of all columns of interest - this will be critical that the same column data is called the same name in each of the incoming data-sets.&lt;/P&gt;
&lt;P&gt;5/ construct SQL to INSERT INTO/SELECT from the work table into the final table.&lt;/P&gt;
&lt;P&gt;6/ execute the SQL&lt;/P&gt;
&lt;P&gt;7/clear down work table.&lt;/P&gt;</description>
    <pubDate>Thu, 06 Dec 2018 12:06:16 GMT</pubDate>
    <dc:creator>David_Beaty</dc:creator>
    <dc:date>2018-12-06T12:06:16Z</dc:date>
    <item>
      <title>Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210976#M9188</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt; 
&lt;P&gt;In Talend we use a tool called Tunite to union two or more data-sets. The problem with it is the schema of all the data-sets should be same, where the order of the columns and the number of columns should always be same. When handling a larger data-set, ordering the data-sets or creating dummy columns becomes difficult as there are many number of columns and many data-sets.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Is there any way where i can just unite them automatically based on the column names? and if the column is missing in other data-set it should have nulls instead.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;If it i a custom component , can anyone guide me towards the existing component or help me build one/&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 07:07:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210976#M9188</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T07:07:49Z</dc:date>
    </item>
    <item>
      <title>Re: Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210977#M9189</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LOtPAAW"&gt;@Aishwarya&lt;/A&gt;,since you need to do manually,there we do not have automatically arrange.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 05 Dec 2018 07:27:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210977#M9189</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2018-12-05T07:27:12Z</dc:date>
    </item>
    <item>
      <title>Re: Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210978#M9190</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;A way I've tackled this in the past is load the unknown schema data-set up into a table of a suitably large enough number of string columns. Then using the header line of the incoming data, dynamically construct some SQL to select back out all columns, replacing them with nulls if they weren't in the source file.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;It's not pretty, but it does work.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;David&lt;/P&gt;</description>
      <pubDate>Wed, 05 Dec 2018 16:02:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210978#M9190</guid>
      <dc:creator>David_Beaty</dc:creator>
      <dc:date>2018-12-05T16:02:35Z</dc:date>
    </item>
    <item>
      <title>Re: Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210979#M9191</link>
      <description>Can you please help me follow these steps? I am pretty new to Talend</description>
      <pubDate>Thu, 06 Dec 2018 08:53:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210979#M9191</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-12-06T08:53:32Z</dc:date>
    </item>
    <item>
      <title>Re: Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210980#M9192</link>
      <description>&lt;P&gt;1/ create table in DB that's the super-set of all columns in all sources (final table).&lt;/P&gt;
&lt;P&gt;2/ create table in DB that's got X columns, where X is a number which is the largest number of columns you'd expect in any incoming data-set, each column will be varchar(8000), just call the columns col1, col2, col3.....colX (work table)&lt;/P&gt;
&lt;P&gt;3/ load incoming data into DB work table&lt;/P&gt;
&lt;P&gt;4/ parse the header row of the incoming data, to ascertain the position of all columns of interest - this will be critical that the same column data is called the same name in each of the incoming data-sets.&lt;/P&gt;
&lt;P&gt;5/ construct SQL to INSERT INTO/SELECT from the work table into the final table.&lt;/P&gt;
&lt;P&gt;6/ execute the SQL&lt;/P&gt;
&lt;P&gt;7/clear down work table.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Dec 2018 12:06:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210980#M9192</guid>
      <dc:creator>David_Beaty</dc:creator>
      <dc:date>2018-12-06T12:06:16Z</dc:date>
    </item>
    <item>
      <title>Re: Unite two datasets Automatically by name</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210981#M9193</link>
      <description>&lt;P&gt;Thanks for the instruction. I tried do this but I did not succeed. Most likely, I just do not understand how to do it. For me, it's as hard as writing a term paper. I could not write a term paper because I do not know how to do it. I found &lt;A href="https://papersgram.com/" target="_blank" rel="nofollow noopener noreferrer"&gt;writing help&lt;/A&gt; a term paper on the internet and helped me. Probably I will have to look for a programmer who will help me to do everything according to your instructions, so that the information is displayed correctly through the database.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Dec 2018 14:11:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Unite-two-datasets-Automatically-by-name/m-p/2210981#M9193</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-12-06T14:11:50Z</dc:date>
    </item>
  </channel>
</rss>

