<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Handling duplicate column names when using Dynamic Schema in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226593#M18451</link>
    <description>&lt;P&gt;You could build a pre-processing job to deal with this scenario.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Unfortunately, this is an unusual problem. Data with several columns sharing the same name is not something any integration tool will resolve dynamically, natively and intelligently. If I were given a dataset I had not seen before and it had two columns called "name", I would have to spend a fair amount of time looking at the data to work out what each column actually held. As a human, I have "world knowledge" that might help me do this.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;A simple process that tokenizes the header row, looks for duplicates and renames each duplicate (perhaps just by appending a number to the existing name) should be relatively easy to build.&lt;/P&gt;</description>
    <pubDate>Mon, 06 Jan 2020 17:09:14 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2020-01-06T17:09:14Z</dc:date>
    <item>
      <title>Handling duplicate column names when using Dynamic Schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226590#M18448</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;I have a basic job set up that loads the contents of a CSV file into a database table using the "DYNAMIC" column type.&amp;nbsp; This works great for quickly getting raw data into a database.&lt;/P&gt; 
&lt;P&gt;However, the latest file I am testing this on has duplicate column headers.&amp;nbsp; Is there a way of handling duplicate column names when using Dynamic Schema?&amp;nbsp; Maybe by adding a number to the end of the duplicate column or something along those lines?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;This would be a basic example:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 342px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8PV.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156251iFB5C0FE5A6D55A85/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8PV.png" alt="0683p000009M8PV.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;I would like this to be imported as something like the following, to avoid the duplicate column name:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 316px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M821.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/151965iBEF9C821711381EA/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M821.png" alt="0683p000009M821.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;Many thanks&lt;/P&gt; 
&lt;P&gt;Steve&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 03:45:11 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226590#M18448</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T03:45:11Z</dc:date>
    </item>
    <item>
      <title>Re: Handling duplicate column names when using Dynamic Schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226591#M18449</link>
      <description>&lt;P&gt;The column names will need to be different. If you can add something to make the column names unique, that will solve this issue.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Dec 2019 17:24:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226591#M18449</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-12-20T17:24:07Z</dc:date>
    </item>
    <item>
      <title>Re: Handling duplicate column names when using Dynamic Schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226592#M18450</link>
      <description>&lt;P&gt;OK, thanks.&amp;nbsp; It seems it's not possible, then.&amp;nbsp; I wanted to remove the manual step of checking the column names, as the CSV files I'm dealing with can have hundreds of columns.&lt;/P&gt;</description>
      <pubDate>Mon, 06 Jan 2020 10:34:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226592#M18450</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-01-06T10:34:14Z</dc:date>
    </item>
    <item>
      <title>Re: Handling duplicate column names when using Dynamic Schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226593#M18451</link>
      <description>&lt;P&gt;You could build a pre-processing job to deal with this scenario.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Unfortunately, this is an unusual problem. Data with several columns sharing the same name is not something any integration tool will resolve dynamically, natively and intelligently. If I were given a dataset I had not seen before and it had two columns called "name", I would have to spend a fair amount of time looking at the data to work out what each column actually held. As a human, I have "world knowledge" that might help me do this.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
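&lt;P&gt;As a rough illustration of the pre-processing idea in this reply, here is a minimal Java sketch of the header-renaming step. It is not a Talend API: the class HeaderDedup, the method dedupe and the "_2" suffix style are all hypothetical choices made for this example, and the header is split on plain commas, which assumes there are no quoted commas in the header row.&lt;/P&gt;

```java
public class HeaderDedup {

    // Hypothetical helper, not part of Talend.
    // Returns a copy of the header names in which every repeat of an
    // earlier name gets a numeric suffix: name, name_2, name_3, ...
    static String[] dedupe(String[] names) {
        String[] result = new String[names.length];
        int filled = 0; // number of entries of result already assigned
        for (String name : names) {
            String candidate = name;
            int suffix = 1;
            // Keep bumping the suffix until the candidate is unused.
            while (contains(result, filled, candidate)) {
                suffix++;
                candidate = name + "_" + suffix;
            }
            result[filled] = candidate;
            filled++;
        }
        return result;
    }

    // True if candidate equals one of the first `count` entries of taken.
    static boolean contains(String[] taken, int count, String candidate) {
        int i = 0;
        for (String existing : taken) {
            if (i == count) break;
            if (existing.equals(candidate)) return true;
            i++;
        }
        return false;
    }

    public static void main(String[] args) {
        // Assumed sample header row; a real job would read the first line of the CSV.
        String headerRow = "id,name,name,city,name";
        // Naive tokenizing: assumes no quoted commas in the header row.
        String[] fixed = dedupe(headerRow.split(",", -1));
        System.out.println(String.join(",", fixed)); // prints id,name,name_2,city,name_3
    }
}
```

&lt;P&gt;The renamed header would then replace the first line of the file before the job that uses the Dynamic column reads it. The comparison here is case-sensitive; if the target database treats column names case-insensitively, the check would need adjusting.&lt;/P&gt;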
&lt;P&gt;A simple process that tokenizes the header row, looks for duplicates and renames each duplicate (perhaps just by appending a number to the existing name) should be relatively easy to build.&lt;/P&gt;</description>
      <pubDate>Mon, 06 Jan 2020 17:09:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Handling-duplicate-column-names-when-using-Dynamic-Schema/m-p/2226593#M18451</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-01-06T17:09:14Z</dc:date>
    </item>
  </channel>
</rss>

