<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Duplicate column names - xlsx in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-column-names-xlsx/m-p/2299630#M71957</link>
    <description>Hi Ian
&lt;BR /&gt;I think you have to identify the uniqueness of column whatever way you use with community version. You can take a look at the 
&lt;A href="https://help.talend.com/pages/viewpage.action?pageId=190513179" target="_blank" rel="nofollow noopener noreferrer"&gt;dynamic schema &lt;/A&gt;feature in Talend Subscription product.
&lt;BR /&gt;
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
    <pubDate>Mon, 19 Oct 2015 10:49:29 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-10-19T10:49:29Z</dc:date>
    <item>
      <title>Duplicate column names - xlsx</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-column-names-xlsx/m-p/2299629#M71956</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;I'm using the excellent tFileExcelSheetInput component but I'm having an issue. It is not with the component itself but with the incoming data.&lt;BR /&gt;&lt;BR /&gt;tFileExcelSheetInput allows me to 'search' for column names across the top row of each sheet, and then return the data under that column &lt;STRONG&gt;heading&lt;/STRONG&gt; into my flow (which is great).&lt;BR /&gt;&lt;BR /&gt;My issue is that the incoming spreadsheet&amp;nbsp;can contain duplicate column names. tFileExcelSheetInput appears to be taking the right hand most column with that heading and returning that data. As I said, it isn't really the component I have an issue with it is the duplicate column names.. The duplicated column names have different data below.&lt;BR /&gt;&lt;BR /&gt;So I can have column headings something like:&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;Name&lt;/STRONG&gt;&lt;BR /&gt;Tel&lt;BR /&gt;Mobile&lt;BR /&gt;House Number&lt;BR /&gt;&lt;STRONG&gt;Name&lt;/STRONG&gt;&lt;BR /&gt;Address1&lt;BR /&gt;Address2&lt;BR /&gt;etc..&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;The column positions can change from sheet to sheet - hence why I am using tFileExcelSheetInput.&lt;BR /&gt;As you can see above, the Column name 'Name' is repeated.&lt;BR /&gt;tFileExcelSheetInput will return the second 'Name' data (applies to a house name here and is hardly ever populated) rather than the persons 'Name' data.&lt;BR /&gt;&lt;BR /&gt;I am getting around this by reading the top row, pivoting it, adding a numeric counter (to maintain its position) then doing a tUniqRow to identify duplicates, then renaming the duplicate with a 1,2,3etc re-forming the column, order it on the numeric counter to get the column positions correct, and then unpivot back to a spreadsheet, where I then append the data from the original worksheet.&lt;BR /&gt;&lt;BR /&gt;In this example I end up with column names:&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;Name&lt;/STRONG&gt;&lt;BR /&gt;Tel&lt;BR /&gt;Mobile&lt;BR /&gt;House Number&lt;BR /&gt;&lt;STRONG&gt;Name1&lt;/STRONG&gt;&lt;BR /&gt;Address1&lt;BR /&gt;Address2&lt;BR /&gt;etc..&lt;BR /&gt;&lt;BR /&gt;This is a very roundabout process.&lt;BR /&gt;&lt;BR /&gt;Does anyone know of a better way of dealing with the duplicate column names?&lt;BR /&gt;&lt;BR /&gt;many thanks&lt;BR /&gt;&lt;BR /&gt;Ian&lt;/P&gt;</description>
      <pubDate>Tue, 13 Oct 2015 15:24:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-column-names-xlsx/m-p/2299629#M71956</guid>
      <dc:creator>IanM</dc:creator>
      <dc:date>2015-10-13T15:24:41Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate column names - xlsx</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-column-names-xlsx/m-p/2299630#M71957</link>
      <description>Hi Ian
&lt;BR /&gt;I think you have to identify the uniqueness of column whatever way you use with community version. You can take a look at the 
&lt;A href="https://help.talend.com/pages/viewpage.action?pageId=190513179" target="_blank" rel="nofollow noopener noreferrer"&gt;dynamic schema &lt;/A&gt;feature in Talend Subscription product.
&lt;BR /&gt;
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
      <pubDate>Mon, 19 Oct 2015 10:49:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-column-names-xlsx/m-p/2299630#M71957</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-19T10:49:29Z</dc:date>
    </item>
  </channel>
</rss>

