<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Avoid multiple header rows? in Data Quality</title>
    <link>https://community.qlik.com/t5/Data-Quality/Avoid-multiple-header-rows/m-p/2273558#M2883</link>
    <description>&lt;P&gt;I have a couple of CSV files that I load into Data Prep. All at once (I only specify a directory in "Add Dataset", no individual files). So far, so good.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;All files have the same structure, the first line is the header.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Is there a way to globally set the first row as header for all files? I know there is this "Row" -&amp;gt;&amp;nbsp;"Make as header..." feature, but what happens in my case is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;file1.csv:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Firstname;Lastname;Age&lt;/P&gt;&lt;P&gt;Felix;Kjellberg;23&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Julian;Ilett;43&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;file2.csv:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Firstname;Lastname;Age&lt;/P&gt;&lt;P&gt;Ben;Heck;58&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Dave;Jones;48&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The result in Data Prep is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;Firstname|Lastname|Age&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Ben Heck 58&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Dave Jones 48&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#00FF00"&gt;Firstname Lastname Age&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Felix Kjellberg 23&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Julian Ilett 43&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;So even if I set the blue line as header, the green line will stay. Is there a way to avoid this?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 09:47:42 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T09:47:42Z</dc:date>
    <item>
      <title>Avoid multiple header rows?</title>
      <link>https://community.qlik.com/t5/Data-Quality/Avoid-multiple-header-rows/m-p/2273558#M2883</link>
      <description>&lt;P&gt;I have a couple of CSV files that I load into Data Prep. All at once (I only specify a directory in "Add Dataset", no individual files). So far, so good.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;All files have the same structure, the first line is the header.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Is there a way to globally set the first row as header for all files? I know there is this "Row" -&amp;gt;&amp;nbsp;"Make as header..." feature, but what happens in my case is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;file1.csv:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Firstname;Lastname;Age&lt;/P&gt;&lt;P&gt;Felix;Kjellberg;23&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Julian;Ilett;43&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;file2.csv:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Firstname;Lastname;Age&lt;/P&gt;&lt;P&gt;Ben;Heck;58&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Dave;Jones;48&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The result in Data Prep is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;Firstname|Lastname|Age&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Ben Heck 58&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Dave Jones 48&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#00FF00"&gt;Firstname Lastname Age&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Felix Kjellberg 23&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Julian Ilett 43&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;So even if I set the blue line as header, the green line will stay. Is there a way to avoid this?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 09:47:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/Avoid-multiple-header-rows/m-p/2273558#M2883</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T09:47:42Z</dc:date>
    </item>
    <item>
      <title>Re: Avoid multiple header rows?</title>
      <link>https://community.qlik.com/t5/Data-Quality/Avoid-multiple-header-rows/m-p/2273559#M2884</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Out of curiosity, can you confirm the following?&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;You are using Data Prep 2.0.&lt;/LI&gt; 
 &lt;LI&gt;The CSV files are on HDFS.&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;To answer your question: there is no&amp;nbsp;dedicated data set parameter or function to remove subsequent occurrences of the header but you can do it in a single preparation step: set a filter on the first column with the column header as filter value (so filter on&amp;nbsp;"Firstname" in your example below) and use the function "delete filtered rows".&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Regards,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Gwendal&lt;/P&gt;</description>
      <pubDate>Fri, 12 May 2017 12:47:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/Avoid-multiple-header-rows/m-p/2273559#M2884</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-05-12T12:47:44Z</dc:date>
    </item>
  </channel>
</rss>

