<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: CSV skip selected rows in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263880#M43899</link>
    <description>Reply in the thread "CSV skip selected rows" (Talend Studio).</description>
    <pubDate>Fri, 22 Nov 2019 02:11:38 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-11-22T02:11:38Z</dc:date>
    <item>
      <title>CSV skip selected rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263876#M43895</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I need to skip some rows, or use a header and remove the first row after the header.&lt;/P&gt;
&lt;P&gt;CSV file format:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-left" image-alt="PSPad_qTucUnlg8r.png" style="width: 819px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8CR.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/142297iF615E6CDB4709689/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8CR.png" alt="0683p000009M8CR.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;First row: ignore/remove&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Second row: header&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Third row: ignore/remove&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Rows 4, 5, ...: data&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;If I select a delimited file and set Header = 2, the header is correct, but the first data row is then row 3, which I don't want.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 04:04:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263876#M43895</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T04:04:09Z</dc:date>
    </item>
    <item>
      <title>Re: CSV skip selected rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263877#M43896</link>
      <description>&lt;P&gt;tSampleRow is what you are looking for.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Nov 2019 12:54:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263877#M43896</guid>
      <dc:creator>TRF</dc:creator>
      <dc:date>2019-11-20T12:54:42Z</dc:date>
    </item>
    <item>
      <title>Re: CSV skip selected rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263878#M43897</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I'm guessing that the row you want to skip doesn't have data in the format of the schema provided to the tFileInputDelimited component. If so, you can use the advanced settings of the component: tick the box "Check each row structure against schema". This will eliminate rows that do not match the schema.&lt;/P&gt;
&lt;P&gt;If you have a different requirement, could you please elaborate on it?&lt;/P&gt;
&lt;P&gt;Hope this helps.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Nov 2019 06:02:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263878#M43897</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-11-21T06:02:02Z</dc:date>
    </item>
    <item>
      <title>Re: CSV skip selected rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263879#M43898</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;I am using&amp;nbsp;&lt;SPAN&gt;tFileInputDelimited, but in the schema I must use String for all columns, because the data contains these lines and Talend can't convert them:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I'm using&amp;nbsp;tFileInputDelimited -&amp;gt; tMap -&amp;gt; DBOutput.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;What do I want?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;- skip the first line after the header (it's the third line)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;- replace "NA" with null (but don't skip these rows)&lt;/P&gt;
&lt;P&gt;- remove the quotes from date &amp;amp; time and concatenate them into a datetime&lt;/P&gt;
&lt;P&gt;- use normal data types (double, Date, short)&lt;/P&gt;
&lt;P&gt;- the unique key is the datetime, so keep the CSV synchronized with the DB over some periods&lt;/P&gt;
&lt;P&gt;Is there an easy way to do this?&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;Thank you&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="EXCEL_XOVrZr2umO.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M7xG.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/134916iCE2191D5030E169B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M7xG.png" alt="0683p000009M7xG.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Nov 2019 13:13:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263879#M43898</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-11-21T13:13:14Z</dc:date>
    </item>
    <item>
      <title>Re: CSV skip selected rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263880#M43899</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Below are my thoughts.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;- skip the first line after the header (it's the third line)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LKj7AAG"&gt;@TRF&lt;/A&gt;&amp;nbsp;has already answered your query. You can also use the header skip setting in tFileInputDelimited if you do not need to pick in-between records (like picking the first, third, fifth, etc.).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;- replace "NA" with null (but don't skip these rows)&lt;/P&gt;
&lt;P&gt;Read the data and use an if (conditional) expression in tMap to replace the value with NULL.&lt;/P&gt;
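&lt;P&gt;A minimal sketch of such a conditional, written as a standalone Java class so it can be run outside Talend. The class and method names are hypothetical. In a tMap expression the same idea would look roughly like "NA".equals(row1.value) ? null : row1.value, where row1.value is an assumed column name.&lt;/P&gt;

```java
public class NaToNull {

    // Hypothetical helper: returns null when the raw field is exactly "NA",
    // otherwise returns the field unchanged. Putting the literal first keeps
    // the call safe when raw itself is null.
    public static String naToNull(String raw) {
        return "NA".equals(raw) ? null : raw;
    }

    public static void main(String[] args) {
        System.out.println(naToNull("NA"));   // prints: null
        System.out.println(naToNull("12.5")); // prints: 12.5
    }
}
```

&lt;P&gt;The row itself is kept; only the field value changes, which matches the "don't skip these rows" requirement.&lt;/P&gt;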
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;- remove the quotes from date &amp;amp; time and concatenate them into a datetime&lt;/P&gt;
&lt;P&gt;If you are reading the values as String, you can use the replaceAll() function in tMap. Lots of examples are already available in the community.&lt;/P&gt;
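&lt;P&gt;A rough sketch of the quote removal and concatenation in plain Java. The thread does not show the actual date and time layout, so the pattern yyyy-MM-dd HH:mm:ss, the sample values, and the class and method names are all assumptions; adjust the pattern to the real file.&lt;/P&gt;

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

public class QuotedDateTime {

    // Assumed layout of the combined value; the real file may differ.
    private static final DateTimeFormatter FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");

    // Removes every double quote from the field, then trims whitespace.
    public static String stripQuotes(String raw) {
        return raw.replaceAll("\"", "").trim();
    }

    // Joins the cleaned date and time with a space and parses the result.
    public static LocalDateTime toDateTime(String date, String time) {
        String joined = stripQuotes(date) + " " + stripQuotes(time);
        return LocalDateTime.parse(joined, FORMAT);
    }

    public static void main(String[] args) {
        // Sample values are invented for illustration.
        System.out.println(toDateTime("\"2019-11-20\"", "\"12:54:42\""));
        // prints: 2019-11-20T12:54:42
    }
}
```

&lt;P&gt;Inside a Talend job the column type is usually java.util.Date rather than LocalDateTime, so the parsing step would more likely go through a routine such as TalendDate.parseDate with the same pattern; the string cleanup is the same.&lt;/P&gt;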
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;- use normal data types (double, Date, short)&lt;/P&gt;
&lt;P&gt;Sure &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; Once you have done the cleansing, either use Java data type conversion functions or use the tConvertType component.&lt;/P&gt;
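&lt;P&gt;For the Java conversion route, a small sketch using only standard library calls. The class and method names are hypothetical, and it assumes that "NA" values have already been turned into null, so null is passed through instead of being parsed.&lt;/P&gt;

```java
public class TypeConversion {

    // Converts a cleaned String to Double; null stays null.
    public static Double toDouble(String s) {
        return s == null ? null : Double.valueOf(s.trim());
    }

    // Converts a cleaned String to Short; null stays null.
    public static Short toShort(String s) {
        return s == null ? null : Short.valueOf(s.trim());
    }

    public static void main(String[] args) {
        System.out.println(toDouble("12.5")); // prints: 12.5
        System.out.println(toShort(" 7 "));   // prints: 7
        System.out.println(toDouble(null));   // prints: null
    }
}
```

&lt;P&gt;Both valueOf calls throw NumberFormatException on text that is not a valid number, so any remaining non-numeric rows would need to be filtered or rejected before this step.&lt;/P&gt;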
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;- the unique key is the datetime, so keep the CSV synchronized with the DB over some periods&lt;/P&gt;
&lt;P&gt;Use the tUniqRow component to find unique records, or the tAggregateRow component to group them.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt;</description>
      <pubDate>Fri, 22 Nov 2019 02:11:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/CSV-skip-selected-rows/m-p/2263880#M43899</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-11-22T02:11:38Z</dc:date>
    </item>
  </channel>
</rss>

