<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Filter comments out of input file in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256120#M38608</link>
    <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LOphAAG"&gt;@imholzj&lt;/A&gt;&amp;nbsp;but if the file is in below format&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;VERSION = 2007&lt;BR /&gt;DATE = 10/23/2017&lt;BR /&gt;EXPDATE = 10/1/2018&lt;BR /&gt;###########################################&lt;BR /&gt;FAMILY NURSE CARE OF LLC&lt;BR /&gt;HARRY GILL, MD, PHD, PC&lt;BR /&gt;LEAWOOD FAMILY EYE CARE PA&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;By using the tfilterrow in advance setting we can only remove one line with ## symbols&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please look at screenshot below i ran the test&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Headertest.PNG" style="width: 710px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M36D.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/151105i6C98A6CC8FC1C8BD/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M36D.png" alt="0683p000009M36D.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 14 Mar 2019 17:35:47 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-03-14T17:35:47Z</dc:date>
    <item>
      <title>Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256115#M38603</link>
      <description>&lt;P&gt;I have a csv file that has leading comments, how can I filter them out?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;# use this file for additional users
# all values must be present
id,userName,date
"1","foo","01.01.2019"
"2","bar","02.01.2019"
.
.
.&lt;/PRE&gt;</description>
      <pubDate>Sat, 16 Nov 2024 06:20:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256115#M38603</guid>
      <dc:creator>imholzj</dc:creator>
      <dc:date>2024-11-16T06:20:13Z</dc:date>
    </item>
    <item>
      <title>Re: Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256116#M38604</link>
      <description>&lt;P&gt;use Header = 2 in tfiledelimted&lt;/P&gt; 
&lt;P&gt;look at the screeshot below&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="header.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M3PR.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/139365i1579CECA117B2965/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M3PR.png" alt="0683p000009M3PR.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 14 Mar 2019 13:45:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256116#M38604</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-03-14T13:45:29Z</dc:date>
    </item>
    <item>
      <title>Re: Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256117#M38605</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your 1st Field is Integer like "1" or 1(as in you case) ,&amp;nbsp; just remove "Die On Error" , All the record will be rejected( I&lt;STRONG&gt;ncluding header&lt;/STRONG&gt;)&lt;/P&gt;
&lt;P&gt;You can capture these using Reject Link from Input Component&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 14 Mar 2019 14:48:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256117#M38605</guid>
      <dc:creator>akumar2301</dc:creator>
      <dc:date>2019-03-14T14:48:21Z</dc:date>
    </item>
    <item>
      <title>Re: Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256118#M38606</link>
      <description>&lt;P&gt;I am looking for a more general solution, allowing people to add more comments if necessary.&amp;nbsp; Seems like hardcoding a 2 would prevent that.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;jji&lt;/P&gt;</description>
      <pubDate>Thu, 14 Mar 2019 15:04:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256118#M38606</guid>
      <dc:creator>imholzj</dc:creator>
      <dc:date>2019-03-14T15:04:48Z</dc:date>
    </item>
    <item>
      <title>Re: Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256119#M38607</link>
      <description>&lt;P&gt;I'm going with a tFilterRow for a solution:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Advanced mode with:&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="courier new,courier"&gt;// filter out all comments&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;!( java.util.regex.Pattern.matches("^\\s*#.*", row1.Status) )&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In this case "Status" is the first field defined in the schema (which happened to be String.)&amp;nbsp; &amp;nbsp;I'm not sure how to do this if it weren't.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;jji&lt;/P&gt;</description>
      <pubDate>Thu, 14 Mar 2019 17:03:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256119#M38607</guid>
      <dc:creator>imholzj</dc:creator>
      <dc:date>2019-03-14T17:03:12Z</dc:date>
    </item>
    <item>
      <title>Re: Filter comments out of input file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256120#M38608</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LOphAAG"&gt;@imholzj&lt;/A&gt;&amp;nbsp;but if the file is in below format&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;VERSION = 2007&lt;BR /&gt;DATE = 10/23/2017&lt;BR /&gt;EXPDATE = 10/1/2018&lt;BR /&gt;###########################################&lt;BR /&gt;FAMILY NURSE CARE OF LLC&lt;BR /&gt;HARRY GILL, MD, PHD, PC&lt;BR /&gt;LEAWOOD FAMILY EYE CARE PA&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;By using the tfilterrow in advance setting we can only remove one line with ## symbols&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please look at screenshot below i ran the test&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Headertest.PNG" style="width: 710px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M36D.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/151105i6C98A6CC8FC1C8BD/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M36D.png" alt="0683p000009M36D.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 14 Mar 2019 17:35:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Filter-comments-out-of-input-file/m-p/2256120#M38608</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-03-14T17:35:47Z</dc:date>
    </item>
  </channel>
</rss>

