<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: File check - check columns + delimiters in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371911#M134802</link>
    <description>Hi,
&lt;BR /&gt;As a basic solution, you can use tFileInputFullRow to read each line as a single field, then count the number of field separators to exclude incorrect records.</description>
    <pubDate>Fri, 06 Oct 2017 08:26:09 GMT</pubDate>
    <dc:creator>TRF</dc:creator>
    <dc:date>2017-10-06T08:26:09Z</dc:date>
    <item>
      <title>File check - check columns + delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371910#M134801</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Every day I process a CSV file with 15 columns, but sometimes the format of this file is not correct, which may be caused by an error in the extraction itself.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Types of errors in the CSV file:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;- Number of columns greater than 15&lt;/P&gt; 
&lt;P&gt;-&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Number of columns less than 15&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;- Number of columns is 15 in the header, but certain rows contain more or fewer columns&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;How can I check for each of these errors before processing? If any of them is found, I can use tDie to end the process.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;In my case, I cannot use tSchemaComplianceCheck, since it only verifies type, length, null values, and date format. Is that right?&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;Good file:&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;Nom;Prenom;Adresse;Telephone;Date de naissance;Ville
Jin;Kazama;Moka;56656496;7/11/1989;Pamplemousses
Naruto;Uzumaki;Triolet;56461654;2/12/1989;Pamplemousses
Paul;Phoenix;Soulliac;97146665;23/08/1987;Pamplemousses&lt;/PRE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;Bad files:&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;PRE&gt;Nom;Prenom;Adresse;Telephone
Jin;Kazama;Moka;56656496
Naruto;Uzumaki;Triolet;56461654
Paul;Phoenix;Soulliac;97146665&lt;/PRE&gt; 
&lt;PRE&gt;Nom;Prenom;Adresse;Telephone;Date de naissance;Ville
Jin;Kazama;Moka;56656496;7/11/1989
Naruto;Uzumaki;Triolet;56461654;2/12/1989;Pamplemousses
Paul;Phoenix;Soulliac;97146665;23/08/1987;Pamplemousses&lt;/PRE&gt; 
&lt;PRE&gt;Nom;Prenom;Adresse;Telephone;Date de naissance;Ville
Jin;Kazama;Moka;56656496;7/11/1989;Pamplemousses
Naruto;Uzumaki;Triolet;56461654;2/12/1989;Pamplemousses
Paul;Phoenix;Soulliac;97146665;Pamplemousses&lt;/PRE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;Can you advise, please?&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Best regards,&lt;/P&gt; 
&lt;P&gt;asadasing&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 09:12:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371910#M134801</guid>
      <dc:creator>RA6</dc:creator>
      <dc:date>2024-11-16T09:12:35Z</dc:date>
    </item>
    <item>
      <title>Re: File check - check columns + delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371911#M134802</link>
      <description>Hi,
&lt;BR /&gt;As a basic solution, you can use tFileInputFullRow to read each line as a single field, then count the number of field separators to exclude incorrect records.</description>
      <pubDate>Fri, 06 Oct 2017 08:26:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371911#M134802</guid>
      <dc:creator>TRF</dc:creator>
      <dc:date>2017-10-06T08:26:09Z</dc:date>
    </item>
    <item>
      <title>Re: File check - check columns + delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371912#M134803</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;In tFileInputDelimited, open the Advanced settings and select the option "Check each row structure against schema". Then take the rejects output from tFileInputDelimited. Any row whose number of columns does not match the schema will be rejected.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;You can then count the rows in the reject file; if there are any, you can stop the process. I hope this helps.&lt;/P&gt; 
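&lt;P&gt;As a rough illustration of the same reject-and-stop logic outside Talend, here is a minimal Python sketch. It assumes a semicolon delimiter and the 6-column layout of the sample files in this thread; the names find_bad_rows, EXPECTED_COLUMNS and DELIMITER are hypothetical and are not part of any Talend component.&lt;/P&gt;

```python
import csv
import io

# Hypothetical names for illustration only; nothing here is a Talend API.
EXPECTED_COLUMNS = 6  # assumption: the sample files in this thread have 6 columns
DELIMITER = ";"       # assumption: semicolon-separated values


def find_bad_rows(text, expected=EXPECTED_COLUMNS, delimiter=DELIMITER):
    """Return (row_number, field_count) for every row whose field count differs from expected."""
    rejects = []
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    for row_number, fields in enumerate(reader, start=1):
        if len(fields) != expected:
            rejects.append((row_number, len(fields)))
    return rejects


good = (
    "Nom;Prenom;Adresse;Telephone;Date de naissance;Ville\n"
    "Jin;Kazama;Moka;56656496;7/11/1989;Pamplemousses\n"
)
bad = (
    "Nom;Prenom;Adresse;Telephone;Date de naissance;Ville\n"
    "Jin;Kazama;Moka;56656496;7/11/1989\n"
)

print(find_bad_rows(good))  # prints []
print(find_bad_rows(bad))   # prints [(2, 5)]
```

&lt;P&gt;An empty list means every row has the expected number of columns. A non-empty list corresponds to the case where the job would be stopped, for example with tDie.&lt;/P&gt;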
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Regards&lt;/P&gt; 
&lt;P&gt;Ravinder&lt;/P&gt;</description>
      <pubDate>Mon, 09 Oct 2017 08:01:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/File-check-check-columns-delimiters/m-p/2371912#M134803</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-09T08:01:19Z</dc:date>
    </item>
  </channel>
</rss>

