<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problematic delimiter in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Problematic-delimiter/m-p/2312883#M83787</link>
    <description>&lt;P&gt;&lt;STRONG&gt;solution&lt;/STRONG&gt; possible to be only one - fix the source of information&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;all other is not a solutions, it only tricks, which could work, or not work, and even if it work, it not mean its continue work with next not accurate case.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;often motivations look like - we can not change source of information ... but in most of cases it not true, and this is only one proper way&lt;/P&gt;</description>
    <pubDate>Wed, 11 Jul 2018 01:38:40 GMT</pubDate>
    <dc:creator>vapukov</dc:creator>
    <dc:date>2018-07-11T01:38:40Z</dc:date>
    <item>
      <title>Problematic delimiter</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problematic-delimiter/m-p/2312882#M83786</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I need to solve a problem of duplicate/punctuation in my job.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Here is an exemple of my input file :&lt;/P&gt; 
&lt;P&gt;baba;”bibi”;”bobo”;bubu;&lt;/P&gt; 
&lt;P&gt;tata; ”titi;titi”; ”toto”;tutu;&lt;/P&gt; 
&lt;P&gt;dada&amp;nbsp;; ” ”didi”didi”;”dodo”;dudu;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;In this exemple, we have 3 lines with 4 columns.&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;In the first line, everything is clear. That should be the proper format. note : the double quotes are not necessary in the output file.&lt;/LI&gt; 
 &lt;LI&gt;In the second line, I have my first problem to solve. I have a “;” in the middle of my string and since the given delimiter is also a “;”, Talend thinks I have 5 columns instead of 4.&lt;/LI&gt; 
 &lt;LI&gt;In the third line, I have some “”” (double quote) inside my columns that already possess double quotes.&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;So! Basically, what is my problem? My problem is this “;” located on my 2nd line. I need to make Talend think that I do have 4 columns and not 5. So I use a tMap where I erase this “;” by “” (nothing) because the absence of this “;” inside the column is not critical. And to do so, I used the option “CSV options” in my “Basic settings”. Without this CSV option, I can not erase the “;” in my column.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;But, by using the “CSV options”, the double quotes on the extremities on my columns disappear on all my lines! As a matter of fact, the “;” problem is no more. But I have a new problem. On my line 3, my second column is seen as empty. Instead of selecting “didi”didi, Talend selects&amp;nbsp; (the space just before).&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I tried a few things, but I am running out of options. I also tried using the “length” column in my schema. But it seems useless since that tFileInputDelimited doesn’t check the length of the data when it reads it.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Do anyone has any solution to submit to deal with those “;” and “”” ? The ideal solution would be to erase the “;” without erasing any “”” (double quote).&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have seen&amp;nbsp;&lt;A href="https://community.qlik.com/s/feed/0D53p00007vCrkNCAS#M74577" target="_self"&gt;this solution&lt;/A&gt;&amp;nbsp;which really looks like my actual problem. But my multiples quotes inside my columns make me unable to use this solution...&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 07:58:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problematic-delimiter/m-p/2312882#M83786</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T07:58:53Z</dc:date>
    </item>
    <item>
      <title>Re: Problematic delimiter</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problematic-delimiter/m-p/2312883#M83787</link>
      <description>&lt;P&gt;&lt;STRONG&gt;solution&lt;/STRONG&gt; possible to be only one - fix the source of information&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;all other is not a solutions, it only tricks, which could work, or not work, and even if it work, it not mean its continue work with next not accurate case.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;often motivations look like - we can not change source of information ... but in most of cases it not true, and this is only one proper way&lt;/P&gt;</description>
      <pubDate>Wed, 11 Jul 2018 01:38:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problematic-delimiter/m-p/2312883#M83787</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2018-07-11T01:38:40Z</dc:date>
    </item>
  </channel>
</rss>

