<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Text enclosure delimiter in CSV data in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375659#M138042</link>
    <description>Anyone??? 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt;</description>
    <pubDate>Wed, 14 Jan 2015 16:25:19 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-01-14T16:25:19Z</dc:date>
    <item>
      <title>Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375650#M138033</link>
      <description>I have a set of CSV data with comma as the field separator and double quote(") as the text enclosure. &amp;nbsp;In some of the fields the text enclosure character is in the data. 
&lt;BR /&gt;The behavior of the tfileInputDelimited is to truncate the field at the first " in the field. &amp;nbsp;I do not see any errors from this truncation. 
&lt;BR /&gt;e.g. "This is "a" test field" is truncated to "This is " in the output data. &amp;nbsp;The interesting thing is that the parser is smart enough to recognize the start of the next field correctly so the rest of the data looks fine. 
&lt;BR /&gt;I could change delimiters but this is a large export and I cannot be certain in any case that the text enclosure character will not be in the field. 
&lt;BR /&gt;At a minimum I need to know that the parser truncated the data on that field with some kind of exception log or I need to determine a way to handle the anomaly correctly. 
&lt;BR /&gt;Any suggestions? 
&lt;BR /&gt;Jim</description>
      <pubDate>Tue, 21 Oct 2014 17:50:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375650#M138033</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-21T17:50:47Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375651#M138034</link>
      <description>Hi,
&lt;BR /&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Is your expected result: This is a test field?&amp;nbsp; If so, you can use &lt;/FONT&gt;&lt;/FONT&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;tReplace component to remove " in your output column.&lt;BR /&gt;Please see component reference &lt;A href="https://help.talend.com/search/all?query=tReplace&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tReplace&lt;/A&gt;&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Sabrina&lt;BR /&gt;&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Wed, 22 Oct 2014 04:37:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375651#M138034</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-22T04:37:24Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375652#M138035</link>
      <description>My expected result is:&amp;nbsp;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;This is "a" test field (actually it is xml so I expect &amp;lt;myfield&amp;gt;&lt;/FONT&gt;&lt;/FONT&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;This is "a" test field&amp;lt;/myfield&amp;gt;)&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;Since this is XML I could also live with a replacement to an entity ($quote)&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;The replace solution does not work - it seems that when used against a field the parser has already truncated the field data at the first quote.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;I would need to be able to do the replace against the whole record before it is parsed into fields - I don't see how to do that.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;This feels like a bug. &amp;nbsp;All the other edge cases work great (like record ends in the middle of data fields).&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;This is very important for me to solve soon.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;Thanks - Jim&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Wed, 22 Oct 2014 16:49:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375652#M138035</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-22T16:49:12Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375653#M138036</link>
      <description>One way of working around this might be to use tFileInputFullRow and use a replace on those records, then pipe that into the tFileInputdelimited. &amp;nbsp;Firstly I don't know how to pipe that output so if there is a way to do that it would help. &amp;nbsp;If not I can probably write the data to a temp file and then process that by tFileInputdelimited but that double my disk space requirements...
&lt;BR /&gt;Jim</description>
      <pubDate>Wed, 22 Oct 2014 18:29:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375653#M138036</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-22T18:29:15Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375654#M138037</link>
      <description>Hi, 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;The replace solution does not work - it seems that when used against a field the parser has already truncated the field data at the first quote.p&lt;/FONT&gt;&lt;/FONT&gt;&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt; 
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;Did you set tReplace component as screenshot shown?&lt;/FONT&gt;&lt;/FONT&gt; 
&lt;BR /&gt; 
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;&lt;BR /&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt; 
&lt;BR /&gt; 
&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;&lt;FONT size="1"&gt;Sabrina&lt;BR /&gt;&lt;/FONT&gt;&lt;/FONT&gt; 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MBL7.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156266i336A3FAC787A52F0/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MBL7.png" alt="0683p000009MBL7.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Thu, 23 Oct 2014 10:23:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375654#M138037</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-23T10:23:44Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375655#M138038</link>
      <description>Yes - the issue seems to be the field is truncated before replace gets to act on it.&lt;BR /&gt;Jim</description>
      <pubDate>Thu, 23 Oct 2014 13:23:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375655#M138038</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-23T13:23:38Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375656#M138039</link>
      <description>Hi,&lt;BR /&gt;Could you please share your current job design screenshot with us?&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Fri, 24 Oct 2014 07:09:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375656#M138039</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-10-24T07:09:00Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375657#M138040</link>
      <description>Sorry this reply is so late - I got move to another job - now I am back... 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Here is a simple test case with screen shots and data. 
&lt;BR /&gt;Screen1.png shows the job and that it ran without errors. 
&lt;BR /&gt;Screen2.png shows the setup of the inputfile delimited 
&lt;BR /&gt;Screen3.png shows the replace parameters 
&lt;BR /&gt;Screen4.png shows the XML Output Parameters 
&lt;BR /&gt;Screen5.png shows the schema mapping 
&lt;BR /&gt;Screen6.png shows the XML output 
&lt;BR /&gt;EventDetails-bad.txt is the input file. &amp;nbsp;The second record has the word text quoted with the delimiter quotes. 
&lt;BR /&gt;test_csv_conversion_0.1.zip is an executable build of the job. &amp;nbsp;You will need to place the data in the correct folder for it to run. 
&lt;BR /&gt;Note there is no error message on the run. 
&lt;BR /&gt;Note that in the XML output the &amp;lt;DETAIL_VALUE&amp;gt; content for the second record is truncated where the embedded quote is. 
&lt;BR /&gt;Note also that the subsequent field &amp;lt;FILENAME&amp;gt; is correct so the data was parsed correctly but not output correctly. 
&lt;BR /&gt;I hope this helps isolate the issue. 
&lt;BR /&gt;There may be duplicates uploaded as I did not see the confirmation message for more than 4 files. 
&lt;BR /&gt;EventDetails-bad.txt.txttest_csv_conversion_0.1.zip.zip</description>
      <pubDate>Tue, 06 Jan 2015 17:36:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375657#M138040</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-01-06T17:36:47Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375658#M138041</link>
      <description>Anybody from Talend looking at this?&lt;BR /&gt;Thanks&lt;BR /&gt;Jim</description>
      <pubDate>Mon, 12 Jan 2015 17:44:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375658#M138041</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-01-12T17:44:58Z</dc:date>
    </item>
    <item>
      <title>Re: Text enclosure delimiter in CSV data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375659#M138042</link>
      <description>Anyone??? 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Wed, 14 Jan 2015 16:25:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Text-enclosure-delimiter-in-CSV-data/m-p/2375659#M138042</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-01-14T16:25:19Z</dc:date>
    </item>
  </channel>
</rss>

