<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Parsing CSV - Commas within double quotes in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301378#M73498</link>
    <description>Currently running 4.0.2 r43696 and it looks like a bug has crept in. (There isn't a tFileInputCSV for this version and appears deprecated)
&lt;BR /&gt;When setting up the metadata/file_delimited and select a CSV similar to vclark's file. (EX)
&lt;BR /&gt;"type","location","setting","complexity",'prod_type","category","stock","website"
&lt;BR /&gt;"admin","base","default","simple","hardware","56,51","0","www.example.com"
&lt;BR /&gt;"admin","base","default","simple","hardware","32,61","1","www.example.com"
&lt;BR /&gt;should read in the following values with Set Heading Row as Column Names checked.
&lt;BR /&gt;Type Location Setting Complexity Prod_Type Category Stock Website
&lt;BR /&gt;admin base default simple hardware 56,51 0 
&lt;A href="http://www.example.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.example.com&lt;/A&gt;
&lt;BR /&gt;admin base default simple hardware 32,61 1 
&lt;A href="http://www.example.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.example.com&lt;/A&gt;
&lt;BR /&gt;However upon setting the CSV (Escape Char Settings) - Text Enclosure to "\"" and Comma - Field Separator as "," yields the following incorrect results:
&lt;BR /&gt;"Type" "Location" "Setting" "Complexity" "Prod_Type" "Category" "Stock" "Website"
&lt;BR /&gt;"admin" "base" "default" "simple" "hardware" "56 51" "0"
&lt;BR /&gt;"admin" "base" "default" "simple" "hardware" "32 61" "1"
&lt;BR /&gt;Changing the Field Separator to Semicolon - ";" will read the data fields in correctly but will read the column names as single column and therefore drop all of the fields into a single column (quotes and all).
&lt;BR /&gt;As you can see the Category field is not parsed correctly.
&lt;BR /&gt;Anthony
&lt;BR /&gt;Talend Certified</description>
    <pubDate>Mon, 23 Aug 2010 00:48:41 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2010-08-23T00:48:41Z</dc:date>
    <item>
      <title>Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301370#M73490</link>
      <description>I'm evaluating TOS. I created a new project and added a delimited file under Metadata -&amp;gt; File Delimited. 
&lt;BR /&gt;Going thru the wizard I get to step 3 where you define the encoding, file separator, and a few other parameters. This screen reads the file you specified in step 2 and parses it according to the delimiter. In my case it is a CSV file. But the file has double quotes around all the field values and some of those contain commas within them. 
&lt;BR /&gt;Is it possible to tell TOS to ignore commas that are between double quotes? This is a very common situation and is handled by default in spreadsheet programs such as Open Office or Excel. 
&lt;BR /&gt;This seems like very elementary functionality and the reason I started looking for an ETL tool in the first place is because the COPY command in Postgres cannot handle this situation. 
&lt;BR /&gt;My other alternative is to use dbf file downloads of the same data. Is it possible in TOS to define the dbf delimiiter?</description>
      <pubDate>Sat, 16 Nov 2024 14:32:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301370#M73490</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T14:32:41Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301371#M73491</link>
      <description>Never mind. I figured it out with the Text Enclosure setting.</description>
      <pubDate>Fri, 05 Oct 2007 05:17:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301371#M73491</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2007-10-05T05:17:19Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301372#M73492</link>
      <description>I would be interest in how you over came this problem as I experienced a similar problem with my delimiter files.&lt;BR /&gt;I didn't spend too much time to find a solution, i simply changed my delimiter character.</description>
      <pubDate>Fri, 05 Oct 2007 05:43:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301372#M73492</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2007-10-05T05:43:45Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301373#M73493</link>
      <description>Hello, &lt;BR /&gt;This is the difference between tFileInputDelimited (no text enclosure) and tFileInputCSV (text enclosure).&lt;BR /&gt;&lt;BR /&gt;Regards,</description>
      <pubDate>Fri, 05 Oct 2007 09:03:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301373#M73493</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2007-10-05T09:03:00Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301374#M73494</link>
      <description>JPN - When you are adding File Delimited Metadata the wizard wiill give you the opportunity to set the "text enclosure". Select "\"" and it reads the file correctly.</description>
      <pubDate>Fri, 05 Oct 2007 15:15:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301374#M73494</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2007-10-05T15:15:20Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301375#M73495</link>
      <description>Hello! thank you for your hint. with this option (using Text Enclosure "\"") it is possible to import google-contact exports (CSV). &lt;BR /&gt;--Robert</description>
      <pubDate>Fri, 03 Jul 2009 14:18:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301375#M73495</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-07-03T14:18:26Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301376#M73496</link>
      <description>hi all, 
&lt;BR /&gt;a few more explanation about enclosure 
&lt;BR /&gt;the back slash "\" is the way to escape any character following it which can be a reserved one for the code (\', \\ , ..) 
&lt;BR /&gt;I tell the program : "don't use it like a code character but like any other one " 
&lt;BR /&gt;like in a regex "\\d" : d is reserved pattern so put "\" before , but '\' is reserved too ..so also reserved it =&amp;gt; \\d 
&lt;BR /&gt;++</description>
      <pubDate>Fri, 03 Jul 2009 16:19:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301376#M73496</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-07-03T16:19:49Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301377#M73497</link>
      <description>My Text Enclosure character is ~... I can edit this property on the tFileDelimited properties but I can't edit it under the Schema definition in the Repository - I have to choose from the drop list and none of them apply.</description>
      <pubDate>Thu, 10 Sep 2009 15:48:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301377#M73497</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2009-09-10T15:48:22Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301378#M73498</link>
      <description>Currently running 4.0.2 r43696 and it looks like a bug has crept in. (There isn't a tFileInputCSV for this version and appears deprecated)
&lt;BR /&gt;When setting up the metadata/file_delimited and select a CSV similar to vclark's file. (EX)
&lt;BR /&gt;"type","location","setting","complexity",'prod_type","category","stock","website"
&lt;BR /&gt;"admin","base","default","simple","hardware","56,51","0","www.example.com"
&lt;BR /&gt;"admin","base","default","simple","hardware","32,61","1","www.example.com"
&lt;BR /&gt;should read in the following values with Set Heading Row as Column Names checked.
&lt;BR /&gt;Type Location Setting Complexity Prod_Type Category Stock Website
&lt;BR /&gt;admin base default simple hardware 56,51 0 
&lt;A href="http://www.example.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.example.com&lt;/A&gt;
&lt;BR /&gt;admin base default simple hardware 32,61 1 
&lt;A href="http://www.example.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.example.com&lt;/A&gt;
&lt;BR /&gt;However upon setting the CSV (Escape Char Settings) - Text Enclosure to "\"" and Comma - Field Separator as "," yields the following incorrect results:
&lt;BR /&gt;"Type" "Location" "Setting" "Complexity" "Prod_Type" "Category" "Stock" "Website"
&lt;BR /&gt;"admin" "base" "default" "simple" "hardware" "56 51" "0"
&lt;BR /&gt;"admin" "base" "default" "simple" "hardware" "32 61" "1"
&lt;BR /&gt;Changing the Field Separator to Semicolon - ";" will read the data fields in correctly but will read the column names as single column and therefore drop all of the fields into a single column (quotes and all).
&lt;BR /&gt;As you can see the Category field is not parsed correctly.
&lt;BR /&gt;Anthony
&lt;BR /&gt;Talend Certified</description>
      <pubDate>Mon, 23 Aug 2010 00:48:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301378#M73498</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-08-23T00:48:41Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301379#M73499</link>
      <description>I think this bug is still in the 4.0.3 release: I've been evaluating talend with some very simple CSV transforms and it turns out this is the thing that's been frustrating me! I guess I can preprocess my CSVs before they hit talend. &lt;span class="lia-unicode-emoji" title=":confused_face:"&gt;😕&lt;/span&gt; 
&lt;BR /&gt;Steve 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Currently running 4.0.2 r43696 and it looks like a bug has crept in. (There isn't a tFileInputCSV for this version and appears deprecated)&lt;BR /&gt;...&lt;BR /&gt;Changing the Field Separator to Semicolon - ";" will read the data fields in correctly but will read the column names as single column and therefore drop all of the fields into a single column (quotes and all).&lt;BR /&gt;As you can see the Category field is not parsed correctly.&lt;BR /&gt;Anthony&lt;BR /&gt;Talend Certified&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Thu, 21 Oct 2010 14:40:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301379#M73499</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2010-10-21T14:40:01Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301380#M73500</link>
      <description>Sorry, I meant this bug still exists in the TOS-All-r49467-V4.1.0 release.
&lt;BR /&gt;Steve</description>
      <pubDate>Thu, 21 Oct 2010 14:41:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301380#M73500</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2010-10-21T14:41:17Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301381#M73501</link>
      <description>I am experiencing the same problem as anthonyp. Is there a solution or work around? I am using the open version.</description>
      <pubDate>Thu, 09 Dec 2010 20:18:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301381#M73501</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2010-12-09T20:18:10Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing CSV - Commas within double quotes</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301382#M73502</link>
      <description>The bug in v4.0.2 only affected the New Delimited File dialog and was corrected in v4.0.3 and v4.1.0.&lt;BR /&gt;An actual job in v4.0.2 reading the file worked fine; see screen-print.</description>
      <pubDate>Fri, 10 Dec 2010 02:18:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-CSV-Commas-within-double-quotes/m-p/2301382#M73502</guid>
      <dc:creator>alevy</dc:creator>
      <dc:date>2010-12-10T02:18:39Z</dc:date>
    </item>
  </channel>
</rss>

