<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Escape and Text Enclosure for csv in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310248#M81390</link>
    <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I'm currently facing an issue while reading an input csv file where the data is enclosed within double quotes.&lt;/P&gt;&lt;P&gt;Below is a scenario where the data is as follows :&lt;/P&gt;&lt;P&gt;id;name;desc&lt;/P&gt;&lt;P&gt;1;"Mark";"Test Data"&lt;/P&gt;&lt;P&gt;2;"David";"Test Data Ha.""&lt;/P&gt;&lt;P&gt;3;"Steven";"\"&lt;/P&gt;&lt;P&gt;Here , within the data there are double quotes and a backslash within the desc column and I'm unable to escape both and hence the data is being shifted.&lt;/P&gt;&lt;P&gt;Ex : Escape char -&lt;/P&gt;&lt;P&gt;&lt;B&gt;""" &lt;/B&gt;&lt;/P&gt;&lt;P&gt;Text Enclosure -&lt;/P&gt;&lt;P&gt;&lt;B&gt;"""&lt;/B&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0693p00000AaIGBAA3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137530iF9C2E43BABA1D8C7/image-size/large?v=v2&amp;amp;px=999" role="button" title="0693p00000AaIGBAA3.png" alt="0693p00000AaIGBAA3.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Escape char - "\\" Text Enclosure - """&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0693p00000AaIGaAAN.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143194i72A8EB0E7989DC0F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0693p00000AaIGaAAN.png" alt="0693p00000AaIGaAAN.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Also, the input files are huge (30+ GB) so would like to avoid having to use script. Other escape characters were also tried , but with no success. ("\"")&lt;/P&gt;&lt;P&gt;Any suggestions?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 01:01:05 GMT</pubDate>
    <dc:creator>Gunner49</dc:creator>
    <dc:date>2024-11-16T01:01:05Z</dc:date>
    <item>
      <title>Escape and Text Enclosure for csv</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310248#M81390</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I'm currently facing an issue while reading an input csv file where the data is enclosed within double quotes.&lt;/P&gt;&lt;P&gt;Below is a scenario where the data is as follows :&lt;/P&gt;&lt;P&gt;id;name;desc&lt;/P&gt;&lt;P&gt;1;"Mark";"Test Data"&lt;/P&gt;&lt;P&gt;2;"David";"Test Data Ha.""&lt;/P&gt;&lt;P&gt;3;"Steven";"\"&lt;/P&gt;&lt;P&gt;Here , within the data there are double quotes and a backslash within the desc column and I'm unable to escape both and hence the data is being shifted.&lt;/P&gt;&lt;P&gt;Ex : Escape char -&lt;/P&gt;&lt;P&gt;&lt;B&gt;""" &lt;/B&gt;&lt;/P&gt;&lt;P&gt;Text Enclosure -&lt;/P&gt;&lt;P&gt;&lt;B&gt;"""&lt;/B&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0693p00000AaIGBAA3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137530iF9C2E43BABA1D8C7/image-size/large?v=v2&amp;amp;px=999" role="button" title="0693p00000AaIGBAA3.png" alt="0693p00000AaIGBAA3.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Escape char - "\\" Text Enclosure - """&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0693p00000AaIGaAAN.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143194i72A8EB0E7989DC0F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0693p00000AaIGaAAN.png" alt="0693p00000AaIGaAAN.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Also, the input files are huge (30+ GB) so would like to avoid having to use script. Other escape characters were also tried , but with no success. ("\"")&lt;/P&gt;&lt;P&gt;Any suggestions?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 01:01:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310248#M81390</guid>
      <dc:creator>Gunner49</dc:creator>
      <dc:date>2024-11-16T01:01:05Z</dc:date>
    </item>
    <item>
      <title>Re: Escape and Text Enclosure for csv</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310249#M81391</link>
      <description>&lt;P&gt;@Nauman Hussain​&amp;nbsp;, you do one thing after tfileinputdelimited use the tJavarow or tMap to replace " with empty ,so that you can avoid the extra " .&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Manohar&lt;/P&gt;</description>
      <pubDate>Wed, 02 Dec 2020 14:43:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310249#M81391</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-12-02T14:43:48Z</dc:date>
    </item>
    <item>
      <title>Re: Escape and Text Enclosure for csv</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310250#M81392</link>
      <description>&lt;P&gt;Hi @Manohar B​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you.&lt;/P&gt;&lt;P&gt;That's an option for the 2nd record , but the 3rd record with desc column "\" is being shifted.&lt;/P&gt;&lt;P&gt;I can remove the double quote, but is there a way to handle the 3rd record ?&lt;/P&gt;&lt;P&gt;Here, the escape character and text enclosure are used as &lt;B&gt;"""&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Input :&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Output&lt;/B&gt; :&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Nauman&lt;/P&gt;</description>
      <pubDate>Wed, 02 Dec 2020 16:54:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Escape-and-Text-Enclosure-for-csv/m-p/2310250#M81392</guid>
      <dc:creator>Gunner49</dc:creator>
      <dc:date>2020-12-02T16:54:53Z</dc:date>
    </item>
  </channel>
</rss>

