<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [resolved] CSV Input Problem in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206089#M6293</link>
    <description>great , put post as resolved if it's the case 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MA9p.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138034i5F552429DA646D6F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MA9p.png" alt="0683p000009MA9p.png" /&gt;&lt;/span&gt;</description>
    <pubDate>Thu, 22 Oct 2015 13:28:45 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-10-22T13:28:45Z</dc:date>
    <item>
      <title>[resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206080#M6284</link>
      <description>Hello,
&lt;BR /&gt;I have a problem with my CSV Files, I cannot use tFileInputDelimited to input data from a CSV File without any problems.
&lt;BR /&gt;The CSV File got ~1million rows and 80 columns.
&lt;BR /&gt;Some Rows are comments that are just one big string and some Columns also contain comments.
&lt;BR /&gt;When using tFileInputDelimited I always get the Error: "For input string: "(whole comment)""
&lt;BR /&gt;The problem is that I want these "CommentRows" to be ignored, is there any way to do that?
&lt;BR /&gt;Thanks for your help.</description>
      <pubDate>Sat, 16 Nov 2024 10:58:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206080#M6284</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:58:51Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206081#M6285</link>
      <description>Hi AWeller,&lt;BR /&gt;&lt;BLOCKQUOTE&gt;&lt;TABLE border="1"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Some Rows are comments that are just one big string and some Columns also contain comments.&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;Could you show us some sample of your input source, please?&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Thu, 22 Oct 2015 08:31:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206081#M6285</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T08:31:34Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206082#M6286</link>
      <description>Hi Sabrina, 
&lt;BR /&gt;I can't post data of these directly, so i will just make 2 example rows that look like the csv rows. 
&lt;BR /&gt;The Columns are separated by pipe (|) 
&lt;BR /&gt;My problem is that the first column should be a number, but some rows contain strings instead of numbers and i want to know if there is any possibility to filter these rows out of the input data so that i won't get an error like "For input string:..." 
&lt;BR /&gt;row 1: 
&lt;BR /&gt;ijfioejfiwehfiwoefewifewifewifwhrtherhezthtzrehjztrjtzrjtzrjztrjzjtzjztjtzjztfjftjtfrjrjrtjtrjtrjtrjtrjttjtjtzjtzjttzjztjtzjztjtzjztjtzjj||0|N|4|5|30||||1|43242523|Z|K|54350435345|gireginegioerjnhgiergjhrie|16.11.10.|fjf_fe|FA|mfioehfoiuwehfeowifjhie 
&lt;BR /&gt;row 2: 
&lt;BR /&gt;190226546503|1|14.05.11|1|1|1|1|61454500|613854545500545427801|1|0|0|4197156446|||9756407|234634010|0|6451|7641|266432,56|||||||0|||51|71|2112,56||||||32||242,56|||||||0|||23442,56||1||||332||trtrtrtrtr 
&lt;BR /&gt;Thanks, 
&lt;BR /&gt;Arno</description>
      <pubDate>Thu, 22 Oct 2015 08:50:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206082#M6286</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T08:50:20Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206083#M6287</link>
      <description>hi, 
&lt;BR /&gt;could you read all fields as String, store somewhere and filter your data 'later' ? 
&lt;BR /&gt;it's often easier&amp;amp;helpful to extract raw data as String only. 
&lt;BR /&gt;If you don't have to do some calculation on your data, never mind to read number as a string 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MA9p.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138034i5F552429DA646D6F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MA9p.png" alt="0683p000009MA9p.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;regards</description>
      <pubDate>Thu, 22 Oct 2015 09:00:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206083#M6287</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T09:00:22Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206084#M6288</link>
      <description>Thanks! I'm already trying this it seems to work 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAB6.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/158321i00588DF41617C922/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAB6.png" alt="0683p000009MAB6.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Now I have another problem, I wanna connect those 2 CSV Files with tMap and output one combined CSV File, but it seems to be too much Data because it needs extraordinary long and only gets ~1462 rows/s and stops after 150 seconds with the Error 
&lt;BR /&gt;"Exception in thread "main" java.lang.OutOfMemoryError: GC overhead limit exceeded" 
&lt;BR /&gt;Any idea on that? 
&lt;BR /&gt;Regards</description>
      <pubDate>Thu, 22 Oct 2015 09:25:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206084#M6288</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T09:25:39Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206085#M6289</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt; I wanna connect those 2 CSV Files with tMap and output one combined CSV File&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;what kind of 'combination' ? are you doing some mapping, join filter. ? 
&lt;BR /&gt;Do you need all the data to be in the tMap ? 
&lt;BR /&gt;First thing,&amp;nbsp; it's to manage Only the data you need =&amp;gt; filter, extract data to keep the right ones. 
&lt;BR /&gt;You can also increase allocated memory of your jvm. 
&lt;BR /&gt;GC overhead limit exceeded mean that Garbage collector is working too hard, too often 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;regards 
&lt;BR /&gt;laurent</description>
      <pubDate>Thu, 22 Oct 2015 09:45:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206085#M6289</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T09:45:07Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206086#M6290</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;what kind of 'combination' ? are you doing some mapping, join filter. ?&lt;BR /&gt;Do you need all the data to be in the tMap ?&lt;BR /&gt;First thing,&amp;nbsp; it's to manage Only the data you need =&amp;gt; filter, extract data to keep the right ones.&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Join filter. 
&lt;BR /&gt;No, i'm filtering Data in the tMap, seems to be a bad idea 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MA5A.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143082iB236712184B767DA/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MA5A.png" alt="0683p000009MA5A.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;What components are good for filtering? I'm trying tFilterColumns and tFilterRow. 
&lt;BR /&gt;Thanks for all that help!</description>
      <pubDate>Thu, 22 Oct 2015 10:13:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206086#M6290</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T10:13:30Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206087#M6291</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt; i'm filtering Data in the tMap, seems to be a bad idea &lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;it's depend on several things ... 
&lt;BR /&gt;but if you're made some join, it's better to filter before tMap. 
&lt;BR /&gt;be aware that all data from lookup flow are store in memory by default . it could be a reason for your "out of memory". 
&lt;BR /&gt; 
&lt;BR /&gt;Are you constraint by the memory that you can allocate to your jvm (base on the production environment) ? 
&lt;BR /&gt;A better way, if it's not a "real-time" application , could be filter data in a job &amp;amp; store in another file or tables (I/O with mysql isam engine could be&amp;nbsp; a good solution). 
&lt;BR /&gt;Read the filtering data and join your data in a separate job. 
&lt;BR /&gt;When you got some problem with memory, try to cut treatment in to several jobs. 
&lt;BR /&gt;Using tables to store raws data and make filter with a where clause can also be a solution. 
&lt;BR /&gt;When you haven't got a lot of allocated memory for your jvm, store data in database can help you. 
&lt;BR /&gt;for example by using ELT component to join data you will work with DB engine ressources =&amp;gt; less needed memory for your jvm. 
&lt;BR /&gt;Keep in mind that you have to optimize your Talend job before thinking to increase JVM memory 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MA9p.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138034i5F552429DA646D6F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MA9p.png" alt="0683p000009MA9p.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt; 
&lt;A href="https://community.qlik.com/s/article/ka03p0000006EZuAAM"&gt;https://community.talend.com/t5/Migration-Configuration-and/OutOfMemory-Exception/ta-p/21669?content-lang=en&lt;/A&gt;</description>
      <pubDate>Thu, 22 Oct 2015 10:34:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206087#M6291</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T10:34:05Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206088#M6292</link>
      <description>Well, all worked fine with tFilterColumns before tMap. I even combined 3 CSV's with 6 million, 3 million and 2 million rows into a new CSV with 3 million rows with tmap in 600 seconds. 
&lt;BR /&gt;I only got Memory Warnings like "Warning: to avoid a Memory heap space error the buffer of the flow has been limited to a size of 2000000 , try to reduce the advanced parameter "Max buffer size" (~100000 or at least less than 2000000), then if needed try to increase the JVM Xmx parameter." 
&lt;BR /&gt;I made tMap store the temp Data in a extra folder so this worked fine for me. 
&lt;BR /&gt;Thanks!</description>
      <pubDate>Thu, 22 Oct 2015 13:01:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206088#M6292</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T13:01:26Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] CSV Input Problem</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206089#M6293</link>
      <description>great , put post as resolved if it's the case 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MA9p.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138034i5F552429DA646D6F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MA9p.png" alt="0683p000009MA9p.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Thu, 22 Oct 2015 13:28:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-CSV-Input-Problem/m-p/2206089#M6293</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-22T13:28:45Z</dc:date>
    </item>
  </channel>
</rss>

