<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [resolved] tStatCatcher/tFlowMeter for file iterations in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283415#M57337</link>
    <description>Ok it works,
&lt;BR /&gt;I only had to trigger the tFlowMeterCatcher step after each file iteration, else it would only trigger at the end of processing and store only the last filname processed.
&lt;BR /&gt;Thanks a lot!
&lt;BR /&gt;Chris</description>
    <pubDate>Wed, 14 Oct 2009 10:32:05 GMT</pubDate>
    <dc:creator>_AnonymousUser</dc:creator>
    <dc:date>2009-10-14T10:32:05Z</dc:date>
    <item>
      <title>[resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283411#M57333</link>
      <description>Hi,
&lt;BR /&gt;If I use the stat catcher and/or flowmeter, and I am iterating through a set if input files, will the counts be available for each file or will the count be for the combined total for all files that are looped through?
&lt;BR /&gt;Thanks in advance,</description>
      <pubDate>Sat, 16 Nov 2024 14:07:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283411#M57333</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T14:07:42Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283412#M57334</link>
      <description>Hello 
&lt;BR /&gt;The 'count' column will compute the row number for each file. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt; shong</description>
      <pubDate>Fri, 12 Dec 2008 02:15:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283412#M57334</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2008-12-12T02:15:41Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283413#M57335</link>
      <description>Hi, 
&lt;BR /&gt;I'm having a similar situation and I'd like to store the filename in addition to the row count in the stat table. Is there a way to add custom fields to the FlowMeter default schema ? E.g., add a "filename" field in addition to "moment", "pid", "count", etc. Or is there another way to achieve the same result ? 
&lt;BR /&gt;Thanks, 
&lt;BR /&gt;Chris</description>
      <pubDate>Tue, 13 Oct 2009 19:00:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283413#M57335</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2009-10-13T19:00:30Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283414#M57336</link>
      <description>Hello 
&lt;BR /&gt;You can add a custom field on tMap component, eg:
&lt;BR /&gt;tFlowMeterCatcher--tMap--tMysqlOutput
&lt;BR /&gt;On tMap, you can add custom field and set its value.
&lt;BR /&gt;Best regards
&lt;BR /&gt; 
&lt;BR /&gt; shong</description>
      <pubDate>Wed, 14 Oct 2009 03:39:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283414#M57336</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-14T03:39:38Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283415#M57337</link>
      <description>Ok it works,
&lt;BR /&gt;I only had to trigger the tFlowMeterCatcher step after each file iteration, else it would only trigger at the end of processing and store only the last filname processed.
&lt;BR /&gt;Thanks a lot!
&lt;BR /&gt;Chris</description>
      <pubDate>Wed, 14 Oct 2009 10:32:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283415#M57337</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2009-10-14T10:32:05Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283416#M57338</link>
      <description>Ok this is not so simple, I have a problem with multi-threading now. Here is a description of the situation: 
&lt;BR /&gt;I'm using TOS 3.2.0M1, developing under Windows then executing on CentOS 5.3 64-bit. 
&lt;BR /&gt;The job i'm running is visible 
&lt;A href="http://img383.imageshack.us/img383/7223/talendy.png" target="_blank" rel="nofollow noopener noreferrer"&gt;here&lt;/A&gt;. Basically the goal is to open a number of binary files (Browse_Files) from different directories (Browse_Dirs), process them through a custom java step (Extract_Raw_Tickets), create csv files from the output (Fill_CSV), then load these csv files into an oracle database using sqlldr (Load_CSV). One CSV file is filled with the output from all the files in one directory, then loaded in one shot. The processing is done in parallel (cf the "Iterate x5" on Browse_Dirs). So if I have 10 directories, I have 10 CSV files processed by 5 concurrent threads. 
&lt;BR /&gt;This was all doing fine until I added the statistics collection. I would like to have a table updated in the database with the list of files processed and the number of lines in each one of them. So I added "Count_Rows" to count the lines produced by each source file; then once the file is processed "Get_Stats" is triggered, filename is added with "Get_Filename" and summary row is inserted into table with "Upload_Stats". (I'm not sure whether it's the best way to do this, it seemed to work on a small sample). 
&lt;BR /&gt;However I have encountered two exceptions when running this job: 
&lt;BR /&gt;First one non-fatal, after a few hundred iterations: 
&lt;BR /&gt; 
&lt;PRE&gt;Exception in component tFlowMeterCatcher_1&lt;BR /&gt;java.util.ConcurrentModificationException&lt;BR /&gt;        at java.util.AbstractList$Itr.checkForComodification(AbstractList.java:449)&lt;BR /&gt;        at java.util.AbstractList$Itr.next(AbstractList.java:420)&lt;BR /&gt;        at routines.system.MetterCatcherUtils.getMessages(MetterCatcherUtils.java:160)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFlowMeterCatcher_1Process(PP2_Call_Extract.java:4463)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFileList_1Process(PP2_Call_Extract.java:3680)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFileDelete_1Process(PP2_Call_Extract.java:793)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract$1tJava_1Thread.run(PP2_Call_Extract.java:595)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.runIt(TalendThreadPool.java:159)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.runWork(TalendThreadPool.java:150)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.access$0(TalendThreadPool.java:145)&lt;BR /&gt;        at routines.system.ThreadPoolWorker$1.run(TalendThreadPool.java:122)&lt;BR /&gt;        at java.lang.Thread.run(Thread.java:595)&lt;/PRE&gt; 
&lt;BR /&gt;Second one fatal, a few dozen iterations later: 
&lt;BR /&gt; 
&lt;PRE&gt;Exception in component tOracleBulkExec_1&lt;BR /&gt;java.lang.InterruptedException&lt;BR /&gt;        at java.lang.Object.wait(Native Method)&lt;BR /&gt;        at java.lang.Object.wait(Object.java:474)&lt;BR /&gt;        at java.lang.UNIXProcess.waitFor(UNIXProcess.java:165)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tOracleBulkExec_1Process(PP2_Call_Extract.java:4864)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFileExist_1Process(PP2_Call_Extract.java:4696)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFileList_1Process(PP2_Call_Extract.java:3704)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract.tFileDelete_1Process(PP2_Call_Extract.java:793)&lt;BR /&gt;        at ticketloader.pp2_call_extract_5_0.PP2_Call_Extract$1tJava_1Thread.run(PP2_Call_Extract.java:595)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.runIt(TalendThreadPool.java:159)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.runWork(TalendThreadPool.java:150)&lt;BR /&gt;        at routines.system.ThreadPoolWorker.access$0(TalendThreadPool.java:145)&lt;BR /&gt;        at routines.system.ThreadPoolWorker$1.run(TalendThreadPool.java:122)&lt;BR /&gt;        at java.lang.Thread.run(Thread.java:595)&lt;/PRE&gt; 
&lt;BR /&gt;So it looks like the statistics collection is done in an thread unsafe manner. Or is there a flaw in my design itself ? 
&lt;BR /&gt;Thanks a lot for your help, 
&lt;BR /&gt;Chris</description>
      <pubDate>Thu, 15 Oct 2009 18:34:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283416#M57338</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-15T18:34:17Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283417#M57339</link>
      <description>Hello Chris 
&lt;BR /&gt;I can't open your image, please upload them to our forum directly, don't add a link. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt; 
&lt;BR /&gt; shong</description>
      <pubDate>Fri, 16 Oct 2009 07:09:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283417#M57339</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-16T07:09:39Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283418#M57340</link>
      <description>Ok I just did it, I only had to scale the image down for the forum to accept it.
&lt;BR /&gt;Chris</description>
      <pubDate>Fri, 16 Oct 2009 13:27:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283418#M57340</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-16T13:27:47Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283419#M57341</link>
      <description>Hi there,
&lt;BR /&gt;any idea about this one ?
&lt;BR /&gt;Thanks,
&lt;BR /&gt;Chris</description>
      <pubDate>Wed, 21 Oct 2009 18:21:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283419#M57341</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-21T18:21:24Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283420#M57342</link>
      <description>Hello
&lt;BR /&gt;There are two wrong place in your design job:
&lt;BR /&gt;1) Delete the 'oncomponentok' link, see picture 1.
&lt;BR /&gt;2) The tOracleOutputBulk and tOracleBulkExec components must be used together. see picture 2.
&lt;BR /&gt;Best regards
&lt;BR /&gt; shong</description>
      <pubDate>Thu, 22 Oct 2009 03:48:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283420#M57342</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-22T03:48:57Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283421#M57343</link>
      <description>Hi Shong, 
&lt;BR /&gt;thanks for your reply. 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;1) Delete the 'oncomponentok' link, see picture 1.&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;That's the way I did it at first, but then the RowMeterCatcher fires only after the subjob "Browse Files" finishes; as a consequence, the "Get Filename" mapper reads the file name as it is after subjob termination, i.e., the last file that was processed, and outputs this single filename for each line of statistics. So for instance instead of getting 
&lt;BR /&gt;file1 100 lines 
&lt;BR /&gt;file2 200 lines 
&lt;BR /&gt;file3 300 lines 
&lt;BR /&gt;I get 
&lt;BR /&gt;file3 100 lines 
&lt;BR /&gt;file3 200 lines 
&lt;BR /&gt;file3 300 lines 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;2) The tOracleOutputBulk and tOracleBulkExec components must be used together. see picture 2.&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;I don't understand this one too well. Wouldn't adding another OracleOutputBulk step introduce an additional, redundant, intermediate csv file? Currently the job fills up a csv file with the ouput from several input binary files then load the csv file in one go using sqlldr. This seems to work quite well as long as I don't introduce the statistics steps. Is there a better way to do that under Talend? Are the multithreading issues related to this architecture? 
&lt;BR /&gt;Thanks a lot for your help, 
&lt;BR /&gt;Chris</description>
      <pubDate>Thu, 22 Oct 2009 10:50:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283421#M57343</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-22T10:50:16Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283422#M57344</link>
      <description>Hi again Shong, 
&lt;BR /&gt;do you have any input regarding the points above? Your help is definitely appreciated! 
&lt;BR /&gt;Thanks, 
&lt;BR /&gt;Chris</description>
      <pubDate>Tue, 27 Oct 2009 17:24:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283422#M57344</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-10-27T17:24:28Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283423#M57345</link>
      <description>Hi there again, 
&lt;BR /&gt;sorry to insist but does anyone has any insight on the issue at hand? Is there a flaw in TOS multithreading support or should the job be designed differently? 
&lt;BR /&gt;Thanks to all for your help, 
&lt;BR /&gt;Chris</description>
      <pubDate>Tue, 03 Nov 2009 10:04:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283423#M57345</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-11-03T10:04:07Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] tStatCatcher/tFlowMeter for file iterations</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283424#M57346</link>
      <description>Hello Chris 
&lt;BR /&gt;Sorry, I miss this topic yesterday. 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;The processing is done in parallel (cf the "Iterate x5" on Browse_Dirs&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;As I known, there was a bug on the parallel iterate link. So, don't check 'enable parallel execution' option and try if the problem still exsit. 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;as a consequence, the "Get Filename" mapper reads the file name as it is after subjob termination,&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Yes, using the 'oncomponentok' link after tOracleOutput if you want to get the current file name on tMap and output it. 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;I don't understand this one too well. Wouldn't adding another OracleOutputBulk step introduce an additional, redundant, intermediate csv file?&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Yes, you are right. If the file exists, you just need a tOracleBulkExec to read it and bulk insert records into db. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt; 
&lt;BR /&gt; shong</description>
      <pubDate>Wed, 04 Nov 2009 02:30:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-tStatCatcher-tFlowMeter-for-file-iterations/m-p/2283424#M57346</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-11-04T02:30:24Z</dc:date>
    </item>
  </channel>
</rss>

