<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Slow tFilelist? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374109#M136752</link>
    <description>Job logic is pretty simple ( in the test-job i created for finding the problem &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt; &lt;BR /&gt;There are some things coming from a context.&lt;BR /&gt;tJava_1 : &lt;BR /&gt;if context property is null then the filemask should be different from when the context property is filled.&lt;BR /&gt;tFileList_1 : &lt;BR /&gt;search for all files with the filemask specified in the tJava_1 property (this takes 30 minutes in this example)&lt;BR /&gt;tJava_2 : &lt;BR /&gt;print the last record found&lt;BR /&gt;tFileExist_1 : &lt;BR /&gt;the start of the job if there is a last file.&lt;BR /&gt;In this example I was searching without a context property so the filemask should be : class.*&lt;BR /&gt;The file-specs are : &lt;BR /&gt;Total files : 16655&lt;BR /&gt;class.* files : 486&lt;BR /&gt;I don't see where the 30 minutes goes in this job. So there is no opening / closing of files involved. All the files on the server are closed</description>
    <pubDate>Tue, 17 Jan 2012 12:38:13 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-01-17T12:38:13Z</dc:date>
    <item>
      <title>Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374107#M136750</link>
      <description>Hello, 
&lt;BR /&gt;I have a job which needs the latest file from a directory with a lot of files ( over 56.000 with about 700 with the same filemask that I am searching for ). 
&lt;BR /&gt;The file I need is searchable and contains a datetimestamp in the file ( but not always from today or yesterday ). 
&lt;BR /&gt;On a local disk it runs adequate ( it finds the file in about 2 sec ) but if i try it on a windows share which has the files it takes over 40 minutes. What's wrong with it. 
&lt;BR /&gt;The filename I'm searching for is : "test." + context.customernumber + "*.txt" 
&lt;BR /&gt;with the settings sorted by date desc and then a iterate to a tjava which sets a globalvar if it's unset else it does nothing (so this way I get the latest file) 
&lt;BR /&gt;I have tried sorted by date asc and then keeping the last iteration but the time remains the same. 
&lt;BR /&gt;The setup : 
&lt;BR /&gt;Client (which runs Talend) Win7 
&lt;BR /&gt;Server (which has the files on a samba share) windows 2003 server 
&lt;BR /&gt;I am almost desperate enough to create a subjob which gets the complete filelisting unsorted and then sort them in the subjob. But I don't think this is the correct way to go.</description>
      <pubDate>Sat, 16 Nov 2024 12:26:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374107#M136750</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:26:51Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374108#M136751</link>
      <description>Hi
&lt;BR /&gt;Welcome to Talend Community!
&lt;BR /&gt;Could you explain in detail about your job logic?
&lt;BR /&gt;I need to know what the job will do if there is a latest file. It will copy this file or move it?
&lt;BR /&gt;Sometimes when Talend job try to handle a file(e.g. Excel) which is opend by other user, the job will wait until the file is not in use.
&lt;BR /&gt;Or i miss some detail?
&lt;BR /&gt;Regards,
&lt;BR /&gt;Pedro</description>
      <pubDate>Tue, 17 Jan 2012 08:27:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374108#M136751</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-17T08:27:54Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374109#M136752</link>
      <description>Job logic is pretty simple ( in the test-job i created for finding the problem &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt; &lt;BR /&gt;There are some things coming from a context.&lt;BR /&gt;tJava_1 : &lt;BR /&gt;if context property is null then the filemask should be different from when the context property is filled.&lt;BR /&gt;tFileList_1 : &lt;BR /&gt;search for all files with the filemask specified in the tJava_1 property (this takes 30 minutes in this example)&lt;BR /&gt;tJava_2 : &lt;BR /&gt;print the last record found&lt;BR /&gt;tFileExist_1 : &lt;BR /&gt;the start of the job if there is a last file.&lt;BR /&gt;In this example I was searching without a context property so the filemask should be : class.*&lt;BR /&gt;The file-specs are : &lt;BR /&gt;Total files : 16655&lt;BR /&gt;class.* files : 486&lt;BR /&gt;I don't see where the 30 minutes goes in this job. So there is no opening / closing of files involved. All the files on the server are closed</description>
      <pubDate>Tue, 17 Jan 2012 12:38:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374109#M136752</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-17T12:38:13Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374110#M136753</link>
      <description>Ok, some further investigation revealed that tFilelist with sort-options set is terribly slow.&lt;BR /&gt;It's about 100x faster to build a tfilelist (without sorting) -&amp;gt; tfileinfo -&amp;gt; tsortrow than to use the sorting possibiilities on the tfilelist settings.</description>
      <pubDate>Fri, 20 Jan 2012 10:09:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374110#M136753</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-20T10:09:10Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374111#M136754</link>
      <description>tFileList seems to have some problem when working with network paths... I have a directory containing about 2k files and tFileList freezes in spite of the very good latency time of the connection... I suppose it is a bug?</description>
      <pubDate>Mon, 05 Mar 2012 15:38:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374111#M136754</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-05T15:38:05Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374112#M136755</link>
      <description>@nc : What kind of network connection are you using? I was using windows-UNC paths (so I guess it uses the SMB-components). 
&lt;BR /&gt;If you are using FTP or some other network connection the problem may be somewhere else...</description>
      <pubDate>Tue, 06 Mar 2012 19:03:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374112#M136755</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-06T19:03:18Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374113#M136756</link>
      <description>I'm using a standard windows UNC path as of "\\serverName.domainName.local\sharedDirectory". When I open the UNC path in the windows explorer I see the list of files in a flash and I'm able to walk in each directory without any delay... In spite of the above, when I try to print the directory list with a simple job as of "tFileList-&amp;gt;tLogRow" I have to wait many minutes...</description>
      <pubDate>Wed, 07 Mar 2012 10:49:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374113#M136756</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-07T10:49:49Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374114#M136757</link>
      <description>Just one note: the "order by" and "order action" setting are left on their default value.&lt;BR /&gt;I didn't reported well the simple job to test the behavior: it's "tFileList-&amp;gt;tIterateToFlow-&amp;gt;tLogRow".&lt;BR /&gt;Thanks,&lt;BR /&gt;N.</description>
      <pubDate>Wed, 07 Mar 2012 14:17:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374114#M136757</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-07T14:17:44Z</dc:date>
    </item>
    <item>
      <title>Re: Slow tFilelist?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374115#M136758</link>
      <description>&lt;P&gt;It's faster to list files using system command than using tfilelist.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jul 2018 10:45:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Slow-tFilelist/m-p/2374115#M136758</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-07-25T10:45:51Z</dc:date>
    </item>
  </channel>
</rss>

