Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Reading only xml file from un-archived folder through tfilelist

Hi All ,
I am reading zip files from a folder unzipping that using tFileUnarchive. After Unzipping , i am reading only xml files using tfilelist after tFileUnarchive. I am iterating 2 files in parallel. But when i do so my whole process runs as many times as many are the files.
My workflow looks like this:
 tfilelist --> tfileunarchive --> tfilelist --> tfileinputxml --> tlogrow
i have 2 zip folders one contains 2 files (1 xml and 1jpeg) and other zip contains ( 1 xml ). I only want to process xml files after unzipping. When i iterate it with number of parallel execution as 2 it runs the workflow twice for 3 files.

Output that i am getting through this workflow is :
Process finished
273985234|ZO|1237351|JM|TestData|99
273985234|ZO|1237351|JM|TestData|99
273985234|ZO|1237351|JM|TestData|99
273985234|XO|1237161|SM|TestData|34
273985234|XO|1237161|SM|TestData|34
273985234|XO|1237161|SM|TestData|34

Expected Output:
273985234|ZO|1237351|JM|TestData|99
273985234|XO|1237161|SM|TestData|34

Please find attached screenshot of the workflow.

Thanks,
Saurabh.
0683p000009MGaV.png

Labels (2)
5 Replies
Anonymous
Not applicable
Author

Hi,
A parallelization-enabled Iterate connection allows the component that receives threads from the connection to read those threads in parallel.
Have you set any file mask in tFileList_2?(*.xml)?


Best regards
Sabrina
Anonymous
Not applicable
Author

Yes i have set the mask as "*.xml"
Anonymous
Not applicable
Author

Hi,

A parallelization-enabled Iterate connection allows the component that receives threads from the connection to read those threads in parallel.

Please disable option "Enable parallel exection" in iterate row to see if it works.
Best regards
Sabrina
Anonymous
Not applicable
Author

No it still process the files twice.Only when i remove tfilelist_2 it process it once. But in that case it process even the image files that are their in unzipped folder and gives me error for those.
Anonymous
Not applicable
Author

It only works when i introduce delete component , but even for that it works only with  "Enable parallel exection" =1 
I am attaching the config details of tfilelist_2 and tfiledelete_1. Output that i am getting using the specified configurations is:

[font=Verdana, Helvetica, Arial, sans-serif] Starting job prcs_zip_chronological_order at 11:23 15/07/2015.[/font]


[font=Verdana, Helvetica, Arial, sans-serif][size=1][statistics] connecting to socket on port 3941[/size][/font]
[font=Verdana, Helvetica, Arial, sans-serif][size=1][statistics] connected[/size][/font]
[font=Verdana, Helvetica, Arial, sans-serif] Processing archive E:\Blogtestdata\abctest.zip, please wait...[/font]


[font=Verdana, Helvetica, Arial, sans-serif] Process finished[/font]
[font=Verdana, Helvetica, Arial, sans-serif] 273985234|XO|1237161|SM|TestData|34[/font]
[font=Verdana, Helvetica, Arial, sans-serif] Processing archive E:\Blogtestdata\cdftest.zip, please wait...[/font]


[font=Verdana, Helvetica, Arial, sans-serif] Process finished[/font]
[font=Verdana, Helvetica, Arial, sans-serif] 273985234|ZO|1237351|JM|TestData|99[/font]
[font=Verdana, Helvetica, Arial, sans-serif][size=1][statistics] disconnected[/size][/font]

[font=Verdana, Helvetica, Arial, sans-serif][size=1]Job prcs_zip_chronological_order ended at 11:23 15/07/2015. [exit code=0][/size][/font]
Thanks,
Saurabh.
0683p000009MGWF.png 0683p000009MGaa.png