Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

what is the correct way to enable parallelization for XML extract?

0683p000009Lrvu.jpg0683p000009LrZz.jpgI enabled the parallelization options on XML extract jobs that converts XML fields to pipe delimited text file. But it is causing data jumble. When I do partition and departition option jobs get stuck.

What is the correct way to set partition and departition? Could you please suggest. I am sharing the job screenshot below:

 

 

Labels (4)
2 Replies
Anonymous
Not applicable
Author

Hello,

From your screenshot, you are using tExtractXMLField component in your work flow. Do you want to run synchronously your job for each XML file? Could you please elaborate your case with an example with input and expected output value?

Best regards

Sabrina

Anonymous
Not applicable
Author

Hi Sabrina,

 

For the performance concern I am trying to use this parallelization option. I want extract to be performed parallel so that extract would be fast. The other combination that I tried gets jobs running fine but data jumble issue is there. Suppose timestamp column captures data only for timestamp normally or without this option the way I have in screenshot. May be there is something wrong with the option I selected as screenshot, I don't know but I am getting data jumbled up. Data that doesn't belong to timestamp column is populating to timestamp column which is wrong. Could you please let me know, what is the issue?


Depart.JPG