Bluemoon
Creator

Multiple empty files are created when loading data into HDFS using Spark

Task:

I have a group of messages in a queue. A consumer reads them, a Spark Streaming job picks the latest record among them, and that record is loaded into HDFS.

 

Issue:

1. I want to save the data to a .csv file, but a numeric suffix is appended to the file name given in the tFileOutput component.

 


  

Example: I want to save the data in maindata.csv, but the job creates a folder named maindata.csv-1522775132000 and saves the data inside that folder.

2. The job creates 14 empty partition files and writes all the data into the 15th partition file.

 

Expected Output:

1. Can I write the data into a single file named maindata.csv?

2. Can I set the number of partitions according to the data?

 

Thanks in advance!!

1 Reply
Anonymous
Not applicable

One option for Issue 1 is to check the 'Merge result to single file' option in the tFileOutputDelimited component properties, then set the 'Merge File Path' property to the file path for maindata.csv.

This creates a single file with the name and path you choose, containing the data from all the part- files merged together. Optionally, you can remove the source directory and/or overwrite the target file.
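To make the merge step concrete, here is a minimal Python sketch of the same idea on a local filesystem. It is not Talend's implementation: the function name `merge_part_files` and its parameters are hypothetical, and on HDFS you would more likely use the component option above or a command such as `hdfs dfs -getmerge`. It assumes the output folder contains files named part-*, concatenates them in name order, and skips empty ones.

```python
import shutil
from pathlib import Path


def merge_part_files(source_dir, target_file, remove_source=False, overwrite=True):
    """Concatenate every part-* file in source_dir into target_file.

    Hypothetical helper for illustration only; returns the number of
    non-empty part files that were copied.
    """
    source = Path(source_dir)
    target = Path(target_file)
    if target.exists() and not overwrite:
        raise FileExistsError(f"{target} already exists")

    # Sort by name so part-00000, part-00001, ... keep their order.
    parts = sorted(p for p in source.glob("part-*") if p.is_file())

    merged = 0
    with target.open("wb") as out:
        for part in parts:
            if part.stat().st_size == 0:
                continue  # empty partition: nothing to copy
            with part.open("rb") as src:
                shutil.copyfileobj(src, out)
            merged += 1

    # Mirrors the optional "remove source directory" behaviour.
    if remove_source:
        shutil.rmtree(source)
    return merged
```

For the case in this thread, a call like `merge_part_files("maindata.csv-1522775132000", "maindata.csv")` would copy the single non-empty part file into maindata.csv and ignore the 14 empty ones. The target file should sit outside the source folder, especially when `remove_source` is set.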

 

Hope this helps.