Anonymous
Not applicable

Reading Flat Large file (2GB) in Big data job (spark)

I am trying to process a 2 GB flat file using a Spark big data job.

It takes a very long time (3+) just to read the file.

I have also updated the number of executors and the executor memory (reluctantly); that doesn't work either.

Any suggestions are appreciated.

 

(screenshot attached: 0683p000009Ma0s.png)
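One common reason a 2 GB read stalls is that the input lands in too few partitions (for example, a single gzip file is not splittable, so it is read by one task no matter how many executors exist). The sketch below is a rough sizing aid, not Talend- or EMR-specific code; the 128 MB target is an assumed block-sized partition, and the file-size value is taken from the question.

```python
# Rough partition sizing for reading a ~2 GB flat file with Spark.
# Assumption: the file is splittable (plain text/CSV, not a single .gz);
# a lone gzip file forces one task regardless of executor count.

FILE_SIZE_MB = 2048          # ~2 GB input, as described in the question
TARGET_PARTITION_MB = 128    # assumed block-sized partition target

def suggested_partitions(file_size_mb: int,
                         partition_mb: int = TARGET_PARTITION_MB) -> int:
    """Return a partition count keeping each task near partition_mb of data."""
    return max(1, -(-file_size_mb // partition_mb))  # ceiling division

print(suggested_partitions(FILE_SIZE_MB))  # → 16 tasks of ~128 MB each
```

If the read produces far fewer tasks than this estimate, repartitioning after the read (or fixing the input format so it is splittable) usually helps more than adding executors.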

5 Replies
manodwhb
Champion II

@uganesh, how are you executing your job? Are you executing it from Studio?

Anonymous
Not applicable
Author

@manodwhb, thanks for the quick response.

I am running the job on a Remote Engine (submitted from Studio), hosted in my AWS environment and connecting to an EMR cluster.

manodwhb
Champion II

@uganesh, so you built the job, copied the resulting zip file to the Remote Engine, and executed the .sh file?

Anonymous
Not applicable
Author

@manodwhb:

Below is a screenshot of my Studio.

Studio --publish--> RE --> EMR

 

(screenshot attached: 0683p000009Ma0x.png)

manodwhb
Champion II

@uganesh, are all executors being utilized?
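If the Spark UI shows only one busy executor, the read is probably confined to a single partition rather than limited by memory. The snippet below is a hedged illustration of how executor-related Spark properties might be set for an EMR-backed job; the property names are standard Spark configuration keys, but the values are assumptions that must be sized to the actual cluster nodes.

```python
# Illustrative Spark properties for an EMR-backed big data job.
# Property names are standard Spark configuration keys; the values
# below are example assumptions, not recommendations for any cluster.

spark_conf = {
    "spark.executor.instances": "4",    # e.g. one executor per core node
    "spark.executor.cores": "4",
    "spark.executor.memory": "8g",
    "spark.default.parallelism": "16",  # ~ total cores, so the read fans out
}

# Render as spark-submit --conf flags (or paste into the job's Spark
# configuration tab in Studio).
for key, value in spark_conf.items():
    print(f"--conf {key}={value}")
```

Raising executor counts only helps if the input is split into enough partitions to keep them all busy.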