Skip to main content
Announcements
Introducing Qlik Answers: A plug-and-play, Generative AI powered RAG solution. READ ALL ABOUT IT!
cancel
Showing results for 
Search instead for 
Did you mean: 
nnmh
Partner - Contributor
Partner - Contributor

Spark Jobs in Talend Big Data

I have a problem running any Big Data Batch job in Spark configuration which that any job takes a long time to deploy and start the job.

Every time I started a Big Data Batch job it uploads the job jars to HDFS dir which takes a long time.

Are there any spark configuration is required to avoid uploading these jars every time the job starts?

Labels (2)
2 Replies
Anonymous
Not applicable

Hello @Moataz Nader​ ,

What's the Spark Mode in your job's RUN/Spark Configuration?

How about to change it to 'Yarn Cluster' to see if the performance is better?

0695b00000htMGcAAM.png 

Best regards

Aiming

 

nnmh
Partner - Contributor
Partner - Contributor
Author

Dear @Aiming Chen​ 

 

Spark Mode in my job's RUN/Spark Configuration

0695b00000htMzIAAU.pngEvery time it runs it take a lot of time to upload its jars on HDFS