Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Is there a way to run talend spark subjobs in parallel?.
Hi,
When you trigger a Spark BigData sub job, you are providing an independent process to run the sub job.
Talend jobs will not allow to run a Spark sub job without selecting the option "Use an independent process to run subjob".
You can very well trigger them using tparallelize component from a DI job (just to maintain the orchestration). But tparallelize component cannot be used in Talend Big Data jobs.
Warm Regards,
Nikhil Thampi
Hi,
When you trigger a Spark BigData sub job, you are providing an independent process to run the sub job.
Talend jobs will not allow to run a Spark sub job without selecting the option "Use an independent process to run subjob".
You can very well trigger them using tparallelize component from a DI job (just to maintain the orchestration). But tparallelize component cannot be used in Talend Big Data jobs.
Warm Regards,
Nikhil Thampi
So the solution is to put two subjobs into different talend spark jobs.
Call these two spark jobs in one talend standard job using the trun component.
set the trun component property to 'run as an independent process'