I am trying to load an input file into Redshift, and I want to split the file round-robin before loading it so that the load makes use of the multiple slices in my cluster. How do I split an input into n outputs in round-robin fashion using Talend?
Ex:
Input:
id name
1 Jon
2 Anne
3 Cole
4 Zack
5 Ellen
Output:
Main1
1 Jon
4 Zack
Main2
2 Anne
5 Ellen
Main3
3 Cole
Thank you for the reply. I thought about doing that, but I actually need 6 outputs (I used 3 in my question to simplify the problem). With that method, an id divisible by 6 satisfies rowX.id % 3 == 0, rowX.id % 2 == 0 and rowX.id % 6 == 0 all at once, so the filters overlap. I can't think of a simple filter that splits the rows 6 ways.
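One way around the overlap is to keep the divisor fixed at n and vary the remainder instead: every number has exactly one remainder modulo n, so the tests "% n == 0", "% n == 1", ..., "% n == n - 1" never match the same row twice. The plain Java sketch below illustrates this outside Talend. The class and method names are hypothetical, and it uses the row's position rather than its id, on the assumption that ids may have gaps. In a Talend job the same remainder test could presumably be used as the filter expression on each output, with a running row counter in place of the list index; that wiring is an assumption and is not shown here.

```java
import java.util.ArrayList;
import java.util.List;

public class RoundRobinSplit {

    // Hypothetical helper: row at position i goes to bucket i % n,
    // so each row lands in exactly one of the n buckets.
    static <T> List<List<T>> split(List<T> rows, int n) {
        if (n <= 0) {
            throw new IllegalArgumentException("n must be positive");
        }
        List<List<T>> buckets = new ArrayList<>();
        for (int b = 0; b < n; b++) {
            buckets.add(new ArrayList<>());
        }
        for (int i = 0; i < rows.size(); i++) {
            buckets.get(i % n).add(rows.get(i));
        }
        return buckets;
    }

    public static void main(String[] args) {
        // Sample rows taken from the question.
        List<String> rows = List.of("1 Jon", "2 Anne", "3 Cole", "4 Zack", "5 Ellen");

        // Three outputs, as in the simplified example.
        List<List<String>> three = split(rows, 3);
        for (int b = 0; b < three.size(); b++) {
            System.out.println("Main" + (b + 1) + ": " + three.get(b));
        }

        // Six outputs: same code, only n changes.
        List<List<String>> six = split(rows, 6);
        for (int b = 0; b < six.size(); b++) {
            System.out.println("Main" + (b + 1) + ": " + six.get(b));
        }
    }
}
```

With n = 3 this reproduces the Main1/Main2/Main3 grouping from the question. With n = 6 and only five sample rows, each of the first five outputs gets one row and the sixth stays empty.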