We are in the process of setting up a Big Data environment to evaluate Talend Big Data. We are completely free to choose which distribution to use.
Which Big Data distribution is recommended as working best with Talend? By "best" I mean best performance, ease of use, fewest problems with the Talend components, ...
We have tried Hortonworks and Cloudera, and both are OK. Cloudera is well supported if you are aiming to use Spark as an execution engine and if you have a Kerberos security layer.
I know EMR and MapR are supported, but I haven't tried them yet.