Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Configuring tHDFSConnection to connect to Amazon EMR

Could someone provide an example or link to an example of how to configure the tHdfsConnection component to connect to an existing Amazon EMR cluster? I am using Talend Open Studio for Big Data (5.4.1) on windows 7 laptop.
thanks!

Labels (2)
6 Replies
Anonymous
Not applicable
Author

Hi,
Have you checked component reference TalendHelpCenter:tHDFSConnection firstly.
Did you have some issue on it?
Best regards
Sabrina
Anonymous
Not applicable
Author

Yes, we researched the tHDFSConnection component. However, we were unable to get it to connect to the remote Amazon EMR HDFS. The only way we found to make it work was to create an ssh tunnel for the namenode uri. Is there a better approach than this? Thanks!
Anonymous
Not applicable
Author

No one replied to this so i am reposting:
Yes, we researched the tHDFSConnection component. However, we were unable to get it to connect to the remote Amazon EMR HDFS. The only way we found to make it work was to create an ssh tunnel for the namenode uri. Is there a better approach than this? Thanks!
_AnonymousUser
Specialist III
Specialist III

No one replied to this so i am reposting:
Yes, we researched the tHDFSConnection component. However, we were unable to get it to connect to the remote Amazon EMR HDFS. The only way we found to make it work was to create an ssh tunnel for the namenode uri. Is there a better approach than this? Thanks!
Anonymous
Not applicable
Author

Hello,
Which version of Hadoop do you use on the EMR side?
_AnonymousUser
Specialist III
Specialist III

We use Hadoop version 1.0.3