Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Trying to connect Talend Open Studio for Big Data with Hadoop on AWS

Hi everybody,
I have the Talend Open Studio on my desktop and a Hadoop cluster on Amazon Web Services. I am trying to connect those two, so that I'll be running my Hadoop jobs from Talend. As I learned from tutorials, in order to integrate Talend with Hadoop, I need to set up tHDFSConnection. As I double click on the tHDFSConnection icon on the Job Design space, I get the Component inset at the bottom of the screen. Here I choose the Hadoop distribution (Amazon EMR) and set my user name and my group. I am confused by the "NameNode URI" field though: what should it be? Sorry if my question is trivial, but all info I've found online is not quite helpful.
Labels (2)
1 Reply
willm1
Creator
Creator

Hi kpopov - take a look at this AWS Guide on connecting to AWS HDFS remotely - http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-web-interfaces.html
Scroll to the bottom of the page.
And as the doc indicates, if you still can't directly access your URI, you'd have to do one of the 3 options listed on that page to connect remotely.