Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Talend Cloud AWS EU Scheduled Outage: Starting Tues 26 May 21:00 CEST with expected completion Wed 27 May 01:00 CEST
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Connection to HDFS on tHDFSInput takes a long time (HDFS HA)

Hello,
I have a kerberized High Availability HDFS cluster setup and followed the instructions here for setting up my connection to the HA cluster:

When configuring the Authentication settings in a tHDFSInput component after changing to the _HOST value the job ran much slower than when using the fully qualified host name.
Here is the log file from job in question.
< <


To see the whole post, download it here
Labels (5)
1 Reply
Anonymous
Not applicable
Author

Hi
There is a KB article about enabling the HDFS High Avaliability feature in the Studio, I hope it can help you.
: org.apache.hadoop.util.Shell - Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

It is a well-know error, see jlolling's https://community.talend.com/t5/Design-and-Development/Talentd-Big-Data-5-5-1-on-windows-against-clo... in this topic.
Best regards
Shong