<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: THiveInput throws an exception:java.io.IOException in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279877#M54895</link>
    <description>Hello, 
&lt;BR /&gt;Here is the explanation: 
&lt;BR /&gt;You have two different ways to connect to hive: the standalone mode and the embedded mode. 
&lt;BR /&gt;The standalone mode is a direct JDBC connection to the Hive server. The Hive server usually runs on the port 10000. 
&lt;BR /&gt;The embedded mode is a kind of indirect connection since a hive server is embedded in your client job. You then connect to the hive metastore through Thrift. The Thrift server doesn't run on the same port. 
&lt;BR /&gt;Finally, in order to fix the issue you have met above, you would have to specify the jobtracker (there is an option in the components). 
&lt;BR /&gt;HTH, 
&lt;BR /&gt;Rémy.</description>
    <pubDate>Sat, 17 Nov 2012 11:20:22 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-11-17T11:20:22Z</dc:date>
    <item>
      <title>THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279872#M54890</link>
      <description>Hello, 
&lt;BR /&gt;I am trying to execute a simple HIVE query with a select statement from Talend. The Hive connection succeeds and the job fails on trying to execute this query. 
&lt;BR /&gt;ENVIRONMENT: 
&lt;BR /&gt;Talend Big Data version: 5.2.0 Windows XP 
&lt;BR /&gt;Connecting to Apache 1.0.0 (Hive 0.9.0), Connection embedded 
&lt;BR /&gt;Thanks in advance for you help! 
&lt;BR /&gt;Exception: 
&lt;BR /&gt;java.io.IOException: Cannot run program "null/bin/hadoop" (in directory "C:\Talend\BigData\TOS_BD-r92826-V5.2.0"): CreateProcess error=2, The system cannot find the file specified 
&lt;BR /&gt; at java.lang.ProcessBuilder.start(Unknown Source) 
&lt;BR /&gt; at java.lang.Runtime.exec(Unknown Source) 
&lt;BR /&gt; at java.lang.Runtime.exec(Unknown Source) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:267) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:133) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1332) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1123) 
&lt;BR /&gt; at org.apache.hadoop.hive.ql.Driver.run(Driver.java:931) 
&lt;BR /&gt; at org.apache.hadoop.hive.service.HiveServer$HiveServerHandler.execute(HiveServer.java:191) 
&lt;BR /&gt; at org.apache.hadoop.hive.jdbc.HiveStatement.executeQuery(HiveStatement.java:187) 
&lt;BR /&gt; at lc_d_a.hivepreprod_0_1.HivePreProd.tHiveInput_1Process(HivePreProd.java:702) 
&lt;BR /&gt; at lc_d_a.hivepreprod_0_1.HivePreProd.tHiveConnection_1Process(HivePreProd.java:447) 
&lt;BR /&gt; at lc_d_a.hivepreprod_0_1.HivePreProd.runJobInTOS(HivePreProd.java:1759) 
&lt;BR /&gt; at lc_d_a.hivepreprod_0_1.HivePreProd.main(HivePreProd.java:1624) 
&lt;BR /&gt;Caused by: java.io.IOException: CreateProcess error=2, The system cannot find the file specified 
&lt;BR /&gt; at java.lang.ProcessImpl.create(Native Method) 
&lt;BR /&gt; at java.lang.ProcessImpl.&amp;lt;init&amp;gt;(Unknown Source) 
&lt;BR /&gt; at java.lang.ProcessImpl.start(Unknown Source) 
&lt;BR /&gt; ... 15 more</description>
      <pubDate>Sat, 16 Nov 2024 12:09:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279872#M54890</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2024-11-16T12:09:37Z</dc:date>
    </item>
    <item>
      <title>Re: THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279873#M54891</link>
      <description>Hi, 
&lt;BR /&gt;
&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;Cannot run program "null/bin/hadoop"&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;From the error info, please make sure the Environment variables is correct. 
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Mon, 05 Nov 2012 07:33:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279873#M54891</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-11-05T07:33:52Z</dc:date>
    </item>
    <item>
      <title>Re: THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279874#M54892</link>
      <description>Thanks for your replay. 
&lt;BR /&gt;Actually, it's not the problem with the environment variables.
&lt;BR /&gt;It seems the neither THiveConnection nor the THiveInput connects to the remote server, even though I specify the host and port for the remote connection. Instead, it tries to execute a query locally on my Windows workstation.
&lt;BR /&gt;How can I make Talend know, it needs to connect to a remote HIVE Thrift server...?
&lt;BR /&gt;Cheers,
&lt;BR /&gt;Agnieszka</description>
      <pubDate>Mon, 05 Nov 2012 12:18:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279874#M54892</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2012-11-05T12:18:18Z</dc:date>
    </item>
    <item>
      <title>Re: THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279875#M54893</link>
      <description>I have identified the problem. 
&lt;BR /&gt;I was using the "embedded" connection. The Talend's job was showing the connection was fine but in the reality the generated Java code had a connection string, omitting my host and port specified in the settings. As a result, Talend was trying to execute a HIVE query locally on my Windows machine. 
&lt;BR /&gt;Why is "embedded" wrong? Why the tool pretends the HIVE remote connection worked fine? Why it tries to run HIVE query locally even though I have specified the remote host and the port? I would consider it a bug...</description>
      <pubDate>Mon, 05 Nov 2012 16:39:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279875#M54893</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2012-11-05T16:39:33Z</dc:date>
    </item>
    <item>
      <title>Re: THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279876#M54894</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;I have identified the problem. &lt;BR /&gt;I was using the "embedded" connection. The Talend's job was showing the connection was fine but in the reality the generated Java code had a connection string, omitting my host and port specified in the settings. As a result, Talend was trying to execute a HIVE query locally on my Windows machine. &lt;BR /&gt;Why is "embedded" wrong? Why the tool pretends the HIVE remote connection worked fine? Why it tries to run HIVE query locally even though I have specified the remote host and the port? I would consider it a bug...&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Please report a bug on our bugtracker, export your job and attach it! 
&lt;BR /&gt;Thank you!</description>
      <pubDate>Mon, 05 Nov 2012 18:28:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279876#M54894</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-11-05T18:28:09Z</dc:date>
    </item>
    <item>
      <title>Re: THiveInput throws an exception:java.io.IOException</title>
      <link>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279877#M54895</link>
      <description>Hello, 
&lt;BR /&gt;Here is the explanation: 
&lt;BR /&gt;You have two different ways to connect to hive: the standalone mode and the embedded mode. 
&lt;BR /&gt;The standalone mode is a direct JDBC connection to the Hive server. The Hive server usually runs on the port 10000. 
&lt;BR /&gt;The embedded mode is a kind of indirect connection since a hive server is embedded in your client job. You then connect to the hive metastore through Thrift. The Thrift server doesn't run on the same port. 
&lt;BR /&gt;Finally, in order to fix the issue you have met above, you would have to specify the jobtracker (there is an option in the components). 
&lt;BR /&gt;HTH, 
&lt;BR /&gt;Rémy.</description>
      <pubDate>Sat, 17 Nov 2012 11:20:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/THiveInput-throws-an-exception-java-io-IOException/m-p/2279877#M54895</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-11-17T11:20:22Z</dc:date>
    </item>
  </channel>
</rss>

