<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Error while running tHDFSCopy in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288687#M62152</link>
    <description>Hi everybody,
&lt;BR /&gt;I am testing Talend Open Studio for Big Data 5.5.0 with a Hadoop cluster on AWS (Cloudera distribution, CDH4.4.0 version). I have a file called customer.csv, which I am trying to copy from my home directory to a subdirectory called /new. I set up a job that consists of a single component, tHDFSCopy. The job runs for a while, produces an EMPTY customer.csv file in the target directory, and ends with the following error:
&lt;BR /&gt;Exception in component tHDFSCopy_1
&lt;BR /&gt;org.apache.hadoop.hdfs.BlockMissingException: Could not obtain block: BP-425321293-10.15.244.108-1401446443266:blk_8047645766350991207_142708 file=/user/kpopov/customer.csv
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.chooseDataNode(DFSInputStream.java:839)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.blockSeekTo(DFSInputStream.java:531)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.readWithStrategy(DFSInputStream.java:750)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:794)
&lt;BR /&gt; at java.io.DataInputStream.read(Unknown Source)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:78)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:52)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:112)
&lt;BR /&gt; at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:260)
&lt;BR /&gt; at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:232)
&lt;BR /&gt; at copyfileinhdfs.copyfileinhdfs_0_1.CopyFileInHDFS.tHDFSCopy_1Process(CopyFileInHDFS.java:339)
&lt;BR /&gt; at copyfileinhdfs.copyfileinhdfs_0_1.CopyFileInHDFS.runJobInTOS(CopyFileInHDFS.java:589)
&lt;BR /&gt;Can anyone tell me what is going on?</description>
    <pubDate>Sat, 16 Nov 2024 11:37:08 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T11:37:08Z</dc:date>
    <item>
      <title>Error while running tHDFSCopy</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288687#M62152</link>
      <description>Hi everybody,
&lt;BR /&gt;I am testing Talend Open Studio for Big Data 5.5.0 with a Hadoop cluster on AWS (Cloudera distribution, CDH4.4.0 version). I have a file called customer.csv, which I am trying to copy from my home directory to a subdirectory called /new. I set up a job that consists of a single component, tHDFSCopy. The job runs for a while, produces an EMPTY customer.csv file in the target directory, and ends with the following error:
&lt;BR /&gt;Exception in component tHDFSCopy_1
&lt;BR /&gt;org.apache.hadoop.hdfs.BlockMissingException: Could not obtain block: BP-425321293-10.15.244.108-1401446443266:blk_8047645766350991207_142708 file=/user/kpopov/customer.csv
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.chooseDataNode(DFSInputStream.java:839)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.blockSeekTo(DFSInputStream.java:531)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.readWithStrategy(DFSInputStream.java:750)
&lt;BR /&gt; at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:794)
&lt;BR /&gt; at java.io.DataInputStream.read(Unknown Source)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:78)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:52)
&lt;BR /&gt; at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:112)
&lt;BR /&gt; at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:260)
&lt;BR /&gt; at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:232)
&lt;BR /&gt; at copyfileinhdfs.copyfileinhdfs_0_1.CopyFileInHDFS.tHDFSCopy_1Process(CopyFileInHDFS.java:339)
&lt;BR /&gt; at copyfileinhdfs.copyfileinhdfs_0_1.CopyFileInHDFS.runJobInTOS(CopyFileInHDFS.java:589)
&lt;BR /&gt;Can anyone tell me what is going on?</description>
      <pubDate>Sat, 16 Nov 2024 11:37:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288687#M62152</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T11:37:08Z</dc:date>
    </item>
    <item>
      <title>Re: Error while running tHDFSCopy</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288688#M62153</link>
      <description>Have you checked this scenario?
&lt;A href="https://help.talend.com/pages/viewpage.action?pageId=9310644#ychen-20120907-bigdata-thdfslist_scenario" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/pages/viewpage.action?pageId=9310644#ychen-20120907-bigdata-thdfslist_scenario&lt;/A&gt;
&lt;BR /&gt;Is the connection OK?
&lt;BR /&gt;Vaibhav</description>
      <pubDate>Mon, 09 Jun 2014 14:31:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288688#M62153</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-09T14:31:40Z</dc:date>
    </item>
    <item>
      <title>Re: Error while running tHDFSCopy</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288689#M62154</link>
      <description>Yes, the connection is correct: the IP is right, the port (8020) is right, and the Hadoop version is correct. When I open the Component tab and click the button next to the File Name field, Open Studio connects to HDFS fine and lets me choose a directory for the file to be copied to. The only problem, as I said before, is that the copied file turns out to be empty and Open Studio ends the job with the error.</description>
      <pubDate>Tue, 10 Jun 2014 12:46:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288689#M62154</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-10T12:46:59Z</dc:date>
    </item>
    <item>
      <title>Re: Error while running tHDFSCopy</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288690#M62155</link>
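      <description>OK, it seems the issue was that port 50010, which the DataNode uses for data transfer, was closed. The NameNode connection on port 8020 worked, but the client could not read the file's blocks from the DataNode.</description>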
      <pubDate>Tue, 10 Jun 2014 16:49:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288690#M62155</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-10T16:49:32Z</dc:date>
    </item>
    <item>
      <title>Re: Error while running tHDFSCopy</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288691#M62156</link>
      <description>Hi kpopov, &lt;BR /&gt;Is the component working well for you now? If the issue is fixed, could you please click the "Set this topic as resolved" link, which is right underneath your initial post? This way, other users will know that this thread has been resolved.&lt;BR /&gt;Many thanks&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 11 Jun 2014 08:44:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Error-while-running-tHDFSCopy/m-p/2288691#M62156</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-11T08:44:12Z</dc:date>
    </item>
  </channel>
</rss>

