<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HDFS transformation error in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/HDFS-transformation-error/m-p/2240510#M27979</link>
    <description>Hi, 
&lt;BR /&gt;On which build version you got this issue? Do you have any problem when upload the screenshots into forum? 
&lt;BR /&gt;More information will be helpful for us to address your issue. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
    <pubDate>Fri, 04 Mar 2016 07:13:07 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-03-04T07:13:07Z</dc:date>
    <item>
      <title>HDFS transformation error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/HDFS-transformation-error/m-p/2240509#M27978</link>
      <description>Its a simple extraction job, no transformations but still getting error: 
&lt;BR /&gt;Not able to upload images - its a Big Data batch job, and the same one runs for other files on hdfs but this one. 
&lt;BR /&gt; 
&lt;PRE&gt;java.lang.ClassCastException: org.apache.avro.generic.GenericData$Record cannot be cast to talendpoc.extract_loans_0_1.row1Struct&lt;BR /&gt;at talendpoc.extract_loans_0_1.extract_loans$tFileOutputDelimited_1StructOutputFormat$HDFSRecordWriter.write(extract_loans.java:1)&lt;BR /&gt;at org.apache.spark.SparkHadoopWriter.write(SparkHadoopWriter.scala:95)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1073)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1059)&lt;BR /&gt;at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61)&lt;BR /&gt;at org.apache.spark.scheduler.Task.run(Task.scala:64)&lt;BR /&gt;at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:745)&lt;BR /&gt;: org.apache.spark.scheduler.TaskSetManager - Task 0 in stage 0.0 failed 4 times; aborting job&lt;BR /&gt;org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3, ip-10-9-1-197.ec2.internal): java.lang.ClassCastException: org.apache.avro.generic.GenericData$Record cannot be cast to talendpoc.extract_loans_0_1.row1Struct&lt;BR /&gt;at talendpoc.extract_loans_0_1.extract_loans$tFileOutputDelimited_1StructOutputFormat$HDFSRecordWriter.write(extract_loans.java:1)&lt;BR /&gt;at org.apache.spark.SparkHadoopWriter.write(SparkHadoopWriter.scala:95)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1073)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1059)&lt;BR /&gt;at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61)&lt;BR /&gt;at org.apache.spark.scheduler.Task.run(Task.scala:64)&lt;BR /&gt;at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:745)&lt;BR /&gt;Driver stacktrace:&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1204)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1193)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1192)&lt;BR /&gt;at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)&lt;BR /&gt;at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1192)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)&lt;BR /&gt;at scala.Option.foreach(Option.scala:236)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:693)&lt;BR /&gt;at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1393)&lt;BR /&gt;at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1354)&lt;BR /&gt;at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48)&lt;BR /&gt;1|20160303071453_LSC0S|20160303071453_LSC0S|20160303071453_LSC0S|out|0|2|100.0&lt;BR /&gt;1|20160303071453_LSC0S|20160303071453_LSC0S|20160303071453_LSC0S|row1|0|2|100.0&lt;BR /&gt;1|20160303071453_LSC0S|20160303071453_LSC0S|20160303071453_LSC0S|row_tFileInputDelimited_1_HDFSInputFormat|0|2|100.0&lt;BR /&gt;org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3, ip-10-9-1-197.ec2.internal): java.lang.ClassCastException: org.apache.avro.generic.GenericData$Record cannot be cast to talendpoc.extract_loans_0_1.row1Struct&lt;BR /&gt;at talendpoc.extract_loans_0_1.extract_loans$tFileOutputDelimited_1StructOutputFormat$HDFSRecordWriter.write(extract_loans.java:1)&lt;BR /&gt;at org.apache.spark.SparkHadoopWriter.write(SparkHadoopWriter.scala:95)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1073)&lt;BR /&gt;at org.apache.spark.rdd.PairRDDFunctions$$anonfun$13.apply(PairRDDFunctions.scala:1059)&lt;BR /&gt;at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61)&lt;BR /&gt;at org.apache.spark.scheduler.Task.run(Task.scala:64)&lt;BR /&gt;at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:745)&lt;BR /&gt;Driver stacktrace:&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1204)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1193)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1192)&lt;BR /&gt;at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)&lt;BR /&gt;at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1192)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)&lt;BR /&gt;at scala.Option.foreach(Option.scala:236)&lt;BR /&gt;at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:693)&lt;BR /&gt;at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1393)&lt;BR /&gt;at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1354)&lt;BR /&gt;at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48)&lt;BR /&gt;&lt;I&gt;Job extract_loans ended at 16:39 02/03/2016. &lt;/I&gt;&lt;BR /&gt;&lt;/PRE&gt;</description>
      <pubDate>Sat, 16 Nov 2024 10:45:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/HDFS-transformation-error/m-p/2240509#M27978</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:45:29Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS transformation error</title>
      <link>https://community.qlik.com/t5/Talend-Studio/HDFS-transformation-error/m-p/2240510#M27979</link>
      <description>Hi, 
&lt;BR /&gt;On which build version you got this issue? Do you have any problem when upload the screenshots into forum? 
&lt;BR /&gt;More information will be helpful for us to address your issue. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Fri, 04 Mar 2016 07:13:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/HDFS-transformation-error/m-p/2240510#M27979</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-03-04T07:13:07Z</dc:date>
    </item>
  </channel>
</rss>

