<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic In Talend BigDataBatch job Parquet DataType Issues in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233602#M23229</link>
    <description>Hi All,
&lt;BR /&gt;I'm new to Talend. I'm trying to fetch data from a Parquet file in HDFS, apply transformations to it, and store the result as a separate Parquet file in HDFS.
&lt;BR /&gt;Error: java.lang.ClassCastException: parquet.example.data.simple.LongValue cannot be cast to parquet.example.data.simple.BinaryValue
&lt;BR /&gt;at parquet.example.data.simple.SimpleGroup.getString(SimpleGroup.java:121)
&lt;BR /&gt;at parquet.example.data.GroupValueSource.getString(GroupValueSource.java:32)
&lt;BR /&gt;at local_project.hadoop_1_0_1.hadoop_1$TalendParquetInputMapper_tFileInputParquet_1.map(hadoop_1.java:450)
&lt;BR /&gt;at local_project.hadoop_1_0_1.hadoop_1$TalendParquetInputMapper_tFileInputParquet_1.map(hadoop_1.java:1)
&lt;BR /&gt;at org.talend.hadoop.mapred.lib.ChainMapper.map(ChainMapper.java:63)
&lt;BR /&gt;at org.talend.hadoop.mapred.lib.DelegatingMapper.map(DelegatingMapper.java:44)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:435)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.run(MapTask.java:347)
&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)
&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:422)
&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1566)
&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
&lt;BR /&gt;My main aim: in a BigDataBatch job, tFileInputParquet reads from a Hive Parquet table that has the BIGINT datatype for some columns. When I retrieve the schema in Talend, BIGINT changes to BigDecimal. BigDecimal is not supported by Talend, so I changed it to String or Long and then typecast to Long in tMap. The main reason for the conversion is that the output Parquet table has some BIGINT columns, so loading the data directly throws an error. I am therefore typecasting String to Long on the tMap output schema columns. When I run the job on the server, it gives this error: 
&lt;FONT color="#ff6633"&gt;(Error: java.lang.ClassCastException: parquet.example.data.simple.LongValue cannot be cast to parquet.example.data.simple.BinaryValue&lt;/FONT&gt;). I get this error when loading data from tFileInputParquet to tFileOutParquet, i.e. Parquet to Parquet.
&lt;BR /&gt;Also, when the columns of both tFileInputParquet and tFileOutParquet use the FLOAT, DOUBLE, DATE, or LONG datatypes in Talend, the job gives an error; only STRING and INT seem to be supported. With FLOAT, DOUBLE, DATE, or LONG it throws an error like this:
&lt;FONT color="#ff6633"&gt; The method getLong(String, int) is undefined for the type Group&lt;/FONT&gt;. I have added some screenshots below that relate to the errors.
&lt;BR /&gt;
&lt;BR /&gt;Where am I going wrong? Any help will be appreciated.
&lt;BR /&gt;Thanks in advance.
&lt;BR /&gt;Here are the screenshots:</description>
    <pubDate>Sat, 16 Nov 2024 10:06:22 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T10:06:22Z</dc:date>
    <item>
      <title>In Talend BigDataBatch job Parquet DataType Issues</title>
      <link>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233602#M23229</link>
      <description>Hi All,
&lt;BR /&gt;I'm new to Talend. I'm trying to fetch data from a Parquet file in HDFS, apply transformations to it, and store the result as a separate Parquet file in HDFS.
&lt;BR /&gt;Error: java.lang.ClassCastException: parquet.example.data.simple.LongValue cannot be cast to parquet.example.data.simple.BinaryValue
&lt;BR /&gt;at parquet.example.data.simple.SimpleGroup.getString(SimpleGroup.java:121)
&lt;BR /&gt;at parquet.example.data.GroupValueSource.getString(GroupValueSource.java:32)
&lt;BR /&gt;at local_project.hadoop_1_0_1.hadoop_1$TalendParquetInputMapper_tFileInputParquet_1.map(hadoop_1.java:450)
&lt;BR /&gt;at local_project.hadoop_1_0_1.hadoop_1$TalendParquetInputMapper_tFileInputParquet_1.map(hadoop_1.java:1)
&lt;BR /&gt;at org.talend.hadoop.mapred.lib.ChainMapper.map(ChainMapper.java:63)
&lt;BR /&gt;at org.talend.hadoop.mapred.lib.DelegatingMapper.map(DelegatingMapper.java:44)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:435)
&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.run(MapTask.java:347)
&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)
&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:422)
&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1566)
&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
&lt;BR /&gt;My main aim: in a BigDataBatch job, tFileInputParquet reads from a Hive Parquet table that has the BIGINT datatype for some columns. When I retrieve the schema in Talend, BIGINT changes to BigDecimal. BigDecimal is not supported by Talend, so I changed it to String or Long and then typecast to Long in tMap. The main reason for the conversion is that the output Parquet table has some BIGINT columns, so loading the data directly throws an error. I am therefore typecasting String to Long on the tMap output schema columns. When I run the job on the server, it gives this error: 
&lt;FONT color="#ff6633"&gt;(Error: java.lang.ClassCastException: parquet.example.data.simple.LongValue cannot be cast to parquet.example.data.simple.BinaryValue&lt;/FONT&gt;). I get this error when loading data from tFileInputParquet to tFileOutParquet, i.e. Parquet to Parquet.
&lt;BR /&gt;Also, when the columns of both tFileInputParquet and tFileOutParquet use the FLOAT, DOUBLE, DATE, or LONG datatypes in Talend, the job gives an error; only STRING and INT seem to be supported. With FLOAT, DOUBLE, DATE, or LONG it throws an error like this:
&lt;FONT color="#ff6633"&gt; The method getLong(String, int) is undefined for the type Group&lt;/FONT&gt;. I have added some screenshots below that relate to the errors.
&lt;BR /&gt;
&lt;BR /&gt;Where am I going wrong? Any help will be appreciated.
&lt;BR /&gt;Thanks in advance.
&lt;BR /&gt;Here are the screenshots:</description>
      <pubDate>Sat, 16 Nov 2024 10:06:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233602#M23229</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:06:22Z</dc:date>
    </item>
    <item>
      <title>Re: In Talend BigDataBatch job Parquet DataType Issues</title>
      <link>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233603#M23230</link>
      <description>Hi,&lt;BR /&gt;&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Thank you for your post! We can't see the screenshot on our side. Could you attach it on the forum, please? That would be great.&lt;/FONT&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="2"&gt;&lt;FONT face="Calibri"&gt;Sabrina&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Tue, 07 Feb 2017 11:15:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233603#M23230</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-02-07T11:15:52Z</dc:date>
    </item>
    <item>
      <title>Re: In Talend BigDataBatch job Parquet DataType Issues</title>
      <link>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233604#M23231</link>
      <description>Hi sabrinaa,&lt;BR /&gt;I couldn't attach the screenshots at all. Please tell me how to attach them.&lt;BR /&gt;My screenshots are around 10KB to 23KB in PNG format, but they would not load.</description>
      <pubDate>Tue, 07 Feb 2017 12:38:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/In-Talend-BigDataBatch-job-Parquet-DataType-Issues/m-p/2233604#M23231</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-02-07T12:38:38Z</dc:date>
    </item>
  </channel>
</rss>

