<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Parquet performance optimization in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Parquet-performance-optimization/m-p/2264163#M44095</link>
    <description>&lt;P&gt;HI All,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have use case where we want to convert 17TB of data(csv files) into parquet. we are using EMR spark cluster for conversion. we have designed Big data job with tFileOutputParquet component to create the file. currently our job is taking long time to convert the files. did anyone achieved parquet conversion with alternate approach and optimized design? kindly share some inputs if known.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 07:14:47 GMT</pubDate>
    <dc:creator>ankushd</dc:creator>
    <dc:date>2024-11-16T07:14:47Z</dc:date>
    <item>
      <title>Parquet performance optimization</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parquet-performance-optimization/m-p/2264163#M44095</link>
      <description>&lt;P&gt;HI All,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have use case where we want to convert 17TB of data(csv files) into parquet. we are using EMR spark cluster for conversion. we have designed Big data job with tFileOutputParquet component to create the file. currently our job is taking long time to convert the files. did anyone achieved parquet conversion with alternate approach and optimized design? kindly share some inputs if known.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 07:14:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parquet-performance-optimization/m-p/2264163#M44095</guid>
      <dc:creator>ankushd</dc:creator>
      <dc:date>2024-11-16T07:14:47Z</dc:date>
    </item>
  </channel>
</rss>

