<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to include partition/bucket parameter when writing file to HDFS in Qlik Compose</title>
    <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1694344#M1573</link>
    <description>&lt;P&gt;Currently we have been using compose to transfer the data from storage layer to provision layer(HDFS in parquet format)&lt;/P&gt;&lt;P&gt;By default it is using the below command for writing the file to HDFS.&lt;/P&gt;&lt;P&gt;&amp;lt;D_F&amp;gt;.write&lt;BR /&gt;.mode("Overwrite")&lt;BR /&gt;.format("PARQUET")&lt;BR /&gt;.save("hdfs:///....")&lt;/P&gt;&lt;P&gt;We need to include partition/bucket parameter based on some column to write the parquet files in HDFS based on partition/bucket key.&lt;/P&gt;&lt;P&gt;Kindly advise if there any way to do.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 20 Mar 2025 15:56:15 GMT</pubDate>
    <dc:creator>varadharaj</dc:creator>
    <dc:date>2025-03-20T15:56:15Z</dc:date>
    <item>
      <title>How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1694344#M1573</link>
      <description>&lt;P&gt;Currently we have been using compose to transfer the data from storage layer to provision layer(HDFS in parquet format)&lt;/P&gt;&lt;P&gt;By default it is using the below command for writing the file to HDFS.&lt;/P&gt;&lt;P&gt;&amp;lt;D_F&amp;gt;.write&lt;BR /&gt;.mode("Overwrite")&lt;BR /&gt;.format("PARQUET")&lt;BR /&gt;.save("hdfs:///....")&lt;/P&gt;&lt;P&gt;We need to include partition/bucket parameter based on some column to write the parquet files in HDFS based on partition/bucket key.&lt;/P&gt;&lt;P&gt;Kindly advise if there any way to do.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Mar 2025 15:56:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1694344#M1573</guid>
      <dc:creator>varadharaj</dc:creator>
      <dc:date>2025-03-20T15:56:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697463#M1574</link>
      <description>&lt;P&gt;I think you can modify the generated Scripts generated by Compose to add additional parameters.&lt;/P&gt;&lt;P&gt;What version of Compose4DL and What version Hadoop ?&lt;/P&gt;</description>
      <pubDate>Tue, 28 Apr 2020 17:22:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697463#M1574</guid>
      <dc:creator>John_Park</dc:creator>
      <dc:date>2020-04-28T17:22:42Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697488#M1575</link>
      <description>&lt;P&gt;Correction you cannot modified the generated scripts.&lt;/P&gt;&lt;P&gt;Partitioning is not supported with Spark based projects with HWX/EMR.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Apr 2020 19:14:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697488#M1575</guid>
      <dc:creator>John_Park</dc:creator>
      <dc:date>2020-04-28T19:14:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697556#M1576</link>
      <description>&lt;P&gt;So Partitioning is not supported with spark (hortonworks).&lt;/P&gt;&lt;P&gt;Is there any way to bucket the hdfs files with spark option in compose&lt;/P&gt;</description>
      <pubDate>Wed, 29 Apr 2020 12:48:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1697556#M1576</guid>
      <dc:creator>varadharaj</dc:creator>
      <dc:date>2020-04-29T12:48:46Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1703862#M1577</link>
      <description>&lt;P&gt;Can someone reply whether bucketing supports with spark based projects&lt;/P&gt;</description>
      <pubDate>Wed, 20 May 2020 13:28:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1703862#M1577</guid>
      <dc:creator>varadharaj</dc:creator>
      <dc:date>2020-05-20T13:28:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1707293#M1578</link>
      <description>&lt;P&gt;Currently, Compose does not support specifying bucketing or partitioning for Spark projects.&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is supported natively in Hive projects, and can be applied to databricks projects by simply altering the DDL for databricks.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If this is a feature you'd like to see in the product, I suggest creating an "Idea" in the Qlik Product Insight &amp;amp; Ideas section of the community.&amp;nbsp; (In the left menu of this page&amp;nbsp; &amp;nbsp;&amp;lt;&amp;lt;&amp;lt;&amp;lt;&amp;nbsp; &amp;nbsp;you should see this icon and you can put in requests)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="TimGarrod_0-1591050748879.png" style="width: 400px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/34780i65A4A485B02FC58D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="TimGarrod_0-1591050748879.png" alt="TimGarrod_0-1591050748879.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jun 2020 22:33:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1707293#M1578</guid>
      <dc:creator>TimGarrod</dc:creator>
      <dc:date>2020-06-01T22:33:38Z</dc:date>
    </item>
    <item>
      <title>Re: How to include partition/bucket parameter when writing file to HDFS</title>
      <link>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1708715#M1579</link>
      <description>&lt;P&gt;You can run a mock S3 server (there are many projects that can do this, have a google and choose one you like) and then point Spark at the server by setting the fs.s3a.endpoint property.&lt;/P&gt;&lt;P&gt;The fs.s3a... properties are Hadoop properties, you can set them directly in core-site.xml. If you want to set them dynamically in your spark context, all properties are prefixed with spark.hadoop.&lt;/P&gt;&lt;P&gt;So to set the new endpoint in your test code:&lt;/P&gt;&lt;P&gt;val spark = SparkSession.builder&lt;BR /&gt;.master("local")&lt;BR /&gt;.appName("test suite")&lt;BR /&gt;.config("spark.hadoop.fs.s3a.endpoint", "localhost:9090")&lt;BR /&gt;.getOrCreate()&lt;/P&gt;</description>
      <pubDate>Sat, 06 Jun 2020 00:12:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Compose/How-to-include-partition-bucket-parameter-when-writing-file-to/m-p/1708715#M1579</guid>
      <dc:creator>jacobfrey121</dc:creator>
      <dc:date>2020-06-06T00:12:13Z</dc:date>
    </item>
  </channel>
</rss>

