<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Replicate to Cloudera in ORC format, with table properties set. in Qlik Replicate</title>
    <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797193#M846</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/124114"&gt;@Abrie_M&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From Hive 3&amp;nbsp;Hive is enforcing a strict separation between the&amp;nbsp;hive.metastore.warehouse.external.dir vs hive.metastore.warehouse.dir. That leads the lower versions of Replicate cannot work with CDP 7.x.&lt;/P&gt;&lt;P&gt;Replicate 7.x supports CDP 7.x by Endpoint type "&lt;STRONG&gt;Cloudera Data Platform (CDP) Private Cloud&lt;/STRONG&gt;" however you know there is some limitations, include does not support HttpFS, does not support ORC format etc.&lt;/P&gt;&lt;P&gt;I'm afraid you cannot use general Hadoop endpoint type "&lt;STRONG&gt;Hadoop&lt;/STRONG&gt;" connect to CDP 7.x.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;John.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 06 Apr 2021 13:16:12 GMT</pubDate>
    <dc:creator>john_wang</dc:creator>
    <dc:date>2021-04-06T13:16:12Z</dc:date>
    <item>
      <title>Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1795568#M828</link>
      <description>&lt;P&gt;Good day&lt;/P&gt;&lt;P&gt;We need to do a full refresh to Cloudera in ORC format, with the following table properties set.&lt;/P&gt;&lt;P&gt;'transactional'='true'&lt;BR /&gt;'transactional_properties'='insert_only'&lt;/P&gt;&lt;P&gt;We are using the Hadoop endpoint and the syntax file is Hive13.&lt;/P&gt;&lt;P&gt;I believe this should be possible by changing the syntax file. Any idea where in the syntax file this could be changed?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanx&lt;/P&gt;&lt;P&gt;Abrie&lt;/P&gt;</description>
      <pubDate>Tue, 30 Mar 2021 09:03:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1795568#M828</guid>
      <dc:creator>Abrie_M</dc:creator>
      <dc:date>2021-03-30T09:03:53Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1796604#M840</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/124114"&gt;@Abrie_M&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;The default Hive13 table creation syntax is&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;"create_table":	"CREATE ${TABLE_TYPE} TABLE ${QO}${TABLE_NAME}${QC} ( ${COLUMN_LIST} )",&lt;/LI-CODE&gt;&lt;P&gt;For your scenario it should be like:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;"create_table":	"CREATE ${TABLE_TYPE} TABLE ${QO}${TABLE_NAME}${QC} ( ${COLUMN_LIST} stored as orc TBLPROPERTIES('transactional'='true','transactional_properties'='insert_only'))",&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However it depends on the Cloudera cluster version. if your Cloudera Cluster is&amp;nbsp;CDP 7.x then Replicate 7.0 does not support ORC storage format. if it's a lower Cloudera version (still in support scope) then maybe it's doable . We'd like to suggest you open a support case.&amp;nbsp; And even we recommend getting our Professional Services team involved.&amp;nbsp;&lt;/P&gt;&lt;P&gt;BTW, how about if you create the target tables with additional properties out of Replicate ? Then you let Replicate to use the pre-defined tables rather than re-create them again.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;John.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Apr 2021 12:56:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1796604#M840</guid>
      <dc:creator>john_wang</dc:creator>
      <dc:date>2021-04-02T12:56:51Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797018#M844</link>
      <description>&lt;P&gt;Hi John&lt;/P&gt;&lt;P&gt;Thanx for the reply. I will try your suggestion.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are using CDP 7.x and although we could not get the CDP type endpoint working (CDP endpoint only supports&amp;nbsp;&lt;SPAN&gt;WebHDFS, and not&amp;nbsp;HttpFS, as in the Hadoop endpoint), we are continuing using the Hadoop endpoint and want to use the ORC format.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Our BI developers requested the specific ORC format and properties. Unfortunately there are more than a thousand tables, so I'm looking for the easiest way to provide the tables.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Abrie&lt;/P&gt;</description>
      <pubDate>Tue, 06 Apr 2021 04:57:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797018#M844</guid>
      <dc:creator>Abrie_M</dc:creator>
      <dc:date>2021-04-06T04:57:13Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797193#M846</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/124114"&gt;@Abrie_M&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From Hive 3&amp;nbsp;Hive is enforcing a strict separation between the&amp;nbsp;hive.metastore.warehouse.external.dir vs hive.metastore.warehouse.dir. That leads the lower versions of Replicate cannot work with CDP 7.x.&lt;/P&gt;&lt;P&gt;Replicate 7.x supports CDP 7.x by Endpoint type "&lt;STRONG&gt;Cloudera Data Platform (CDP) Private Cloud&lt;/STRONG&gt;" however you know there is some limitations, include does not support HttpFS, does not support ORC format etc.&lt;/P&gt;&lt;P&gt;I'm afraid you cannot use general Hadoop endpoint type "&lt;STRONG&gt;Hadoop&lt;/STRONG&gt;" connect to CDP 7.x.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;John.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Apr 2021 13:16:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797193#M846</guid>
      <dc:creator>john_wang</dc:creator>
      <dc:date>2021-04-06T13:16:12Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797245#M847</link>
      <description>&lt;P&gt;Hi John&lt;/P&gt;&lt;P&gt;Please see case nr 02094431, where we worked with Qlik support to resolve the CDP endpoint setup, as well as&amp;nbsp;the final confirmation we received from Qlik regarding support for the Hadoop endpoint with CDP.&lt;/P&gt;&lt;P&gt;"Hello Abrie,&lt;/P&gt;&lt;P&gt;We can confirm that Qlik Replicate supports Hadoop endpoint to connect to CDP cluster.&lt;/P&gt;&lt;P&gt;Let us know if we can help with anything else.&lt;/P&gt;&lt;P&gt;Kind regards,&lt;BR /&gt;Pedro&lt;BR /&gt;09 March 2021 at 15:43"&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;I will discuss the possible issues with our architects to decide on the way forward.&lt;/P&gt;&lt;P&gt;Thanx again for help in this matter.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Abrie&lt;/P&gt;</description>
      <pubDate>Tue, 06 Apr 2021 14:51:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797245#M847</guid>
      <dc:creator>Abrie_M</dc:creator>
      <dc:date>2021-04-06T14:51:08Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797391#M854</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/124114"&gt;@Abrie_M&lt;/a&gt;&amp;nbsp; ,&lt;/P&gt;&lt;P&gt;I&amp;nbsp;apologize for my mistake.&amp;nbsp; The general "&lt;STRONG&gt;Hadoop&lt;/STRONG&gt;" endpoint should work with CDP 7.x.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Replicate 7.0&lt;/STRONG&gt; introduces a new parameter "&lt;STRONG&gt;Use default location for managed tables&lt;/STRONG&gt;" in "Hadoop" endpoint. That means Replicate 7.0 distinguish the file locations between the external tables and internal (managed) tables. It solved the problem of&amp;nbsp;Hive 3 enforcing strict I mentioned above. I'm attaching&amp;nbsp; a screen copy of the comparison between 6.6 and 7.0.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Replicate 7.0 - Use default location for managed tables.jpg" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/52490iF6818859736DBC89/image-size/large?v=v2&amp;amp;px=999" role="button" title="Replicate 7.0 - Use default location for managed tables.jpg" alt="Replicate 7.0 - Use default location for managed tables.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Good luck,&lt;/P&gt;&lt;P&gt;John.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Apr 2021 03:02:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/1797391#M854</guid>
      <dc:creator>john_wang</dc:creator>
      <dc:date>2021-04-07T03:02:50Z</dc:date>
    </item>
    <item>
      <title>Re: Replicate to Cloudera in ORC format, with table properties set.</title>
      <link>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/2129198#M7753</link>
      <description>&lt;P&gt;Hi John,&lt;/P&gt;
&lt;P&gt;we are using the Hadoop endpoint as target and using parquet file format for CDP7.x version. The issue we are facing is in wrt the external table creation with parquet files its always defaulting to the managed table creation in the HIVE. Is there any solution to create the external table with the parquet..&lt;/P&gt;
&lt;P&gt;I tried changing the Hive13 syntax and also by adding the internal parameter&lt;SPAN&gt;$info.query_syntax.create_table =&amp;nbsp;CREATE EXTERNAL TABLE ${QO}${TABLE_NAME}${QC} ( ${COLUMN_LIST} stored as parquet TBLPROPERTIES('transactional'='true','transactional_properties'='insert_only'))&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;But its not reflecting in the target and always defaulted to the managed table type. we need this as external table with parquet using Hadoop. Please do the needful.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Oct 2023 12:47:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Qlik-Replicate/Replicate-to-Cloudera-in-ORC-format-with-table-properties-set/m-p/2129198#M7753</guid>
      <dc:creator>skokanay</dc:creator>
      <dc:date>2023-10-17T12:47:04Z</dc:date>
    </item>
  </channel>
</rss>

