<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Sqoop Import - Hive Parquet Import Error - Unknown dataset URI pattern: in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Sqoop-Import-Hive-Parquet-Import-Error-Unknown-dataset-URI/m-p/2299360#M71710</link>
    <description>&lt;P&gt;Good morning. I have an issue attempting to activate and test the direct hive import feature.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Talend version: 7.2&lt;/P&gt;&lt;P&gt;Cloudera version: 6.1.1&lt;/P&gt;&lt;P&gt;Hive version: 2.1.1-cdh6.1.1&lt;/P&gt;&lt;P&gt;Kerberos: yes&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am testing for a client an evolution of a sqoop import job. The client currently imports its data in Parquet file, using the Java API mode of the tSqoopImport component, and for the next step we want to create/fill the hive table at the same time.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here is the error I get when executing the job:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;I&gt;[ERROR]: org.apache.sqoop.Sqoop - Got exception running Sqoop: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:hive:/dev_ingestion/test_import_direct&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;Check that JARs for hive datasets are on the classpath&lt;/I&gt;&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The database and its table exist. I have tried everything I tried changing the parameters to put the database name into the hive.table parameter. I also tried to add the path to Cloudera's hadoop and hive libraries (&lt;I&gt;/opt/cloudera/parcels/CDH/lib&lt;/I&gt;) to the classpath in the execution tab. Both had no effect.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You will find attached all the relevant screenshots&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you in advance for your answer and advice.&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 01:18:31 GMT</pubDate>
    <dc:creator>CPorrot1602485748</dc:creator>
    <dc:date>2024-11-16T01:18:31Z</dc:date>
    <item>
      <title>Sqoop Import - Hive Parquet Import Error - Unknown dataset URI pattern:</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Sqoop-Import-Hive-Parquet-Import-Error-Unknown-dataset-URI/m-p/2299360#M71710</link>
      <description>&lt;P&gt;Good morning. I have an issue attempting to activate and test the direct hive import feature.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Talend version: 7.2&lt;/P&gt;&lt;P&gt;Cloudera version: 6.1.1&lt;/P&gt;&lt;P&gt;Hive version: 2.1.1-cdh6.1.1&lt;/P&gt;&lt;P&gt;Kerberos: yes&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am testing for a client an evolution of a sqoop import job. The client currently imports its data in Parquet file, using the Java API mode of the tSqoopImport component, and for the next step we want to create/fill the hive table at the same time.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here is the error I get when executing the job:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;I&gt;[ERROR]: org.apache.sqoop.Sqoop - Got exception running Sqoop: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:hive:/dev_ingestion/test_import_direct&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;Check that JARs for hive datasets are on the classpath&lt;/I&gt;&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The database and its table exist. I have tried everything I tried changing the parameters to put the database name into the hive.table parameter. I also tried to add the path to Cloudera's hadoop and hive libraries (&lt;I&gt;/opt/cloudera/parcels/CDH/lib&lt;/I&gt;) to the classpath in the execution tab. Both had no effect.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You will find attached all the relevant screenshots&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you in advance for your answer and advice.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 01:18:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Sqoop-Import-Hive-Parquet-Import-Error-Unknown-dataset-URI/m-p/2299360#M71710</guid>
      <dc:creator>CPorrot1602485748</dc:creator>
      <dc:date>2024-11-16T01:18:31Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop Import - Hive Parquet Import Error - Unknown dataset URI pattern:</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Sqoop-Import-Hive-Parquet-Import-Error-Unknown-dataset-URI/m-p/2299361#M71711</link>
      <description>&lt;P&gt;Hi, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for writing to us. This is resolved in 7.3 version with the latest patch in place.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jun 2022 12:57:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Sqoop-Import-Hive-Parquet-Import-Error-Unknown-dataset-URI/m-p/2299361#M71711</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-13T12:57:39Z</dc:date>
    </item>
  </channel>
</rss>

