<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Consume kafka topic and store it in Hive using Talend Studio in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307188#M78643</link>
    <description>&lt;P&gt;Hi Shong, I have tried using tExtractJsonField as in below configuration:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpUJHAA3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/149481iB0BFFA2D6EBF7433/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpUJHAA3.png" alt="0695b00000SpUJHAA3.png" /&gt;&lt;/span&gt;Here, I loop the json path RequestHeader to fetch the data in it. However, once I run the job, there is no result from talend. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpUJbAAN.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/133381i79BF8C18CCB30C41/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpUJbAAN.png" alt="0695b00000SpUJbAAN.png" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 14 Jun 2022 06:21:28 GMT</pubDate>
    <dc:creator>WSyahirah21</dc:creator>
    <dc:date>2022-06-14T06:21:28Z</dc:date>
    <item>
      <title>Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307186#M78641</link>
      <description>&lt;P&gt;I am currently trying to create an ingestion job workflow using kafka in Talend Studio. The job will read the json data in topic "work" and store into the hive table. My idea is to use the following workflow in Talend:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;tKafKaInput &amp;gt; tLogRow &amp;gt; tJava &amp;gt; tMap&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;tKafKaInput and tLogRow : Consume the json data in topic "Work"&lt;/P&gt;&lt;P&gt;tJava : Fetch the json data and bring data to tMap&lt;/P&gt;&lt;P&gt;tMap : Structure the data and save into Hive table&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000RkFFVAA3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/149360i49501293EB30D138/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000RkFFVAA3.png" alt="0695b00000RkFFVAA3.png" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note : Snippet of json data in Kafka topic outputs from&amp;nbsp;tLogRow_1 is as in attachment (data).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The code in tJava to fetch the json data is basically in this line in which its trying to catch "Vers" data from json:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;I&gt;String output=((String)globalMap.get("tLogRow_1_OUTPUT"));&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;JSONObject jsonObject = new JSONObject(output);&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;System.out.println(jsonObject);&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;String sourceDBName=(jsonObject.getString("Vers"));&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I received the error as mentioned in attachment (Error). &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My questions are:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Is my workflow the best practice to ingest topic from json topic to hive table. Or is there any other possible ways to perform this activity using Talend Studio?&lt;/LI&gt;&lt;LI&gt;If this workflow is correctly designed, how do I modify the java code in tJava component so that it able to capture the json result from tLogRow and bring it forward to tMap for next activity?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any helps if much appreciated, thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 22:50:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307186#M78641</guid>
      <dc:creator>WSyahirah21</dc:creator>
      <dc:date>2024-11-15T22:50:25Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307187#M78642</link>
      <description>&lt;P&gt;Hi &lt;/P&gt;&lt;P&gt;tExtractJsonField is the best component used to extract data from a Json string. Please try it and let me know if it does not fit your need or you have any questions. &lt;/P&gt;&lt;P&gt;tKafKaInput &amp;gt; tExtractJsonField &amp;gt; tMap&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Shong&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jun 2022 04:32:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307187#M78642</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-14T04:32:36Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307188#M78643</link>
      <description>&lt;P&gt;Hi Shong, I have tried using tExtractJsonField as in below configuration:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpUJHAA3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/149481iB0BFFA2D6EBF7433/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpUJHAA3.png" alt="0695b00000SpUJHAA3.png" /&gt;&lt;/span&gt;Here, I loop the json path RequestHeader to fetch the data in it. However, once I run the job, there is no result from talend. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpUJbAAN.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/133381i79BF8C18CCB30C41/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpUJbAAN.png" alt="0695b00000SpUJbAAN.png" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jun 2022 06:21:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307188#M78643</guid>
      <dc:creator>WSyahirah21</dc:creator>
      <dc:date>2022-06-14T06:21:28Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307189#M78644</link>
      <description>&lt;P&gt;Hi &lt;/P&gt;&lt;P&gt;Set Loop Jsonpath query as "$.RequestHeader" and try again. &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Shong&lt;/P&gt;</description>
      <pubDate>Wed, 15 Jun 2022 01:04:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307189#M78644</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-15T01:04:01Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307190#M78645</link>
      <description>&lt;P&gt;Hi Shong, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Currently I am trying to read the nested json from 2 parent json. As you can see here in image:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpkdSAAR.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/142465iD813F17DCC806B28/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpkdSAAR.png" alt="0695b00000SpkdSAAR.png" /&gt;&lt;/span&gt;Currently, I able to read the json data from RequestHeader using tJsonExtractFields component (You can see at the main connection between 2 tJsonExtractFields components).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, the 2nd component I can only do is the "Reject" connection, instead of Main/onComponentOk . Is this possible to read the nested data this way?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Or any idea on this. Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Reference I used : &lt;A href="https://help.talend.com/r/Eizi~hPs0B4M_mO2ot6_1g/Ao7wb2mUfg1hug8GfwRXLw" alt="https://help.talend.com/r/Eizi~hPs0B4M_mO2ot6_1g/Ao7wb2mUfg1hug8GfwRXLw" target="_blank"&gt;https://help.talend.com/r/Eizi~hPs0B4M_mO2ot6_1g/Ao7wb2mUfg1hug8GfwRXLw&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2022 08:53:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307190#M78645</guid>
      <dc:creator>WSyahirah21</dc:creator>
      <dc:date>2022-06-16T08:53:06Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307191#M78646</link>
      <description>&lt;P&gt;Using a tReplicated after tKafkaInput to replicated the data flow so that you can read the json string several times. eg:&lt;/P&gt;&lt;P&gt;tKafkaInput--tReplicated--tExtractJsonField1&lt;/P&gt;&lt;P&gt; ***************************** --tExtractJsonField2&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2022 11:49:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307191#M78646</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-16T11:49:53Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307192#M78647</link>
      <description>&lt;P&gt;Hi Shong, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;is there any ways to merge those 2 component in TDF. Tried tUnite and tMaps, however that didnt worked well. &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000SpqxJAAR.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/130150iBCCE74AA2C64FA4C/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000SpqxJAAR.png" alt="0695b00000SpqxJAAR.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2022 02:50:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307192#M78647</guid>
      <dc:creator>WSyahirah21</dc:creator>
      <dc:date>2022-06-17T02:50:06Z</dc:date>
    </item>
    <item>
      <title>Re: Consume kafka topic and store it in Hive using Talend Studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307193#M78648</link>
      <description>&lt;P&gt;Store the results to  thashOutput, read the data back from memory using tHashInput and merge the data in next subjob.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2022 03:44:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Consume-kafka-topic-and-store-it-in-Hive-using-Talend-Studio/m-p/2307193#M78648</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-06-17T03:44:30Z</dc:date>
    </item>
  </channel>
</rss>

