<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Storing Data from Kafka Input in smaller files in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245064#M31040</link>
    <description>&lt;P&gt;Looking on Your original screenshot - You have something wrong in 2 components&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Not for HDFS, but it work like this:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2017-06-07 at 11.47.10 PM.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lu6W.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154439iDE5B9CE5CCF7E486/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lu6W.png" alt="0683p000009Lu6W.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;You must define variable before component start work, in my case - tJavaFlex3, an result will be:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2017-06-07 at 11.47.22 PM.png" style="width: 337px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009LuaL.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/128841i3D3F52A44C77A9AC/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009LuaL.png" alt="0683p000009LuaL.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 07 Jun 2017 12:49:30 GMT</pubDate>
    <dc:creator>vapukov</dc:creator>
    <dc:date>2017-06-07T12:49:30Z</dc:date>
    <item>
      <title>Storing Data from Kafka Input in smaller files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245060#M31036</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;In my usecase, i am receiving data in Kafka 24*7 in ebcdic. I read the data as byte array from kafkainput and parse the data using the tHMAP. Output of tHmap goes to tjavarow as byte array. output from the tJavaRow is getting stored in tHDFSOutput. Initial tJava component to create the filename with datetimestamp.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Here are my challenges&lt;/P&gt; 
&lt;P&gt;1) my data is getting stored as a single file in hdfs. I need to have them smaller file, as i might need to see the data from morning to afternoon (or to any point of time). Because of me having the data in a single file, i am unable to fetch data.&lt;/P&gt; 
&lt;P&gt;2) I tried updating the filename in the tJavaRow (that is the reason for inserting the component here). But i am unable to change the file name.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Need some suggestions to get this done.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Below is a flow, recreated for understanding.&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="desktop.PNG" style="width: 746px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lupt.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/153792iD689A37F82AC6EFC/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lupt.png" alt="0683p000009Lupt.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 09:42:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245060#M31036</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T09:42:06Z</dc:date>
    </item>
    <item>
      <title>Re: Storing Data from Kafka Input in smaller files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245061#M31037</link>
      <description>&lt;P&gt;just the question, if reason for split only access to timed data - why not query over HDFS files?&lt;/P&gt;&lt;P&gt;- SQL&lt;/P&gt;&lt;P&gt;- Hive&lt;/P&gt;&lt;P&gt;- Drill&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Jun 2017 23:13:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245061#M31037</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2017-06-01T23:13:13Z</dc:date>
    </item>
    <item>
      <title>Re: Storing Data from Kafka Input in smaller files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245062#M31038</link>
      <description>&lt;P&gt;For a day, i am expecting around 100GB data to be received. So, i am storing the data in HDFS with partition. But, the file initially created only being used to store the data despite the partition on date created on run time in tjavarow component.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Jun 2017 05:05:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245062#M31038</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-02T05:05:54Z</dc:date>
    </item>
    <item>
      <title>Re: Storing Data from Kafka Input in smaller files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245063#M31039</link>
      <description>&lt;P&gt;any suggestions ?&lt;/P&gt;</description>
      <pubDate>Wed, 07 Jun 2017 11:56:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245063#M31039</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-07T11:56:58Z</dc:date>
    </item>
    <item>
      <title>Re: Storing Data from Kafka Input in smaller files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245064#M31040</link>
      <description>&lt;P&gt;Looking on Your original screenshot - You have something wrong in 2 components&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Not for HDFS, but it work like this:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2017-06-07 at 11.47.10 PM.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lu6W.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154439iDE5B9CE5CCF7E486/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lu6W.png" alt="0683p000009Lu6W.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;You must define variable before component start work, in my case - tJavaFlex3, an result will be:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2017-06-07 at 11.47.22 PM.png" style="width: 337px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009LuaL.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/128841i3D3F52A44C77A9AC/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009LuaL.png" alt="0683p000009LuaL.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 07 Jun 2017 12:49:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Storing-Data-from-Kafka-Input-in-smaller-files/m-p/2245064#M31040</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2017-06-07T12:49:30Z</dc:date>
    </item>
  </channel>
</rss>

