<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Real-Time Big Data - Storm job - help with configuration in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332170#M101076</link>
    <description>My conclusion at this point: 
&lt;BR /&gt;- Talend doesn't actively support Storm; they recommend using Spark Streaming (so why not just removing Storm connector?) 
&lt;BR /&gt;- Buying Talend Fabric (Talend Enterprise edition) is a waste of money without getting support; better trying to work with Talend Open Studio at that point.</description>
    <pubDate>Thu, 15 Sep 2016 18:14:35 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-09-15T18:14:35Z</dc:date>
    <item>
      <title>Real-Time Big Data - Storm job - help with configuration</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332168#M101074</link>
      <description>&lt;P&gt;I'm completed a job that uses Storm components by following this tutorial: 'Getting started with a Storm Job' on help talend website.&lt;BR /&gt;I'm using Talend Fabric 6.2.1&lt;BR /&gt;When I run the job I get the following error:&lt;BR /&gt;java.lang.RuntimeException: java.io.FileNotFoundException: stormtestfromstandard_0_1.jar (The system cannot find the file specified)&lt;BR /&gt;at backtype.storm.StormSubmitter.submitJar(StormSubmitter.java:164)&lt;BR /&gt;at org.talend.libs.tbd.ee.libstorm.ClusterStormJobRunHelper.submitJob(ClusterStormJobRunHelper.java:66)&lt;BR /&gt;at org.talend.libs.tbd.ee.libstorm.StormJobRunHelper.runStorm(StormJobRunHelper.java:96)&lt;BR /&gt;at bigdata_project.stormtestfromstandard_0_1.StormTestFromStandard.runJobInTOS(StormTestFromStandard.java:627)&lt;BR /&gt;at bigdata_project.stormtestfromstandard_0_1.StormTestFromStandard.main(StormTestFromStandard.java:572)&lt;BR /&gt;Caused by: java.io.FileNotFoundException: stormtestfromstandard_0_1.jar (The system cannot find the file specified)&lt;BR /&gt;at java.io.FileInputStream.open0(Native Method)&lt;BR /&gt;at java.io.FileInputStream.open(FileInputStream.java:195)&lt;BR /&gt;at java.io.FileInputStream.&amp;lt;init&amp;gt;(FileInputStream.java:138)&lt;BR /&gt;at java.io.FileInputStream.&amp;lt;init&amp;gt;(FileInputStream.java:93)&lt;BR /&gt;at backtype.storm.utils.BufferFileInputStream.&amp;lt;init&amp;gt;(BufferFileInputStream.java:31)&lt;BR /&gt;&lt;BR /&gt;What I noticed is that when I convert a standard job to a 'Big Data Streaming' job, some components are marked as missing, like for example: tHDFSConnection and KafkaConnection that are critical components to configure Kafka and Hadoop. When I look at the configuration available under 'Storm Configuration' (Run tab) there is a section called Storm Configuration where you are supposed to enter configuration parameters in order to connect to Hadoop/Storm cluster. I couldn't find any document and/or tutorial that explains which parameters are required and how the 'Name' should be formatted.&lt;BR /&gt;Below is my job (it's very basic, but quite frustrating):&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MFhB.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/145047i18BFF9384C249213/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MFhB.jpg" alt="0683p000009MFhB.jpg" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;The main components I'm using:&lt;BR /&gt;- tKafkaInput: to ingest a stream of data&lt;BR /&gt;- tJavaStorm (for Storm job) and tJavarow (for Spark job) to convert the incoming string to an array&lt;BR /&gt;- tAggregaterow to count the number of elements after applying a grouping&lt;BR /&gt;- tLogRow to display the result.&lt;BR /&gt;&lt;BR /&gt;Any help or direction is highly appreciated.&lt;BR /&gt;&lt;BR /&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 10:25:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332168#M101074</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:25:28Z</dc:date>
    </item>
    <item>
      <title>Re: Real-Time Big Data - Storm job - help with configuration</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332169#M101075</link>
      <description>Other elements for the troubleshooting: 
&lt;BR /&gt;- the file that is claimed to be missing, it's actually not missing; I can see it in the folder C:/Users/user/workspace/.Java/target 
&lt;BR /&gt;- I can see also on the remote Hadoop cluster the jar file, under&amp;nbsp;/hadoop/storm/nimbus/inbox/ 
&lt;BR /&gt; 
&lt;BR /&gt;I guess it's an issue with permissions, but like I said in my previous post, there is not flexibility to configure HDFS or Kafka in Talend (when selecting a Big Data Streaming job); the only hope is in the 'Storm Configuration' parameters, but I'm going blind without any documents that explains how to use them. 
&lt;BR /&gt; 
&lt;BR /&gt;Thanks.</description>
      <pubDate>Wed, 31 Aug 2016 01:28:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332169#M101075</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-08-31T01:28:03Z</dc:date>
    </item>
    <item>
      <title>Re: Real-Time Big Data - Storm job - help with configuration</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332170#M101076</link>
      <description>My conclusion at this point: 
&lt;BR /&gt;- Talend doesn't actively support Storm; they recommend using Spark Streaming (so why not just removing Storm connector?) 
&lt;BR /&gt;- Buying Talend Fabric (Talend Enterprise edition) is a waste of money without getting support; better trying to work with Talend Open Studio at that point.</description>
      <pubDate>Thu, 15 Sep 2016 18:14:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Real-Time-Big-Data-Storm-job-help-with-configuration/m-p/2332170#M101076</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-09-15T18:14:35Z</dc:date>
    </item>
  </channel>
</rss>

