<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Count Occurrence Word From Social Media in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299967#M72246</link>
    <description>Hi, 
&lt;BR /&gt;I just wonder and need everyone of you on this matter. I required to count the occurrence of word from social media such as blog, facebook etc. But im not sure if there's any freeware than can integrated with Talend to count the occurrences. 
&lt;BR /&gt;I don't think by creating ETL job can counting the occurrence fast and real-time. 
&lt;BR /&gt;Plz help to advice me 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Kal</description>
    <pubDate>Sat, 16 Nov 2024 12:05:35 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T12:05:35Z</dc:date>
    <item>
      <title>Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299967#M72246</link>
      <description>Hi, 
&lt;BR /&gt;I just wonder and need everyone of you on this matter. I required to count the occurrence of word from social media such as blog, facebook etc. But im not sure if there's any freeware than can integrated with Talend to count the occurrences. 
&lt;BR /&gt;I don't think by creating ETL job can counting the occurrence fast and real-time. 
&lt;BR /&gt;Plz help to advice me 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Kal</description>
      <pubDate>Sat, 16 Nov 2024 12:05:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299967#M72246</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:05:35Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299968#M72247</link>
      <description>Hi, 
&lt;BR /&gt;The most important thing is that you need extract the information from Facebook or Social Media by talend, first and then do the action of counting . So I think the 
&lt;A href="https://community.qlik.com/s/feed/0D53p00007vCp6wCAC" target="_blank" rel="nofollow noopener noreferrer"&gt;https://community.talend.com/t5/Design-and-Development/FaceBook/td-p/99612&lt;/A&gt; is useful for you. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Thu, 07 Mar 2013 10:03:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299968#M72247</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-07T10:03:32Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299969#M72248</link>
      <description>Hi,&lt;BR /&gt;Thanks for the information, after i extract the information from social media/facebook, how do i want to counting it?&lt;BR /&gt;Rgds,&lt;BR /&gt;Kal</description>
      <pubDate>Fri, 08 Mar 2013 03:18:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299969#M72248</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-08T03:18:03Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299970#M72249</link>
      <description>Hi, 
&lt;BR /&gt;There is component 
&lt;A href="https://help.talend.com/search/all?query=tFileRowCount&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;tFileRowCount&lt;/A&gt;.The function is counting the number of rows in a file.
&lt;BR /&gt;The work flow may be Source file--&amp;gt;tFileInputxx--&amp;gt;tFileRowCount--&amp;gt;tFileOutputxx
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Fri, 08 Mar 2013 03:35:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299970#M72249</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-08T03:35:19Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299971#M72250</link>
      <description>Hi,&lt;BR /&gt;My source file is SQL Server. How do i wants to connect to tFileRowCount? Also, i wants to count the occurrence of each word. Is that possible?&lt;BR /&gt;Thanks,&lt;BR /&gt;Kal</description>
      <pubDate>Fri, 08 Mar 2013 04:22:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299971#M72250</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-08T04:22:37Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299972#M72251</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Hi,&lt;BR /&gt;My source file is SQL Server. How do i wants to connect to tFileRowCount? Also, i wants to count the occurrence of each word. Is that possible?&lt;BR /&gt;Thanks,&lt;BR /&gt;Kal&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Yes, you can count each word of a string, use tNormalize to normalize the data to multiple lines with the separator " ", for example, you have a data like: 
&lt;BR /&gt;"this is an example for tNormalize component" 
&lt;BR /&gt;to: 
&lt;BR /&gt;this 
&lt;BR /&gt;is 
&lt;BR /&gt;an 
&lt;BR /&gt;example 
&lt;BR /&gt;for 
&lt;BR /&gt;tNormalize 
&lt;BR /&gt;component 
&lt;BR /&gt;Then link tNormalize to 
&lt;A href="https://help.talend.com/search/all?query=tAggregateRow&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;tAggregateRow&lt;/A&gt; to for counting the number of each word with the 'count' operator. 
&lt;BR /&gt;tMSSQLlnput--main--tNormalize--main--tAggregateRow---tLogRow 
&lt;BR /&gt;Shong</description>
      <pubDate>Fri, 08 Mar 2013 04:34:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299972#M72251</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-08T04:34:47Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299973#M72252</link>
      <description>Hi,
&lt;BR /&gt;I've followed your suggestion and it's worked but there's a little issue i faced where a few words are not isolated and i noticed it happened on the first word of sentence after full stop sign "."
&lt;BR /&gt;For example:
&lt;BR /&gt;"i like to watch movie. I like eat too"
&lt;BR /&gt;Expected output:
&lt;BR /&gt;-------------------
&lt;BR /&gt;i
&lt;BR /&gt;like
&lt;BR /&gt;to
&lt;BR /&gt;watch
&lt;BR /&gt;movie
&lt;BR /&gt;i
&lt;BR /&gt;like
&lt;BR /&gt;eat
&lt;BR /&gt;too
&lt;BR /&gt;Current output:
&lt;BR /&gt;-----------------
&lt;BR /&gt;i
&lt;BR /&gt;like
&lt;BR /&gt;to
&lt;BR /&gt;watch
&lt;BR /&gt;movie. I \\this is the issue
&lt;BR /&gt;like
&lt;BR /&gt;eat
&lt;BR /&gt;too
&lt;BR /&gt;
&lt;BR /&gt;Could you figure out the issue?</description>
      <pubDate>Thu, 14 Mar 2013 04:19:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299973#M72252</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-14T04:19:37Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299974#M72253</link>
      <description>Hi 
&lt;BR /&gt;Remove the special character such as ",", "." and so on before normalizing the string, for example: 
&lt;BR /&gt;row1.line.replaceAll(".","") 
&lt;BR /&gt;If the string may contains more types of special character, it is better to define a function to handle the special characters in a routine, define a list to add all characters that may exist in the string, then each character and remove it from the string. Then, call the routine to remove all special characters on a tMap for example before tNormalize: 
&lt;BR /&gt;tMSSQLlnput--main--tMap-main--&amp;gt;tNormalize--main--tAggregateRow---tLogRow 
&lt;BR /&gt; 
&lt;BR /&gt;Shong</description>
      <pubDate>Thu, 14 Mar 2013 05:37:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299974#M72253</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-14T05:37:55Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299975#M72254</link>
      <description>Hi Shong, 
&lt;BR /&gt;Actually, I did removed special characters including ".". But it returned me like this 
&lt;BR /&gt;Current output: 
&lt;BR /&gt;----------------- 
&lt;BR /&gt;i 
&lt;BR /&gt;like 
&lt;BR /&gt;to 
&lt;BR /&gt;watch 
&lt;BR /&gt;movie I \\this is the issue 
&lt;BR /&gt;like 
&lt;BR /&gt;eat 
&lt;BR /&gt;too 
&lt;BR /&gt;Refer my job design. 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDuK.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/132455i0531A106425B9660/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDuK.jpg" alt="0683p000009MDuK.jpg" /&gt;&lt;/span&gt;</description>
      <pubDate>Thu, 14 Mar 2013 07:36:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299975#M72254</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-14T07:36:40Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299976#M72255</link>
      <description>Hi 
&lt;BR /&gt;In principle, there should be a space after character in English, however there is no a space after "." in your case, in order to avoid this situation, you can always replace a character with a space, for example: 
&lt;BR /&gt;row1.line.replaceAll("\\."," ") 
&lt;BR /&gt;And then, use a tfiterRow to remove the empty lines. 
&lt;BR /&gt;Shong</description>
      <pubDate>Thu, 14 Mar 2013 08:38:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299976#M72255</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-14T08:38:14Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299977#M72256</link>
      <description>Hi, 
&lt;BR /&gt;I want to replace with ; sign. For example i have a sentence like this 
&lt;BR /&gt;"i like eat. i like drink" 
&lt;BR /&gt;expected output 
&lt;BR /&gt;------------------ 
&lt;BR /&gt;"i;like;eat.;i;like;drink" 
&lt;BR /&gt;current output 
&lt;BR /&gt;---------------- 
&lt;BR /&gt;"i;like;eat. i;like;drink" 
&lt;BR /&gt;How do i wants to put any function to replace between end of sentence and 1st word of next sentence? 
&lt;BR /&gt;Plz help me 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Fri, 15 Mar 2013 02:15:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299977#M72256</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-15T02:15:27Z</dc:date>
    </item>
    <item>
      <title>Re: Count Occurrence Word From Social Media</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299978#M72257</link>
      <description>Hi 
&lt;BR /&gt;Please make sure there is a space after "." in your string, if I use the expression 
&lt;BR /&gt;row1.c.replaceAll(" ",";") 
&lt;BR /&gt;It output the right result: 
&lt;BR /&gt; 
&lt;PRE&gt; connecting to socket on port 3480&lt;BR /&gt; connected&lt;BR /&gt;i;like;eat.;i;like;drink&lt;BR /&gt; disconnected&lt;/PRE&gt; 
&lt;BR /&gt;Shong</description>
      <pubDate>Fri, 15 Mar 2013 02:39:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Count-Occurrence-Word-From-Social-Media/m-p/2299978#M72257</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-03-15T02:39:22Z</dc:date>
    </item>
  </channel>
</rss>

