<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to identify recurrent terms and remove timestamp/stopwords in an unstructured text field in App Development</title>
    <link>https://community.qlik.com/t5/App-Development/How-to-identify-recurrent-terms-and-remove-timestamp-stopwords/m-p/1547830#M39448</link>
    <description>&lt;P&gt;Hi Experts,&lt;/P&gt;&lt;P&gt;With reference to&amp;nbsp;&lt;A title="How to identify recurrent terms in text fields?" href="https://community.qlik.com/t5/QlikView-App-Development/How-to-identify-recurrent-terms-in-text-fields/m-p/1547796/highlight/true#M440004" target="_blank" rel="noopener"&gt;How to identify recurrent terms in text fields?&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Which my script is based on, I'm looking at how i can modify the script to stop collecting&amp;nbsp; things such as date timestamp, stop words etc into the WordTuple. Also using Tri-gram idea to be able to identify recurring terms so as to be able to use this data for topic modeling later.&lt;/P&gt;&lt;P&gt;Due to the sensitivity of my data I'm unable to share it here.&lt;/P&gt;&lt;P&gt;Like to have it just store 3 word phrases.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 11 Sep 2019 07:51:29 GMT</pubDate>
    <dc:creator>Keitaru</dc:creator>
    <dc:date>2019-09-11T07:51:29Z</dc:date>
    <item>
      <title>How to identify recurrent terms and remove timestamp/stopwords in an unstructured text field</title>
      <link>https://community.qlik.com/t5/App-Development/How-to-identify-recurrent-terms-and-remove-timestamp-stopwords/m-p/1547830#M39448</link>
      <description>&lt;P&gt;Hi Experts,&lt;/P&gt;&lt;P&gt;With reference to&amp;nbsp;&lt;A title="How to identify recurrent terms in text fields?" href="https://community.qlik.com/t5/QlikView-App-Development/How-to-identify-recurrent-terms-in-text-fields/m-p/1547796/highlight/true#M440004" target="_blank" rel="noopener"&gt;How to identify recurrent terms in text fields?&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Which my script is based on, I'm looking at how i can modify the script to stop collecting&amp;nbsp; things such as date timestamp, stop words etc into the WordTuple. Also using Tri-gram idea to be able to identify recurring terms so as to be able to use this data for topic modeling later.&lt;/P&gt;&lt;P&gt;Due to the sensitivity of my data I'm unable to share it here.&lt;/P&gt;&lt;P&gt;Like to have it just store 3 word phrases.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Sep 2019 07:51:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/App-Development/How-to-identify-recurrent-terms-and-remove-timestamp-stopwords/m-p/1547830#M39448</guid>
      <dc:creator>Keitaru</dc:creator>
      <dc:date>2019-09-11T07:51:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to identify recurrent terms in text fields and also not include timestamp/stopwords</title>
      <link>https://community.qlik.com/t5/App-Development/How-to-identify-recurrent-terms-and-remove-timestamp-stopwords/m-p/1622458#M46340</link>
      <description>Anyone able to help?</description>
      <pubDate>Wed, 11 Sep 2019 07:18:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/App-Development/How-to-identify-recurrent-terms-and-remove-timestamp-stopwords/m-p/1622458#M46340</guid>
      <dc:creator>Keitaru</dc:creator>
      <dc:date>2019-09-11T07:18:15Z</dc:date>
    </item>
  </channel>
</rss>

