Skip to main content
Announcements
Qlik Connect 2024! Seize endless possibilities! LEARN MORE
cancel
Showing results for 
Search instead for 
Did you mean: 
Keitaru
Creator
Creator

How to identify recurrent terms and remove timestamp/stopwords in an unstructured text field

Hi Experts,

With reference to How to identify recurrent terms in text fields? 

Which my script is based on, I'm looking at how i can modify the script to stop collecting  things such as date timestamp, stop words etc into the WordTuple. Also using Tri-gram idea to be able to identify recurring terms so as to be able to use this data for topic modeling later.

Due to the sensitivity of my data I'm unable to share it here.

Like to have it just store 3 word phrases.

 

 

 

 

1 Reply
Keitaru
Creator
Creator
Author

Anyone able to help?