<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Search through documents with QlikView in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Search-through-documents-with-QlikView/m-p/218230#M71395</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello all,&lt;/P&gt;&lt;P&gt;I have just started using a sparkly new Win7Pro x64 PC at work. The primary reason for this was to utilise higher levels of RAM for QlikView applications so I stop developing on the server! However, one of the first issues I have hit is the (deliberate) omission by Microsoft of indexing for network shares unless you make them offline folders (not a practical option!). A colleague was telling me about Google Search Appliance (details &lt;A href="http://www.google.co.uk/intl/en-gb/enterprise/search/gsa.html#utm_campaign=en&amp;amp;utm_source=en-ha-emea-uk-bk&amp;amp;utm_medium=ha&amp;amp;utm_term=google%20enterprise%20search%20appliance" target="_blank" title="Google Search Appliance"&gt;here&lt;/A&gt;) which got me thinking about a QlikView solution.&lt;/P&gt;&lt;P&gt;I've tried to see if someone's already done this but not found anything. Do any of you gentlemen or ladies have some idea about how QlikView could be used to "index" a folder of documents for easy searching? Maybe even get some content from them given the new .docx open format?&lt;/P&gt;&lt;P&gt;Would anyone like to take up the challenge of building one!? Just a thought...[8-|]&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;Jason&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 16 Feb 2011 11:21:25 GMT</pubDate>
    <dc:creator>Jason_Michaelides</dc:creator>
    <dc:date>2011-02-16T11:21:25Z</dc:date>
    <item>
      <title>Search through documents with QlikView</title>
      <link>https://community.qlik.com/t5/QlikView/Search-through-documents-with-QlikView/m-p/218230#M71395</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello all,&lt;/P&gt;&lt;P&gt;I have just started using a sparkly new Win7Pro x64 PC at work. The primary reason for this was to utilise higher levels of RAM for QlikView applications so I stop developing on the server! However, one of the first issues I have hit is the (deliberate) omission by Microsoft of indexing for network shares unless you make them offline folders (not a practical option!). A colleague was telling me about Google Search Appliance (details &lt;A href="http://www.google.co.uk/intl/en-gb/enterprise/search/gsa.html#utm_campaign=en&amp;amp;utm_source=en-ha-emea-uk-bk&amp;amp;utm_medium=ha&amp;amp;utm_term=google%20enterprise%20search%20appliance" target="_blank" title="Google Search Appliance"&gt;here&lt;/A&gt;) which got me thinking about a QlikView solution.&lt;/P&gt;&lt;P&gt;I've tried to see if someone's already done this but not found anything. Do any of you gentlemen or ladies have some idea about how QlikView could be used to "index" a folder of documents for easy searching? Maybe even get some content from them given the new .docx open format?&lt;/P&gt;&lt;P&gt;Would anyone like to take up the challenge of building one!? Just a thought...[8-|]&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;Jason&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 16 Feb 2011 11:21:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Search-through-documents-with-QlikView/m-p/218230#M71395</guid>
      <dc:creator>Jason_Michaelides</dc:creator>
      <dc:date>2011-02-16T11:21:25Z</dc:date>
    </item>
    <item>
      <title>AW:Search through documents with QlikView</title>
      <link>https://community.qlik.com/t5/QlikView/Search-through-documents-with-QlikView/m-p/218231#M71396</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Jason,&lt;/P&gt;&lt;P&gt;this is doable, we did it for one of our customers recently with indexing PDF files &lt;span class="lia-unicode-emoji" title=":grinning_face_with_smiling_eyes:"&gt;😄&lt;/span&gt;&lt;/P&gt;&lt;P&gt;You can write a recursive subroutine which reads folder by folder and file by file. You may want to exclude some directories such as C:\Windows or file types. However the basic script is as simple as the following.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;SUB GetPaths(dir)&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;FOR EACH subdir in dirlist( '$(dir)' &amp;amp; '\*' )&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;docpath:&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;LOAD '$(subdir)' as docpath&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;AUTOGENERATE 1;&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;CALL GetPaths('$(subdir)')&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;NEXT&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;END SUB&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;CALL GetPaths('C:');&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt; &lt;P&gt;&lt;/P&gt;&lt;P&gt;You can do similiar for files then:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-style: italic; padding-left: 30px"&gt;for each file in filelist(subdir &amp;amp; '\*.*')&lt;/P&gt;&lt;P style="padding-left:30px;"&gt;&lt;I&gt;...&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Within this piece of code you can check for the file type and if readable (e.g. txt) load it into QlikView either as one, line by line or however. I never tried that for *.docx but it seems to work except that you will loose some meta data like page number etc. You also need to handle the header "crap", but I didn't say it is easy &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Feel free to ask further...&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;cheers&lt;/P&gt;&lt;P&gt;Florian&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 16 Feb 2011 17:19:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Search-through-documents-with-QlikView/m-p/218231#M71396</guid>
      <dc:creator />
      <dc:date>2011-02-16T17:19:45Z</dc:date>
    </item>
  </channel>
</rss>

