<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [Resolved] How to use tTikaExtractor ? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300335#M72572</link>
    <description>Thanks for your reply,&lt;BR /&gt;No I'm trying to parse .docx files.&lt;BR /&gt;When I try to use tFixedFlowInput, I canot even make the link between the 2 components. Should I change something in the tFixedFlow Input ?&lt;BR /&gt;What should be the shema for example ?&lt;BR /&gt;Thanks !</description>
    <pubDate>Tue, 19 May 2015 08:58:52 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-05-19T08:58:52Z</dc:date>
    <item>
      <title>[Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300333#M72570</link>
      <description>Hello!&lt;BR /&gt;I'm trying to use tTikaExtractor to parse some word files.&lt;BR /&gt;But I have no idea what component I should use for the output. When I try with a fixedflowinput I cannot connect it.&lt;BR /&gt;Any help ?&lt;BR /&gt;Thanks a lot !</description>
      <pubDate>Mon, 18 May 2015 16:22:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300333#M72570</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-18T16:22:37Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300334#M72571</link>
      <description>Hi,&lt;BR /&gt;Do you want to parse HTML?&lt;BR /&gt;Have you tried to use tTikaExtractor -&amp;gt; tFixedFlowInput -&amp;gt; tFileOutputDelimited?&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Tue, 19 May 2015 04:09:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300334#M72571</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-19T04:09:16Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300335#M72572</link>
      <description>Thanks for your reply,&lt;BR /&gt;No I'm trying to parse .docx files.&lt;BR /&gt;When I try to use tFixedFlowInput, I canot even make the link between the 2 components. Should I change something in the tFixedFlow Input ?&lt;BR /&gt;What should be the shema for example ?&lt;BR /&gt;Thanks !</description>
      <pubDate>Tue, 19 May 2015 08:58:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300335#M72572</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-19T08:58:52Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300336#M72573</link>
      <description>Hi,&lt;BR /&gt;Have you already checked component introduction about &lt;A href="https://exchange.talend.com/#marketplaceproductoverview:gallery=marketplace%2F1&amp;amp;pi=marketplace%2F1%2Fproducts%2F134%2Fitems%2F170" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendExchange:tTikaExtractor&lt;/A&gt;?&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Tue, 19 May 2015 10:18:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300336#M72573</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-19T10:18:37Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300337#M72574</link>
      <description>Yes, I have already checked the component description, for example I would like to use the CONTENT_XHTML property, how can I define this in the tFixedFlowInput ? 
&lt;BR /&gt;Edit : 
&lt;BR /&gt;For example, I created this job : 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDi2.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/132892i3A3AE2EA85E6C972/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDi2.png" alt="0683p000009MDi2.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;What is the configuration of the FixedFlowInput ? 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDi7.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137216i4FDBCE2D97100DD5/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDi7.png" alt="0683p000009MDi7.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;I can't figure out how to configure this 
&lt;BR /&gt;Any help ? Thanks !</description>
      <pubDate>Tue, 19 May 2015 13:51:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300337#M72574</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-19T13:51:13Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300338#M72575</link>
      <description>Ok, I found how to do it, maybe it will be uselfull for someone else. 
&lt;BR /&gt;How to get data from tTikaExctrator in a tRowGenerator component : 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDh5.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138914i3EB8533A4BC17F38/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDh5.png" alt="0683p000009MDh5.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Wed, 20 May 2015 16:42:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300338#M72575</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-05-20T16:42:22Z</dc:date>
    </item>
    <item>
      <title>Re: [Resolved] How to use tTikaExtractor ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300339#M72576</link>
      <description>Hi, 
&lt;BR /&gt;Tika extractor is a very powerfull component for pdf extraction and doc also. I recently downloaded the 1.11 version from&amp;nbsp; apache, put il in the ttika folder and just change the reference to it on tTikaExtractor_java.xml in the section : 
&lt;BR /&gt;&amp;lt;CODEGENERATION&amp;gt; 
&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;IMPORTS&amp;gt; 
&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;lt;IMPORT 
&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; NAME="tika" 
&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MODULE="tika-app-1.11.jar" 
&lt;BR /&gt;Requires java 1.7</description>
      <pubDate>Wed, 28 Oct 2015 12:41:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Resolved-How-to-use-tTikaExtractor/m-p/2300339#M72576</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-10-28T12:41:15Z</dc:date>
    </item>
  </channel>
</rss>

