<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: read pdf file or word document in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334704#M103345</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;Here is a custom component written&amp;nbsp; by talend community user and shared on talend exchange portal.&lt;/P&gt; 
&lt;P&gt;&lt;A title="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180" href="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180" target="_self" rel="nofollow noopener noreferrer"&gt;https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;If you want to install a custom component into studio, this online document &lt;A title="TalendHelpCenter:How to install and update a custom component" href="https://help.talend.com/reader/2AWmA~w4VvlfP3JC7dTR2w/Kvx1JE1dQGJgGhfFoHK8xA" target="_self" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:How to install and update a custom component&lt;/A&gt; will help.&lt;/P&gt; 
&lt;P&gt;Best regards&lt;/P&gt; 
&lt;P&gt;Sabrina&lt;/P&gt;</description>
    <pubDate>Wed, 28 Mar 2018 11:03:29 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2018-03-28T11:03:29Z</dc:date>
    <item>
      <title>read pdf file or word document</title>
      <link>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334703#M103344</link>
      <description>&lt;P&gt;hi all,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;i have a problem statement to read a pdf file's content(which is text not images) and extract the text with bold letters. Since there are no PDF related components, i tried converting the pdf to word document prior reading with talend. I tried reading the word doc with tfileinput(fullrow/delimited) but of no luck.&lt;/P&gt;
&lt;P&gt;How can i read the data in any of the formats?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Any help is appreciated.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Mar 2018 07:49:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334703#M103344</guid>
      <dc:creator>KarthikGs</dc:creator>
      <dc:date>2018-03-27T07:49:45Z</dc:date>
    </item>
    <item>
      <title>Re: read pdf file or word document</title>
      <link>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334704#M103345</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;Here is a custom component written&amp;nbsp; by talend community user and shared on talend exchange portal.&lt;/P&gt; 
&lt;P&gt;&lt;A title="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180" href="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180" target="_self" rel="nofollow noopener noreferrer"&gt;https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F140&amp;amp;pi=marketplace%252F1%252Fproducts%252F140%252Fitems%252F180&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;If you want to install a custom component into studio, this online document &lt;A title="TalendHelpCenter:How to install and update a custom component" href="https://help.talend.com/reader/2AWmA~w4VvlfP3JC7dTR2w/Kvx1JE1dQGJgGhfFoHK8xA" target="_self" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:How to install and update a custom component&lt;/A&gt; will help.&lt;/P&gt; 
&lt;P&gt;Best regards&lt;/P&gt; 
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Wed, 28 Mar 2018 11:03:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334704#M103345</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-03-28T11:03:29Z</dc:date>
    </item>
    <item>
      <title>Re: read pdf file or word document</title>
      <link>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334705#M103346</link>
      <description>&lt;P&gt;hi,&lt;/P&gt;
&lt;P&gt;thanks for your reply, i understood the component tpdftoText is capable of converting a PDF in to a text file. But my requirement is to read the PDF or word(.docx) file and to apply transformations while reading it.&lt;/P&gt;
&lt;P&gt;can i read the PDF file as it is and extract the required string from it by applying filters?&lt;/P&gt;</description>
      <pubDate>Thu, 29 Mar 2018 05:40:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/read-pdf-file-or-word-document/m-p/2334705#M103346</guid>
      <dc:creator>KarthikGs</dc:creator>
      <dc:date>2018-03-29T05:40:51Z</dc:date>
    </item>
  </channel>
</rss>

