<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: OCR (Optical character recognition) Scanner for talend. in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299984#M72263</link>
    <description>Hi, 
&lt;BR /&gt;You can open a JIRA issue in the Talend DI project of the 
&lt;A href="https://jira.talendforge.org/secure/Dashboard.jspa" target="_blank" rel="nofollow noopener noreferrer"&gt;JIRA bugtracker&lt;/A&gt; for your new feature. Our component developer will see if this feature can be available in further version.
&lt;BR /&gt;Certainly, you can create a custom component by yourself.
&lt;BR /&gt;Please see the reference:
&lt;A href="http://powerupbi.com/talend/componentCreation_index.html" target="_blank" rel="nofollow noopener noreferrer"&gt;componentCreation&lt;/A&gt;
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
    <pubDate>Wed, 24 Apr 2013 08:19:22 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-04-24T08:19:22Z</dc:date>
    <item>
      <title>OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299981#M72260</link>
      <description>Hi 
&lt;BR /&gt;I have been scouring the internet for about an hour and a half looking for some way to scan a pdf or doc. into talend but have not found any answers or components that can help. 
&lt;BR /&gt;I was wondering if anyone had made or knows of a component that can scan documents and put their produced text into talend for processing. At the moment I am using Free-OCR which is not really the way I want to go as I have to run the program before each talend process which is not very efficient. 
&lt;BR /&gt;Im really hoping somene has a solution to this. 
&lt;BR /&gt;Thanks in advance. 
&lt;BR /&gt;Dean Wake 
&lt;BR /&gt;P.S. I wasnt quite sure where to post this.</description>
      <pubDate>Tue, 23 Apr 2013 13:21:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299981#M72260</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-23T13:21:01Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299982#M72261</link>
      <description>Hi, 
&lt;BR /&gt;We don't have such a component to scan a pdf or doc. Talend is a code generator ETL which use JAVA as the underline technology generated to perform the Data Extraction, Transformation and Loading.
&lt;BR /&gt;Best regareds
&lt;BR /&gt;Sarbina</description>
      <pubDate>Wed, 24 Apr 2013 03:31:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299982#M72261</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-24T03:31:27Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299983#M72262</link>
      <description>Is it possible for me to request such a component? I know it is possible to do through talends as there are many OCR SDK's based on java</description>
      <pubDate>Wed, 24 Apr 2013 07:59:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299983#M72262</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-24T07:59:57Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299984#M72263</link>
      <description>Hi, 
&lt;BR /&gt;You can open a JIRA issue in the Talend DI project of the 
&lt;A href="https://jira.talendforge.org/secure/Dashboard.jspa" target="_blank" rel="nofollow noopener noreferrer"&gt;JIRA bugtracker&lt;/A&gt; for your new feature. Our component developer will see if this feature can be available in further version.
&lt;BR /&gt;Certainly, you can create a custom component by yourself.
&lt;BR /&gt;Please see the reference:
&lt;A href="http://powerupbi.com/talend/componentCreation_index.html" target="_blank" rel="nofollow noopener noreferrer"&gt;componentCreation&lt;/A&gt;
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 24 Apr 2013 08:19:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299984#M72263</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-24T08:19:22Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299985#M72264</link>
      <description>Hi, using 
&lt;A href="http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/" target="_blank" rel="nofollow noopener noreferrer"&gt;ocr scanning technique&lt;/A&gt; to extract text or images from pdf, it supports full-page OCR, auto and manual zonal OCR creation, meanwhile, you can do some simple image processing, such as deskew, despeckle...
&lt;BR /&gt;
&lt;A href="http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/" rel="nofollow noopener noreferrer"&gt;http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/&lt;/A&gt;</description>
      <pubDate>Tue, 04 Jun 2013 03:16:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299985#M72264</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-06-04T03:16:24Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299986#M72265</link>
      <description>&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;Hi, using &lt;A href="http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/" target="_blank" rel="nofollow noopener noreferrer"&gt;ocr scanning technique&lt;/A&gt; to extract text or images from pdf, it supports full-page OCR, auto and manual zonal OCR creation, meanwhile, you can do some simple image processing, such as deskew, despeckle...&lt;BR /&gt;&lt;A href="http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/" target="_blank" rel="nofollow noopener noreferrer"&gt;http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/&lt;/A&gt;&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;i have seen it , looked wonderful</description>
      <pubDate>Tue, 11 Mar 2014 04:25:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299986#M72265</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-03-11T04:25:13Z</dc:date>
    </item>
    <item>
      <title>Re: OCR (Optical character recognition) Scanner for talend.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299987#M72266</link>
      <description>if you want to use free ocr, you can try this 
&lt;A href="http://www.online-code.net/ocr.html" target="_blank" rel="nofollow noopener noreferrer"&gt;free online ocr&lt;/A&gt; service,&amp;nbsp;it supports 40+ languages, and can save converted text to editable txt file and searchable pdf document.</description>
      <pubDate>Sat, 19 Dec 2015 06:40:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OCR-Optical-character-recognition-Scanner-for-talend/m-p/2299987#M72266</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-12-19T06:40:48Z</dc:date>
    </item>
  </channel>
</rss>

