Hi there,
Which palette components should I use to:
1. Accept a PDF file
2. Convert this file to text (would tMap be a good option?)
3. Store the text file on HDFS
I have written a routine that converts the PDF file to a text file.
But I am stuck on which palette components to use. Please help!
Hi,
There is no official PDF component in Talend.
However, there is a custom component, tPDFToText, written by a Talend community user and shared on Talend Exchange, which converts a PDF to a text file.
Best regards
Sabrina
Thank you for the reply.
The tPDFToText component is quite static.
It takes only one file at a time, and the output file name also has to be specified in the component settings.
Is there any way to make it dynamic?
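For illustration, here is a minimal Java sketch of the "dynamic" part: listing every PDF in a folder and deriving the output text file name from each input name instead of hard-coding it. The class and method names (PdfOutputNames, toTextName, planConversions) are hypothetical and not part of Talend or tPDFToText. In a job, the same idea is usually built by iterating with tFileList and reading the current file path from globalMap; whether tPDFToText accepts such an expression in its settings is an assumption that would need checking.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Map;
import java.util.TreeMap;

public class PdfOutputNames {

    // Hypothetical helper: "report.pdf" -> "report.txt".
    // The extension check is case-insensitive; names without a
    // .pdf extension simply get ".txt" appended.
    public static String toTextName(String pdfFileName) {
        if (pdfFileName.toLowerCase().endsWith(".pdf")) {
            return pdfFileName.substring(0, pdfFileName.length() - 4) + ".txt";
        }
        return pdfFileName + ".txt";
    }

    // Hypothetical helper: pair every *.pdf in inputDir with a
    // derived output path in outputDir. Nothing is converted here;
    // the map only says which output name belongs to which input.
    // Note: whether the "*.pdf" glob matches "X.PDF" depends on the platform.
    public static Map<Path, Path> planConversions(Path inputDir, Path outputDir)
            throws IOException {
        Map<Path, Path> plan = new TreeMap<>(); // sorted for stable order
        try (DirectoryStream<Path> pdfs = Files.newDirectoryStream(inputDir, "*.pdf")) {
            for (Path pdf : pdfs) {
                String outName = toTextName(pdf.getFileName().toString());
                plan.put(pdf, outputDir.resolve(outName));
            }
        }
        return plan;
    }

    public static void main(String[] args) throws IOException {
        Path in = Paths.get(args.length > 0 ? args[0] : ".");
        Path out = Paths.get(args.length > 1 ? args[1] : ".");
        for (Map.Entry<Path, Path> e : planConversions(in, out).entrySet()) {
            // In a real job, the PDF-to-text routine would be called here
            // with e.getKey() as input and e.getValue() as output.
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```

Each input/output pair can then be handed to the existing PDF-to-text routine one at a time, so no file name has to be typed into the component settings by hand.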