Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us in Bucharest on Sept 18th for Qlik's AI Reality Tour! Register Now
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

PDF to Text conversion in Talend

Hi there..
What palette should i use to 
1. Accept a pdf file
2. Convert this file to text (would using tmap be a good option???)
3. Store the text file on HDFS
I have generate a routine which will convert the pdf file to text file.
But i kind of stuck with the palette!!! Plz help!!
Labels (2)
3 Replies
Anonymous
Not applicable
Author

Hi,
There is no official PDF component in Talend.
Here is a custom component tPDFToText wriiten by Talend community user and shared on Talend Exchange which is used to convert a PDF to text file.
Best regards
Sabrina
Anonymous
Not applicable
Author

Thank you for the reply..
The component tPDFtoText is quite static.
It takes only one file at a time and also have to specify the output file name in the component setting.
Any way to make it dynamic!!!
Anonymous
Not applicable
Author

Hi,
Feel free to create a custom component for your job requirement.
Here is a Knowleadge base article about:TalendHelpCenter:How to create a custom component
Best regards
Sabrina