Skip to main content
Announcements
A fresh, new look for the Data Integration & Quality forums and navigation! Read more about what's changed.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Screen scraping and feed into a MySQL database

Hi all,
I am new to Talend and hope I will be able to find the answers to my questions here.
I need to scrape data from websites like this one:  
and to feed data into a local MySQL database. I would like to know:
1/ If Talend DI is the appropriate soft to do that
2/ If yes, how to proceed (if someone had a example of a similar job I can inspire from that would be awesome)
Thank you in advance for you help,
Carl
Labels (3)
2 Replies
Anonymous
Not applicable
Author

Hi Carl,
Talend is a supporting tool for screen scrapping. Screen scrapping you will have to do with third party tools like selenium(perl/java) / iMacros etc. Use tSystem component to pass parameters and execute the script and once the scrapping is performed, you will get the file in the required format. Create metadata for that file and insert into the MySQL.
Hope you got an idea.
Thanks
vaibhav
Anonymous
Not applicable
Author

This can be done with Talend, HTMLParser (Java API) and Java. It does take a bit of Java knowledge, but it really isn't too hard. I am currently writing a tutorial on this for my website. You can download the example Job to extrapolate from, as with all of my tutorials.