
Anonymous
Not applicable
2014-08-21
06:09 AM
Screen scraping and feed into a MySQL database
Hi all,
I am new to Talend and hope I will be able to find the answers to my questions here.
I need to scrape data from websites like this one:
and feed the data into a local MySQL database. I would like to know:
1/ Whether Talend DI is the appropriate software for this
2/ If so, how to proceed (if someone has an example of a similar job I could draw on, that would be awesome)
Thank you in advance for your help,
Carl
154 Views
2 Replies

Anonymous
Not applicable
2014-08-21
06:12 AM
Hi Carl,
Talend plays a supporting role in screen scraping. The scraping itself has to be done with a third-party tool such as Selenium (Perl/Java) or iMacros. Use the tSystem component to pass parameters to the script and execute it; once the scraping is done, you will have a file in the required format. Create metadata for that file and insert its contents into MySQL.
Hope that gives you an idea.
Thanks
vaibhav
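For illustration only, the last step above (reading the scraped file and inserting it into MySQL) can be sketched in plain Java. In a Talend job this would normally be done with components rather than hand-written code. The file layout (semicolon-delimited, with a header row) and the `products` table with `name` and `price` columns are assumptions, not details from this thread.

```java
import java.math.BigDecimal;
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class ScrapedFileLoader {

    // Hypothetical target table and columns; adjust to the real schema.
    static final String INSERT_SQL =
        "INSERT INTO products (name, price) VALUES (?, ?)";

    /** Splits semicolon-delimited lines into fields, skipping the header and blank lines. */
    static List<String[]> parseRows(List<String> lines) {
        List<String[]> rows = new ArrayList<>();
        for (int i = 1; i < lines.size(); i++) {   // index 0 is the header row
            String line = lines.get(i);
            if (line.trim().isEmpty()) {
                continue;
            }
            rows.add(line.split(";", -1));         // -1 keeps trailing empty fields
        }
        return rows;
    }

    /** Inserts all rows in one batch. Needs a MySQL JDBC driver on the classpath at run time. */
    static void load(Connection conn, List<String[]> rows) throws SQLException {
        try (PreparedStatement ps = conn.prepareStatement(INSERT_SQL)) {
            for (String[] r : rows) {
                ps.setString(1, r[0]);
                ps.setBigDecimal(2, new BigDecimal(r[1]));
                ps.addBatch();
            }
            ps.executeBatch();
        }
    }

    public static void main(String[] args) {
        // Stand-in for the file produced by the scraping script.
        List<String> sample = Arrays.asList("name;price", "Widget;9.99", "Gadget;19.50");
        for (String[] r : parseRows(sample)) {
            System.out.println(String.join(" | ", r));
        }
    }
}
```

The `main` method only exercises the parsing part; `load` would be called with a `java.sql.Connection` obtained from `DriverManager.getConnection(...)` for the local database.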

Anonymous
Not applicable
2015-03-24
07:51 PM
This can be done with Talend, HTMLParser (Java API) and Java. It does take a bit of Java knowledge, but it really isn't too hard. I am currently writing a tutorial on this for my website. You can download the example Job to extrapolate from, as with all of my tutorials.
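As a rough idea of what the Java side of such a job might look like, here is a minimal sketch that pulls the cell text out of an HTML table. It is not the tutorial's code, and it does not use the HTMLParser library: to stay dependency-free it swaps in the HTML parser that ships with the JDK (`javax.swing.text.html`). The class name `TableScraper` and the sample markup are made up for the example.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

final class TableScraper {

    /** Returns one list of cell texts per table row found in the given HTML. */
    static List<List<String>> extractRows(String html) {
        final List<List<String>> rows = new ArrayList<>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            private List<String> row;    // cells of the row currently being read
            private StringBuilder cell;  // text of the cell currently being read

            @Override
            public void handleStartTag(HTML.Tag t, MutableAttributeSet a, int pos) {
                if (t == HTML.Tag.TR) {
                    row = new ArrayList<>();
                } else if (t == HTML.Tag.TD || t == HTML.Tag.TH) {
                    cell = new StringBuilder();
                }
            }

            @Override
            public void handleText(char[] data, int pos) {
                if (cell != null) {
                    cell.append(data);   // ignore text outside table cells
                }
            }

            @Override
            public void handleEndTag(HTML.Tag t, int pos) {
                if ((t == HTML.Tag.TD || t == HTML.Tag.TH) && cell != null) {
                    if (row != null) {
                        row.add(cell.toString().trim());
                    }
                    cell = null;
                } else if (t == HTML.Tag.TR && row != null) {
                    rows.add(row);
                    row = null;
                }
            }
        };
        try {
            new ParserDelegator().parse(new StringReader(html), callback, true);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return rows;
    }

    public static void main(String[] args) {
        // Made-up page content standing in for a downloaded web page.
        String html = "<html><body><table>"
            + "<tr><th>Name</th><th>Price</th></tr>"
            + "<tr><td>Widget</td><td>9.99</td></tr>"
            + "</table></body></html>";
        for (List<String> r : extractRows(html)) {
            System.out.println(String.join(";", r));
        }
    }
}
```

Inside a Talend job, logic like this would typically live in a routine or a tJavaRow-style component, with the extracted rows passed on to a database output component.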
