Anonymous
Not applicable

Scrape data from an ASPX website

Hi,
I have to scrape data from an ASPX website. Is this possible with Talend? If it is, which components will come in handy in this scenario?
Thanks in Advance,
3 Replies
Anonymous
Not applicable
Author

You can do this, but there isn't a "one size fits all" solution with Talend. I have written a tutorial on how I achieved this with a Formula 1 site. The tutorial is here: https://www.rilhia.com/tutorials/using-third-party-java-library-scrape-content-table-web-page
I included the job so you can take that and have a play. But remember it was written specifically for the site I was working with.
Anonymous
Not applicable
Author

Thanks r_hall,
My requirement is this: I have a CSV file which holds addresses, and I want Talend to look up those addresses on the site and copy the relevant data into a new CSV file or dump it in a database.
CSV -> Talend -> fetch the website -> look up addresses from the CSV on the website -> return relevant data and save
Do you think this is possible with components, or do I have to write a convoluted Java routine for this (I am not fluent in Java)?
Thanks,
Anonymous
Not applicable
Author

I doubt there is a component that will handle this, but I am not aware of every component available in Talend Exchange (maybe check there). However, writing a Java routine that makes use of a third-party API really isn't that hard. If you are new to Talend, it may be a bit more of a challenge, but any gain in Java knowledge can only be a benefit when using Talend. It opens so many extra doors.
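To give a feel for the size of such a routine, here is a minimal sketch that uses only the JDK (Java 11 or later assumed) rather than a third-party library. It relies on one general fact about ASP.NET WebForms pages: they usually carry hidden inputs such as __VIEWSTATE and __EVENTVALIDATION that have to be posted back with the form. Everything else is an assumption for illustration: the class name AspxLookupSketch, the pageUrl and addressField parameters, and the idea that the site does its lookup with a plain form POST. Your site may also need cookies or the submit button's own name/value pair, so check the real form in your browser's developer tools first.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; in Talend it would live under Code > Routines.
final class AspxLookupSketch {

    // Assumes attributes appear in the order type, name, value, which is how
    // WebForms normally renders its hidden fields. A real HTML parser is more robust.
    private static final Pattern HIDDEN_INPUT = Pattern.compile(
            "<input[^>]*type=\"hidden\"[^>]*name=\"([^\"]+)\"[^>]*value=\"([^\"]*)\"",
            Pattern.CASE_INSENSITIVE);

    // Collects hidden form fields (e.g. __VIEWSTATE) from the page HTML, in page order.
    static Map<String, String> hiddenFields(String html) {
        Map<String, String> fields = new LinkedHashMap<>();
        Matcher m = HIDDEN_INPUT.matcher(html);
        while (m.find()) {
            fields.put(m.group(1), m.group(2));
        }
        return fields;
    }

    // Encodes the fields as an application/x-www-form-urlencoded request body.
    static String formBody(Map<String, String> fields) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : fields.entrySet()) {
            if (sb.length() > 0) {
                sb.append('&');
            }
            sb.append(URLEncoder.encode(e.getKey(), StandardCharsets.UTF_8))
              .append('=')
              .append(URLEncoder.encode(e.getValue(), StandardCharsets.UTF_8));
        }
        return sb.toString();
    }

    // Fetches the form page, then posts it back with one address filled in.
    // Returns the raw result HTML; pulling the wanted values out of it is site-specific.
    static String lookup(HttpClient client, String pageUrl,
                         String addressField, String address) throws Exception {
        HttpRequest get = HttpRequest.newBuilder(URI.create(pageUrl)).GET().build();
        String page = client.send(get, HttpResponse.BodyHandlers.ofString()).body();

        Map<String, String> fields = hiddenFields(page);
        fields.put(addressField, address); // addressField is whatever the site's text box is named

        HttpRequest post = HttpRequest.newBuilder(URI.create(pageUrl))
                .header("Content-Type", "application/x-www-form-urlencoded")
                .POST(HttpRequest.BodyPublishers.ofString(formBody(fields)))
                .build();
        return client.send(post, HttpResponse.BodyHandlers.ofString()).body();
    }
}
```

In a job, one plausible layout (again, only a suggestion) is tFileInputDelimited to read the addresses, a tJavaRow that calls the routine once per row, and tFileOutputDelimited or a database output component to store the result. The CSV reading and the saving are then handled by standard components, and only the lookup itself needs Java.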