Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Scrape data from a aspx website

Hi,
I have to scrape data from a aspx website. Is this possible with Talend? If it is, what components will come in handy in this scenario?
Thanks in Advance,
Labels (2)
3 Replies
Anonymous
Not applicable
Author

You can do this, but there isn't a "one size fits all" solution with Talend. I have written a tutorial on how I achieved this with a Formula 1 site. The tutorial is here:  https://www.rilhia.com/tutorials/using-third-party-java-library-scrape-content-table-web-page
I included the job so you can take that and have a play. But remember it was written specifically for the site I was working with.
Anonymous
Not applicable
Author

Thanks r_hall,
My requirement is like this, I have a csv file which holds addresses and I want Talend to look up those addresses on the site and copy relevant data in a new csv file or dump it in the database. 
csv----------------talend----------fetch the website-----------look up addresses from csv on website--------return relevant data and save
You think this can be possible with components or I have to create convoluted Java routine for this (I am not fluent with Java)? 
Thanks,
Anonymous
Not applicable
Author

I doubt there is a component that will handle this, but I am not aware of every component available in Talend Exchange (maybe check there). However, writing a Java routine making use of a third party API really isn't that hard. If you are new to Talend, it may be a bit more of a challenge, but any gain in Java knowledge can only be a benefit when using Talend. It opens so many extra doors.