Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us in Toronto Sept 9th for Qlik's AI Reality Tour! Register Now
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Extracting tables from web output

Hi, all!

There is a website with tables (tag <table></table>. How to extract table data to rows from tHttpRequest output (string)? 

Regards

Labels (2)
1 Solution

Accepted Solutions
Anonymous
Not applicable
Author

I've written a tutorial on this very subject. The example job is included. You might be able to download the job and configure it to your requirements

 

https://www.rilhia.com/tutorials/using-third-party-java-library-scrape-content-table-web-page

View solution in original post

4 Replies
Aukema
Creator
Creator

You could try using regular expressions, however best-practice is to use a parser like JSOUP. 

I would go for the parser in combination with tLibaryLoad and tJavaFlex

Anonymous
Not applicable
Author

I've written a tutorial on this very subject. The example job is included. You might be able to download the job and configure it to your requirements

 

https://www.rilhia.com/tutorials/using-third-party-java-library-scrape-content-table-web-page

Anonymous
Not applicable
Author

Thank you! It works!

Anonymous
Not applicable
Author

Unfortunately the tutorial is no longer available. However this post covers everything in the tutorial.....
https://community.talend.com/t5/Design-and-Development/Extract-Multiple-table-using-tHTTPTableInput-...