sifatnabil
Specialist

Scrape website

Hi all,

The Eurex Clearing "Clearing volume" page does not have any HTML tables; its data is in lists instead. Is there still a way to pull this into QlikView? Thanks.

1 Reply
petter
Partner - Champion III

Yes, it is possible. I have done it by retrieving the entire HTML page as text and then using TextBetween() and other string functions to extract the information I want.

The trick is to deselect HTML as the format in the Web File wizard and choose FIXED instead. Each line of the HTML source is then loaded as one record in a single field.

The load script will look like this:

ALLHTML:
LOAD
    Concat(Line, Chr(13) & Chr(10), R) AS HTML    // merge the lines into a single field, separated by CR+LF;
                                                  // R is the sort weight that keeps them in the right order
;
LOAD
    RecNo() AS R,
    [@1:n] AS Line
FROM
    [http://www.eurexclearing.com/clearing-en/markets-services/eurex-otc-clear/interest-rate-swaps/cleari...]
    (fix, utf8, no labels, no eof);

The resulting table has a single row, so if you then refer to HTML in an expression you will get the full text of the page as one field value.
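
As a rough illustration of the TextBetween() step, the sketch below pulls one row per list item out of the HTML field. It assumes the values sit in plain `<li ...>...</li>` elements, which I have not checked against the actual Eurex page. The table name ITEMS and the fields ItemNo, RawItem and ItemText are made up for this example, so adjust the delimiters and names to the real markup.

```
// Sketch only: assumes each value is wrapped in <li ...> ... </li>.
// Note that '<li' would also match tags such as <link>; use a more specific
// delimiter (e.g. '<li class="...') if the page contains those.
ITEMS:
LOAD
    ItemNo,
    Trim(Mid(RawItem, Index(RawItem, '>') + 1)) AS ItemText   // drop the rest of the opening tag
;
LOAD
    IterNo() AS ItemNo,
    TextBetween(HTML, '<li', '</li>', IterNo()) AS RawItem    // n-th <li ... </li> pair
RESIDENT ALLHTML
WHILE IterNo() <= SubStringCount(HTML, '</li>');              // one iteration per closing tag
```

Any markup nested inside a list item (links, spans and so on) will still be present in ItemText and would need further TextBetween() or Replace() calls to clean up.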