Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
sifatnabil
Specialist
Specialist

Scrape website

Hi all,

This website Eurex Clearing - Clearing volume does not have any HTML tables but has data in lists. Is there still a way to pull this into Qlikview? Thanks.

1 Reply
petter
Partner - Champion III
Partner - Champion III

Yes it is possible. I have done it by retrieving the entire HTML page as text. Then I use TextBetween() and other text functions to get the info I want.

The trick is to deselect HTML as format with the Web File wizard and choose FIXED as format. Then all the HTML text-lines will be retrieved into one field.

The load script will look like this:

ALLHTML:

LOAD

    Concat(Line, Chr(13) & Chr(10) , R ) AS HTML    // merge the table lines into a single field with linefeeds

                                                                                    // R is to get them concatenated in the right order.

;

LOAD

    RecNo() AS R,

    [@1:n] AS Line

FROM

    [http://www.eurexclearing.com/clearing-en/markets-services/eurex-otc-clear/interest-rate-swaps/cleari...]

    (fix, utf8, no labels, no eof);

If you then refer to HTML in an expression you will get this single field as the full text of the page.