_AnonymousUser
Specialist III

parsing HTML

Hi everybody.
I need to extract some information (all of it legal) from HTML pages. Is there a way to keep just the information I need and strip out the HTML tags?
Thanks in advance.
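For reference, stripping the tags can be sketched in plain Java as below. This is only a minimal regex-based sketch, not a Talend component: the class name `HtmlTagStripper` and the method `stripTags` are made up for illustration, and the code could presumably be adapted to a custom routine. It drops script/style blocks and tags and collapses whitespace, but it does not decode entities such as `&amp;`, and regular expressions are a crude tool for real-world HTML, so a proper parser such as jsoup is likely the more robust choice.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Talend: crude regex-based tag stripping.
final class HtmlTagStripper {
    // <script>...</script> and <style>...</style> blocks, case-insensitive, spanning lines.
    private static final Pattern SCRIPT_OR_STYLE =
            Pattern.compile("(?is)<(script|style)\\b[^>]*>.*?</\\1\\s*>");
    // Any remaining tag: '<', then anything up to the next '>'.
    private static final Pattern TAG = Pattern.compile("<[^>]+>");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    static String stripTags(String html) {
        // Remove script/style content first so the code inside does not leak into the text.
        String text = SCRIPT_OR_STYLE.matcher(html).replaceAll(" ");
        // Replace each tag with a space so adjacent words do not run together.
        text = TAG.matcher(text).replaceAll(" ");
        // Collapse runs of whitespace (including newlines) into single spaces.
        return WHITESPACE.matcher(text).replaceAll(" ").trim();
    }
}
```

For example, `HtmlTagStripper.stripTags("<p>Hello <b>world</b></p>")` returns `Hello world`.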
13 Replies
_AnonymousUser
Specialist III
Author

Hi,
I tried tFileInputFullRow -> tFileOutputDelimited, skipping empty rows, but the empty rows are not removed.
Regards
_AnonymousUser
Specialist III
Author

Hi everybody,
OK, I no longer need a tidy file.
I just need to extract some lines. Is there a component that can help me with that? I need to specify some start words and some end words.
Thanks in advance.
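To make the "start words / end words" idea concrete, here is a small plain-Java sketch. It is an illustration under assumptions, not an existing Talend component: the names `MarkerLineExtractor` and `extractBetween` are invented, and the markers are matched as plain substrings. It keeps every line from one containing the start word through the next one containing the end word, and it can pick up several such blocks in one file.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: keeps the lines between a start marker and an end marker (inclusive).
final class MarkerLineExtractor {
    static List<String> extractBetween(List<String> lines, String startWord, String endWord) {
        List<String> result = new ArrayList<>();
        boolean inside = false;
        for (String line : lines) {
            // Open a block when the start word appears.
            if (!inside && line.contains(startWord)) {
                inside = true;
            }
            if (inside) {
                result.add(line);
                // Close the block on the line that contains the end word.
                if (line.contains(endWord)) {
                    inside = false;
                }
            }
        }
        return result;
    }
}
```

Called with the lines of a page and the markers `<table>` and `</table>`, it would return only the table rows plus the two marker lines. If the marker lines themselves are not wanted, the first and last line of each block can be dropped afterwards.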
_AnonymousUser
Specialist III
Author

I'm using tFileInputRegex and it matches the lines I need, but how can I write these lines to an output file?
Using tFileInputRegex -> tFileOutputDelimited doesn't work.
regards
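As a fallback outside the components, the same "match lines with a regex, then write them out" step can be sketched in plain Java. This is not how tFileInputRegex works internally and it does not explain why the link above fails; it is only a minimal sketch, with the class `RegexLineFilter` and both method names invented for illustration, and UTF-8 assumed as the file encoding.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper: filters lines by regex and writes the matches to a file.
final class RegexLineFilter {
    // Returns the lines in which the regex matches somewhere (not necessarily the whole line).
    static List<String> selectMatching(List<String> lines, String regex) {
        Pattern pattern = Pattern.compile(regex);
        List<String> matches = new ArrayList<>();
        for (String line : lines) {
            if (pattern.matcher(line).find()) {
                matches.add(line);
            }
        }
        return matches;
    }

    // Reads the input file, writes the matching lines to the output file, returns how many were written.
    static int copyMatchingLines(Path input, Path output, String regex) {
        try {
            List<String> all = Files.readAllLines(input, StandardCharsets.UTF_8); // encoding is an assumption
            List<String> matches = selectMatching(all, regex);
            Files.write(output, matches, StandardCharsets.UTF_8);
            return matches.size();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A call could look like `RegexLineFilter.copyMatchingLines(Paths.get("page.html"), Paths.get("matches.txt"), "<td>.*</td>")`, where the file names and the pattern are placeholders.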
Ashok_Panda
Contributor

Hi Everybody,
I am reading an HTML file using tFileInputFullRow, but it is not reading the file from the beginning. It should start reading at the <html> tag, but it starts somewhere else and I am not sure where. Note: I have not checked the random option of the component.