Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us in Bucharest on Sept 18th for Qlik's AI Reality Tour! Register Now
cancel
Showing results for 
Search instead for 
Did you mean: 
_AnonymousUser
Specialist III
Specialist III

parsing HTML

Hi everybody.
I have to get some information (everything legal) from html pages. There's a way to take just the information I need, deleting html tags??
Thanks in advance.
Labels (2)
13 Replies
_AnonymousUser
Specialist III
Specialist III
Author

Hi,
I tried tFileInputFullRow -> tFileOutputDelimited skipping empy row, but it doesn't clean empty row... 0683p000009MPcz.png
Regards
_AnonymousUser
Specialist III
Specialist III
Author

Hi everybody,
fine I don't need to have an orderly file anymore.
I just need to extract some lines... is there a component that help me with that?? I need to specify some start words and some end words.
Thanks in advance.
_AnonymousUser
Specialist III
Specialist III
Author

I'm using tFileInputRegex and it's matching the lines I need... but how can I write these lines in an output files?
Using tFileInputRegex -> tFileOutputDelimited doesn't work.
regards
Ashok_Panda
Contributor
Contributor

Hi Everybody,
I am reading a html file using tFileInputFullRow ,but it's not reading the html file from starting. I mean to say it should start reading the file at <html> tag ,but it's starting at somewhere i am not sure where . Note: i have not checked the random option of the component.