Re: parsing HTML - Page 2 - Qlik Community

_AnonymousUser · ‎2014-03-28

Hi everybody.
I need to extract some information (all of it legally) from HTML pages. Is there a way to keep just the information I need and strip out the HTML tags?
Thanks in advance.
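
For readers who want to see the tag-stripping idea outside Talend, here is a minimal sketch in Python, using only the standard library's `html.parser`. It is not a Talend component or the method used by anyone in this thread; the names `TextExtractor` and `strip_tags` are made up for illustration, and skipping `<script>` and `<style>` content is an assumption about what "just the information I need" means.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes and ignores the tags around them (hypothetical helper)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        # Keep non-empty text that is not script or stylesheet source.
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def strip_tags(html):
    """Return the visible text of an HTML string, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Calling `strip_tags(page_source)` on the contents of a downloaded page returns plain text that can then be filtered further.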

_AnonymousUser · ‎2014-04-03

Hi,
I tried tFileInputFullRow -> tFileOutputDelimited with the option to skip empty rows, but the empty rows are still not removed...

Regards
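
One possible explanation, which is only an assumption and is not confirmed anywhere in this thread, is that the "empty" rows actually contain spaces or tabs and so are not treated as empty. A minimal Python sketch of a filter that drops whitespace-only lines as well (the function name `drop_blank_lines` is hypothetical):

```python
def drop_blank_lines(lines):
    """Keep only lines that contain at least one non-whitespace character."""
    return [line for line in lines if line.strip()]
```

The same test, trimming the row before checking whether it is empty, could be expressed as a filter condition in whatever tool does the reading.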

_AnonymousUser · ‎2014-04-04

Hi everybody,
OK, I no longer need the output file to be tidy.
I just need to extract some lines... is there a component that can help me with that? I need to specify some start words and some end words.
Thanks in advance.
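
The start-word/end-word extraction described above can be sketched as a small state machine. This is an illustrative Python sketch, not a Talend component; `extract_between` and its parameters are hypothetical names, and including the start and end lines themselves in the result is an assumption.

```python
def extract_between(lines, start_word, end_word):
    """Return every line from one containing start_word up to and including
    the next line containing end_word. Later blocks are collected as well."""
    selected = []
    inside = False
    for line in lines:
        if not inside and start_word in line:
            inside = True  # a block begins on this line
        if inside:
            selected.append(line)
            if end_word in line:
                inside = False  # the block ends on this line
    return selected
```

For example, calling it with `"<table>"` and `"</table>"` keeps only the table rows of a page and discards everything else.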

_AnonymousUser · ‎2014-04-04

I'm using tFileInputRegex and it matches the lines I need... but how can I write these lines to an output file?
Using tFileInputRegex -> tFileOutputDelimited doesn't work.
regards
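
As a rough illustration of the match-then-write flow, here is a minimal Python sketch that writes each matching line's capture groups as one delimited row. It does not reproduce how tFileInputRegex or tFileOutputDelimited work internally; the function name `write_matches`, the `;` delimiter, and the UTF-8 encoding are all assumptions.

```python
import csv
import re


def write_matches(input_path, output_path, pattern, delimiter=";"):
    """Write one delimited row per input line that matches pattern.

    Returns the number of rows written.
    """
    regex = re.compile(pattern)
    count = 0
    with open(input_path, encoding="utf-8") as src, \
            open(output_path, "w", encoding="utf-8", newline="") as dst:
        writer = csv.writer(dst, delimiter=delimiter)
        for line in src:
            match = regex.search(line)
            if match:
                # Capture groups become columns; with no groups, the whole
                # match is written as a single column.
                writer.writerow(match.groups() or (match.group(0),))
                count += 1
    return count
```

Each capture group in the pattern maps to one output column, so the number of groups has to agree with the number of columns expected in the output file.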

Ashok_Panda · ‎2014-04-07

Hi Everybody,
I am reading an HTML file using tFileInputFullRow, but it does not read the file from the beginning. It should start reading at the <html> tag, but it starts somewhere else and I am not sure where. Note: I have not checked the component's random option.

parsing HTML

Talend Data Integration

v5.x