Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Connect 2026! Turn data into bold moves, April 13 -15: Learn More!
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

how extract a table that exists in url

extract a URL table
hello to all
I want to extract a table that exists in the following url,

 https://security-tracker.debian.org/tracker/status/itp

which component I can use with which configuration

How I would like the output to look like:

packageissuedebainbug



I thank you in advance for your time and help

Labels (3)
13 Replies
fdenis
Master
Master

trest is a good one
Anonymous
Not applicable
Author

just as @fdenis said, I suggest tREST to get the web page,
tFileInputRegex to parse the file and extract columns as you want

Or, you check on exchange.talend.com to find suitable component for web scraping.
Anonymous
Not applicable
Author

I thank you for your answer but by using trest I couldn't extract a table from  the site https://security-tracker.debian.org/tracker/status/itp  and save it in my database

Anonymous
Not applicable
Author

I thank you for your answer 

 I checked on exchange.talend.com ; I would like to use tHttpTableInput to get the table in the below link https://security-tracker.debian.org/tracker/status/itp  

but I didn't succeed I don't know how I configure this component

regards

fdenis
Master
Master

what is the error message?
Anonymous
Not applicable
Author

it's my site and the table I want to extract  https://security-tracker.debian.org/tracker/status/itp

0683p000009M5ZS.png

it's the configuration : 0683p000009M5XI.png0683p000009M5ZX.png thanlks

regards 

fdenis
Master
Master

I cannot get this component from open studio 7
Anonymous
Not applicable
Author

it's the configurationDownload tHTTPTableInput component from https://exchange.talend.com to extract URL data because Talend doesn't have a standard component for it

Step 1: Once you have downloaded the component from http://talendforge.org/exchange/, simply add it to the folder \talend\plugins\org.talend.designer.components.localprovider_7.1.1.1.20181026_1147\components
of Talend, then relaunch Talend.
Step 2: put this tHTTPTableInput in a specific folder, i. e. my components.
Then: Under Windows / Preferences, under Talend / Components, select the myComponents folder as the User component folder.
Click on Apply, then Ok.

 

fdenis
Master
Master

so to have support on this get file by trest then open it as xml it will be more processionnal.