<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: how extract a table that exists in url in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363636#M127484</link>
    <description>I cannot get this component from open studio 7</description>
    <pubDate>Tue, 11 Jun 2019 15:24:37 GMT</pubDate>
    <dc:creator>fdenis</dc:creator>
    <dc:date>2019-06-11T15:24:37Z</dc:date>
    <item>
      <title>how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363629#M127477</link>
      <description>&lt;P&gt;extract a URL table&lt;BR /&gt;hello to all&lt;BR /&gt;I want to extract a table that exists in the following url,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;A href="https://security-tracker.debian.org/tracker/status/itp" target="_blank" rel="nofollow noopener noreferrer"&gt;https://security-tracker.debian.org/tracker/status/itp&lt;/A&gt;&lt;/P&gt;&lt;P&gt;which component I can use with which configuration&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;How I would like the output to look like:&lt;/SPAN&gt;&lt;/P&gt;&lt;TABLE border="1px" cellpadding="3px"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;package&lt;/TD&gt;&lt;TD&gt;issue&lt;/TD&gt;&lt;TD&gt;debainbug&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;I thank you in advance for your time and help&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 05:37:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363629#M127477</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T05:37:40Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363630#M127478</link>
      <description>trest is a good one&lt;BR /&gt;</description>
      <pubDate>Tue, 11 Jun 2019 11:21:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363630#M127478</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-06-11T11:21:39Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363631#M127479</link>
      <description>just as 
&lt;A href="https://community.qlik.com/s/profile/0053p000007LKpMAAW"&gt;@fdenis&lt;/A&gt; said, I suggest tREST to get the web page,
&lt;BR /&gt;tFileInputRegex to parse the file and extract columns as you want
&lt;BR /&gt;
&lt;BR /&gt;Or, you check on exchange.talend.com to find suitable component for web scraping.
&lt;BR /&gt;</description>
      <pubDate>Tue, 11 Jun 2019 12:42:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363631#M127479</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T12:42:29Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363632#M127480</link>
      <description>&lt;P&gt;I thank you for your answer but by using trest I couldn't extract a table from&amp;nbsp; the site &lt;A href="https://security-tracker.debian.org/tracker/status/itp" target="_blank" rel="nofollow noopener noreferrer"&gt;https://security-tracker.debian.org/tracker/status/itp&lt;/A&gt;&amp;nbsp; and save it in my database&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 12:48:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363632#M127480</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T12:48:50Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363633#M127481</link>
      <description>&lt;P&gt;&lt;SPAN&gt;I thank you for your answer&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;I checked on exchange.talend.com&amp;nbsp;; I would like to use tHttpTableInput to get the table in the below link&amp;nbsp;&lt;A href="https://security-tracker.debian.org/tracker/status/itp" target="_blank" rel="nofollow noopener noreferrer"&gt;https://security-tracker.debian.org/tracker/status/itp&lt;/A&gt;&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;but I didn't succeed I don't know how I configure this component&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;regards&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 13:30:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363633#M127481</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T13:30:33Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363634#M127482</link>
      <description>what is the error message?&lt;BR /&gt;</description>
      <pubDate>Tue, 11 Jun 2019 13:43:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363634#M127482</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-06-11T13:43:53Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363635#M127483</link>
      <description>&lt;P&gt;it's my site and the table I want to extract&amp;nbsp;&amp;nbsp;&lt;A href="https://security-tracker.debian.org/tracker/status/itp" target="_blank" rel="nofollow noopener noreferrer"&gt;https://security-tracker.debian.org/tracker/status/itp&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="3.PNG" style="width: 521px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M5ZS.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/131384i1D2D4325187764DC/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M5ZS.png" alt="0683p000009M5ZS.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;it's the configuration :&amp;nbsp;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M5XI.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150484iAC8C0202081C14E0/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M5XI.png" alt="0683p000009M5XI.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="1.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M5ZX.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/147623i07DC4BD5787D5FA3/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M5ZX.png" alt="0683p000009M5ZX.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&amp;nbsp;thanlks&lt;/P&gt; 
&lt;P&gt;regards&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 14:03:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363635#M127483</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T14:03:50Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363636#M127484</link>
      <description>I cannot get this component from open studio 7</description>
      <pubDate>Tue, 11 Jun 2019 15:24:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363636#M127484</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-06-11T15:24:37Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363637#M127485</link>
      <description>&lt;P&gt;it's the configurationDownload tHTTPTableInput component from &lt;A href="https://exchange.talend.com" target="_blank" rel="nofollow noopener noreferrer"&gt;https://exchange.talend.com&lt;/A&gt; to extract URL data because Talend doesn't have a standard component for it&lt;/P&gt;&lt;P&gt;Step 1: Once you have downloaded the component from &lt;A href="http://talendforge.org/exchange/" target="_blank" rel="nofollow noopener noreferrer"&gt;http://talendforge.org/exchange/&lt;/A&gt;, simply add it to the folder \talend\plugins\org.talend.designer.components.localprovider_7.1.1.1.20181026_1147\components&lt;BR /&gt;of Talend, then relaunch Talend.&lt;BR /&gt;Step 2: put this tHTTPTableInput in a specific folder, i. e. my components.&lt;BR /&gt;Then: Under Windows / Preferences, under Talend / Components, select the myComponents folder as the User component folder.&lt;BR /&gt;Click on Apply, then Ok.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 15:27:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363637#M127485</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T15:27:48Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363638#M127486</link>
      <description>so to have support on this get file by trest then open it as xml it will be more processionnal.</description>
      <pubDate>Tue, 11 Jun 2019 15:31:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363638#M127486</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-06-11T15:31:15Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363639#M127487</link>
      <description>&lt;P&gt;&lt;SPAN&gt;I am not sure what you mean by this. Can you explain this with the photo please&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;thanks&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;regards&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 15:43:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363639#M127487</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T15:43:21Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363640#M127488</link>
      <description>&lt;P&gt;I think it will be difficult for you to do data scraping with talend.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Data scraping depend on the struture of the html page (the attached file is the structure of your web page)&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I don't think you can find any generic component to do what you are trying to do.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I suggest you go on freelancer.com, get someone who can provide to you a simple java code to scrape that page (it will not work on other pages), then you can integrate it in talend.&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="html_page_structure.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M5a1.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/149557i0F9A90A82607528B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M5a1.png" alt="0683p000009M5a1.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 15:43:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363640#M127488</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-11T15:43:50Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363641#M127489</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp;I could see that you are currently planning for JSON parsing of the message which is the right approach rather than trying to parse from the output HTML page.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;A href="https://community.qlik.com/s/feed/0D53p00007vCoymCAC#M99914" target="_blank"&gt;https://community.talend.com/t5/Design-and-Development/parsing-and-extract-json/m-p/164151#M99914&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; So I hope you do not have any outstanding query in this post.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt; 
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2019 14:36:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363641#M127489</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-12T14:36:33Z</dc:date>
    </item>
    <item>
      <title>Re: how extract a table that exists in url</title>
      <link>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363642#M127490</link>
      <description>&lt;P&gt;thank you for your attention, in fact these are two different things I need data extraction from a json file and also I need data extraction from url and also I need data extraction from pdf in different sites and in the end I will integrate all the tables I have extracted so for me it is necessary that I find a way to extract the data or by using the tHTTPInputtable component or in other way&lt;/P&gt;
&lt;P&gt;regards&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2019 15:35:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/how-extract-a-table-that-exists-in-url/m-p/2363642#M127490</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-06-12T15:35:37Z</dc:date>
    </item>
  </channel>
</rss>

