Anonymous
Not applicable

tHTMLinput

I would like to parse the table on the following page:
http://english.mnb.hu/arfolyamok
So the HTML is something like:

<td class="firstcell noborder">AUD</td>
<td>Australian Dollar</td>
<td>1</td>
<td>209.43</td>
<td></td>
<td class="firstcell">KRW</td>
<td>South Korean Won</td>
<td>100</td>
<td>24.87</td>
</tr>
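
For readers who want to check the expected output outside Talend, here is a minimal Java sketch of how cells like the ones above could be pulled out and grouped into rows of currency code, name, unit and rate. It is not how tHTMLInput works internally; the class name `RateTableSketch` and its methods are hypothetical, and the regular expression is only suited to simple, flat markup like this fragment (a real HTML parser would be more robust).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Talend or tHTMLInput.
public class RateTableSketch {
    // Matches one <td ...>text</td> cell and captures its inner text.
    private static final Pattern CELL =
            Pattern.compile("<td[^>]*>(.*?)</td>", Pattern.DOTALL);

    /** Returns the trimmed text of every <td> cell, in document order. */
    public static List<String> cells(String html) {
        List<String> out = new ArrayList<>();
        Matcher m = CELL.matcher(html);
        while (m.find()) {
            out.add(m.group(1).trim());
        }
        return out;
    }

    /** Drops empty cells, then groups the rest four at a time: code, name, unit, rate. */
    public static List<String[]> rows(String html) {
        List<String> nonEmpty = new ArrayList<>();
        for (String c : cells(html)) {
            if (!c.isEmpty()) {
                nonEmpty.add(c);
            }
        }
        List<String[]> rows = new ArrayList<>();
        for (int i = 0; i + 3 < nonEmpty.size(); i += 4) {
            rows.add(new String[] {
                nonEmpty.get(i), nonEmpty.get(i + 1),
                nonEmpty.get(i + 2), nonEmpty.get(i + 3)
            });
        }
        return rows;
    }

    public static void main(String[] args) {
        String html = "<td class=\"firstcell noborder\">AUD</td><td>Australian Dollar</td>"
                + "<td>1</td><td>209.43</td><td></td>"
                + "<td class=\"firstcell\">KRW</td><td>South Korean Won</td>"
                + "<td>100</td><td>24.87</td></tr>";
        for (String[] r : rows(html)) {
            System.out.println(String.join(" | ", r));
        }
    }
}
```

The grouping by four assumes every currency row has exactly those four non-empty cells, which holds for the fragment quoted here but is an assumption about the rest of the page.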

So I am trying to use the tHTMLinput component and currently I get the following message:
Execution failed: code generation failed.


Could you help me?
Thanks
Didier
14 Replies
Anonymous
Not applicable
Author

Today I will upload a new version of tHTMLInput; please download it. It will give you the option to choose how many times to try connecting to a given web page, which should avoid these errors to some extent.
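
To illustrate what a "number of connection attempts" option generally means, here is a minimal Java sketch of a retry loop. It is an assumption about the general technique, not the component's actual code; `RetrySketch`, `withRetries`, `maxAttempts` and `delayMillis` are hypothetical names.

```java
import java.util.concurrent.Callable;

// Hypothetical helper, not taken from tHTMLInput.
public class RetrySketch {
    /**
     * Calls the task up to maxAttempts times, sleeping delayMillis after each
     * failed attempt except the last. Rethrows the last failure if all attempts fail.
     */
    public static <T> T withRetries(Callable<T> task, int maxAttempts, long delayMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return task.call();
            } catch (Exception e) {
                last = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMillis);
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) throws Exception {
        int[] calls = {0};
        // Simulated fetch that fails twice before succeeding.
        String result = withRetries(() -> {
            if (++calls[0] < 3) {
                throw new java.io.IOException("simulated connection failure");
            }
            return "page content";
        }, 5, 10);
        System.out.println(result + " after " + calls[0] + " attempts");
    }
}
```

In a real job the task would be the HTTP fetch of the page; the simulated failure here only stands in for a dropped connection.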
Anonymous
Not applicable
Author

Hi Umesh,
I looked on the Talend Exchange site and I see tHTMLinput with a release date of 22-Apr and tHTMLInput_extended with a release date of 2-July.
Is there another version?
Thanks
Didier
Anonymous
Not applicable
Author

I have installed V2.
Now it works:
connecting to socket on port 3628
connected
.------+--------------.
|      tLogRow_2      |
|=-----+-------------=|
|euro  |parseErrorText|
|=-----+-------------=|
|309.33|null          |
'------+--------------'
disconnected
but the component does not seem very stable:
connected
Exception in component tHTMLInput_2
java.lang.NullPointerException
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.tHTMLInput_2Process(GetCurrency_BNH_HTML.java:806)
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.runJobInTOS(GetCurrency_BNH_HTML.java:1209)
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.main(GetCurrency_BNH_HTML.java:1030)
disconnected

when you run the same job several times!


Anonymous
Not applicable
Author

You need to add a tSleep component to slow the process down and avoid sending frequent requests to the web server, which may otherwise block your IP address. There are other options, such as going through a proxy, but they still need to be added to the component. That will come in the next version; right now I am too busy.
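
As a rough picture of what slowing down requests means in plain Java, the sketch below enforces a minimum interval between consecutive requests. It only illustrates the throttling idea behind the tSleep advice and is not code from tSleep or tHTMLInput; `ThrottleSketch` and `await` are hypothetical names.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical helper illustrating request throttling.
public class ThrottleSketch {
    private final long minIntervalNanos;
    private long lastNanos;
    private boolean hasPrevious = false;

    public ThrottleSketch(long minIntervalMillis) {
        this.minIntervalNanos = TimeUnit.MILLISECONDS.toNanos(minIntervalMillis);
    }

    /** Blocks until the minimum interval has passed since the previous call returned. */
    public synchronized void await() throws InterruptedException {
        if (hasPrevious) {
            long waitNanos = minIntervalNanos - (System.nanoTime() - lastNanos);
            if (waitNanos > 0) {
                TimeUnit.NANOSECONDS.sleep(waitNanos);
            }
        }
        lastNanos = System.nanoTime();
        hasPrevious = true;
    }

    public static void main(String[] args) throws InterruptedException {
        ThrottleSketch throttle = new ThrottleSketch(200);
        for (int i = 1; i <= 3; i++) {
            throttle.await(); // the first call returns immediately
            System.out.println("request " + i + " would be sent now");
        }
    }
}
```

Calling `await()` before each page fetch spaces the requests out without adding a pause before the first one.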
 
Anonymous
Not applicable
Author

With a tSleep:
Starting job GetCurrency_BNH_HTML at 15:45 27/07/2015.
connecting to socket on port 3510
connected
Exception in component tHTMLInput_2
java.lang.NullPointerException
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.tHTMLInput_2Process(GetCurrency_BNH_HTML.java:1478)
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.tSleep_1Process(GetCurrency_BNH_HTML.java:2076)
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.runJobInTOS(GetCurrency_BNH_HTML.java:2338)
    at pmi.getcurrency_bnh_html_0_1.GetCurrency_BNH_HTML.main(GetCurrency_BNH_HTML.java:2159)
disconnected
Job GetCurrency_BNH_HTML ended at 15:46 27/07/2015.
