<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: html to xml in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287975#M61495</link>
    <description>Thanks. I restarted it and it works fine.&lt;BR /&gt;However, when I use input_row.htmlstring.startsWith("&amp;lt;body&amp;gt;") in the tFilterRow component, I see only the first line. It seems to break when there is a new line within the body tag. How can I solve this?</description>
    <pubDate>Tue, 02 Aug 2011 14:32:28 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2011-08-02T14:32:28Z</dc:date>
    <item>
      <title>html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287971#M61491</link>
      <description>Is there any component or way to convert HTML files to XML documents?&lt;BR /&gt;I am more interested in the body and title tags. Everything in the &amp;lt;body&amp;gt; of the HTML can stay in a &amp;lt;body&amp;gt; tag in the XML.&lt;BR /&gt;Cheers.</description>
      <pubDate>Sat, 16 Nov 2024 12:46:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287971#M61491</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:46:16Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287972#M61492</link>
      <description>Hi 
&lt;BR /&gt;There is no component that converts an HTML file to an XML file directly; you have to extract records from the HTML file and then insert them into an XML file.
&lt;BR /&gt;Consider the following job design to extract the desired records from the HTML file:
&lt;BR /&gt;tFileInputFullRow--main--&amp;gt;tFilterRow--&amp;gt;tExtractRegexFields.
&lt;BR /&gt;tFileInputFullRow: reads the HTML file row by row
&lt;BR /&gt;tFilterRow: filters the desired rows, for example rows that start with &amp;lt;body&amp;gt;
&lt;BR /&gt;tExtractRegexFields: uses a regular expression to extract fields
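&lt;BR /&gt;For illustration only, the three steps above can be sketched in plain Java. This is not how the Talend components are implemented; the class name, method name, and regular expression are assumptions made up for this example.
<![CDATA[
```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not a Talend API: mimics the three job steps in plain Java.
class HtmlRowExtractor {

    // Step 3 (like tExtractRegexFields): capture whatever follows the
    // opening body tag on the same row. The pattern is an assumption.
    private static final Pattern BODY_ROW = Pattern.compile("^<body[^>]*>(.*)$");

    static List<String> extractBodyRows(String html) {
        List<String> fields = new ArrayList<>();
        // Step 1 (like tFileInputFullRow): treat every line as one row.
        for (String row : html.split("\\R")) {
            // Step 2 (like tFilterRow): keep only rows that start with <body>.
            if (!row.startsWith("<body>")) {
                continue;
            }
            Matcher m = BODY_ROW.matcher(row);
            if (m.find()) {
                fields.add(m.group(1));
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        String html = "<html>\n<head>\n<title>Partnerships</title>\n</head>\n"
                + "<body><h1>Partnerships</h1>\n<p>More text</p>\n</body>\n</html>";
        // prints [<h1>Partnerships</h1>]
        System.out.println(extractBodyRows(html));
    }
}
```
]]>
&lt;BR /&gt;Because each row is handled on its own, this sketch captures only the markup that sits on the same line as the opening body tag; content on later lines would need a different approach, such as matching against the whole file as a single string.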
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Sun, 31 Jul 2011 07:36:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287972#M61492</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-07-31T07:36:40Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287973#M61493</link>
      <description>Thanks shong for your reply. I tried it. However, I get this "advanced condition failed" error from tFileInputFullRow.
&lt;BR /&gt;Log output:
&lt;BR /&gt;
&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt; connecting to socket on port 3654&lt;BR /&gt; connected&lt;BR /&gt;&amp;lt;html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"&amp;gt;|advanced condition failed&lt;BR /&gt;&amp;lt;head&amp;gt;|advanced condition failed&lt;BR /&gt;&amp;lt;meta http-equiv="content-type" content="text/html;" /&amp;gt;|advanced condition failed&lt;BR /&gt;&amp;lt;title&amp;gt;Partnerships&amp;lt;/title&amp;gt;|advanced condition failed&lt;BR /&gt;&amp;lt;/head&amp;gt;|advanced condition failed&lt;BR /&gt;&amp;lt;body&amp;gt;&amp;lt;h1 class="entry-title" style="margin-bottom:25px;"&amp;gt;Partnerships&amp;lt;/h1&amp;gt;|advanced condition failed&lt;BR /&gt;.....&lt;BR /&gt;....&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Mon, 01 Aug 2011 10:34:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287973#M61493</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-08-01T10:34:18Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287974#M61494</link>
      <description>Hi
&lt;BR /&gt;I tested reading an HTML file using tFileInputFullRow and did not have any problem. Can you please send me an example file for testing?
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
      <pubDate>Tue, 02 Aug 2011 04:53:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287974#M61494</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-08-02T04:53:32Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287975#M61495</link>
      <description>Thanks. I restarted it and it works fine.&lt;BR /&gt;However, when I use input_row.htmlstring.startsWith("&amp;lt;body&amp;gt;") in the tFilterRow component, I see only the first line. It seems to break when there is a new line within the body tag. How can I solve this?</description>
      <pubDate>Tue, 02 Aug 2011 14:32:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287975#M61495</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-08-02T14:32:28Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287976#M61496</link>
      <description>Is there any component or way to convert XML, CSV, or database data to HTML files?&lt;BR /&gt;Regards&lt;BR /&gt;Kishore</description>
      <pubDate>Thu, 10 Nov 2011 09:33:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287976#M61496</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-11-10T09:33:04Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287977#M61497</link>
      <description>If you want to convert an XML file to an HTML one, you just need to use an XSL transformation.
&lt;BR /&gt;You may use this model as a transformation job:
&lt;BR /&gt;tFileOutputXML -----&amp;gt; tFileList (in case you want to do this for a group of files) ----&amp;gt; tXSLT
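&lt;BR /&gt;Outside Talend, the same idea can be sketched with the JDK's javax.xml.transform API. This is only a minimal illustration and not the tXSLT component itself; the class name, the stylesheet, and the element names (page, title) are assumptions made up for this example.
<![CDATA[
```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper, not Talend's tXSLT implementation.
class XmlToHtml {

    // Illustrative stylesheet; the input element names are assumptions.
    static final String XSL =
          "<xsl:stylesheet version=\"1.0\""
        + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
        + "<xsl:output method=\"html\"/>"
        + "<xsl:template match=\"/page\">"
        + "<html><body><h1><xsl:value-of select=\"title\"/></h1></body></html>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Applies the stylesheet to the XML text and returns the result as a string.
    static String transform(String xml, String xsl) {
        try {
            Transformer transformer = TransformerFactory.newInstance()
                    .newTransformer(new StreamSource(new StringReader(xsl)));
            StringWriter out = new StringWriter();
            transformer.transform(new StreamSource(new StringReader(xml)),
                    new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            // Wrapped so callers of this sketch need no checked-exception handling.
            throw new IllegalStateException("XSL transformation failed", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<page><title>Partnerships</title></page>";
        System.out.println(transform(xml, XSL));
    }
}
```
]]>
&lt;BR /&gt;For a group of files, the transform call could be repeated once per input file, which is roughly the role tFileList plays in the job design.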
&lt;BR /&gt;This job design works; I have already tried it.</description>
      <pubDate>Thu, 10 Nov 2011 14:12:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287977#M61497</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2011-11-10T14:12:26Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287978#M61498</link>
      <description>Another way to go from XML -&amp;gt; HTML, CSV, and PDF is to use a Jasper Report. This video shows how to use the Jasper Report IDE "iReport" to build a report off of an XML document. The iReport product can be called from a Talend component.
&lt;BR /&gt;
&lt;A href="http://youtu.be/Y_JMUv7GiK8" rel="nofollow noopener noreferrer"&gt;http://youtu.be/Y_JMUv7GiK8&lt;/A&gt;</description>
      <pubDate>Thu, 10 Nov 2011 15:51:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287978#M61498</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-11-10T15:51:31Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287979#M61499</link>
      <description>Thank you, friends, it's working.&lt;BR /&gt;Regards&lt;BR /&gt;Kishore</description>
      <pubDate>Mon, 21 Nov 2011 17:54:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287979#M61499</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-11-21T17:54:51Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287980#M61500</link>
      <description>Hi friends,&lt;BR /&gt;New job: please tell me how to extract data from HTML files in version 4.2.&lt;BR /&gt;Regards,&lt;BR /&gt;Kishore</description>
      <pubDate>Mon, 21 Nov 2011 17:57:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287980#M61500</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-11-21T17:57:04Z</dc:date>
    </item>
    <item>
      <title>Re: html to xml</title>
      <link>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287981#M61501</link>
      <description>Hi friends,&lt;BR /&gt;Please let me know the process for extracting data from HTML to XML, CSV, or any other file format.&lt;BR /&gt;Regards,&lt;BR /&gt;Kishore</description>
      <pubDate>Wed, 23 Nov 2011 10:48:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/html-to-xml/m-p/2287981#M61501</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-11-23T10:48:38Z</dc:date>
    </item>
  </channel>
</rss>

