<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Parsing XML files and loading into HDFS in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Parsing-XML-files-and-loading-into-HDFS/m-p/2231432#M21724</link>
    <description>If you want to load the XML files into HDFS as they are, you can use the tHDFSPut component. Set the filemask to "*.xml", and it will put all matching files from the local folder you specify onto the HDFS server.</description>
    <pubDate>Thu, 05 Nov 2015 08:58:00 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-11-05T08:58:00Z</dc:date>
    <item>
      <title>Parsing XML files and loading into HDFS</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-XML-files-and-loading-into-HDFS/m-p/2231431#M21723</link>
      <description>Hi,
&lt;BR /&gt;
I need to parse XML files and load them into HDFS. Some are simple enough, and some contain data delimited by "|".
&lt;BR /&gt;
e.g.:
&lt;BR /&gt;
&lt;PRE&gt;&amp;lt;ebo:Record&amp;gt;CREATE|59|0|59|0|2015-10-28 00:00:00|||EA|S|2955|303176760||2015-10-28 00:00:00|R|0003|8|1&amp;lt;/ebo:Record&amp;gt;&lt;/PRE&gt;
&lt;BR /&gt;
&lt;PRE&gt;&amp;lt;ebo:Record&amp;gt;CREATE|179|0|179|0|2015-10-28 00:00:00|||EA|S|2955|303151906||2015-10-28 00:00:00|R|0003|8|1&amp;lt;/ebo:Record&amp;gt;&lt;/PRE&gt;
&lt;BR /&gt;
I have to pick these files up from a specific directory using Talend and load them into HDFS. No transformation is involved. Also, as there will be more than 50 XML formats, I don't want to create the metadata schema individually for each file format. Is there any way to automate this task?</description>
      <pubDate>Sat, 16 Nov 2024 10:58:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-XML-files-and-loading-into-HDFS/m-p/2231431#M21723</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:58:12Z</dc:date>
    </item>
    <item>
      <title>Re: Parsing XML files and loading into HDFS</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Parsing-XML-files-and-loading-into-HDFS/m-p/2231432#M21724</link>
      <description>If you want to load the XML files into HDFS as they are, you can use the tHDFSPut component. Set the filemask to "*.xml", and it will put all matching files from the local folder you specify onto the HDFS server.</description>
      <pubDate>Thu, 05 Nov 2015 08:58:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Parsing-XML-files-and-loading-into-HDFS/m-p/2231432#M21724</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-11-05T08:58:00Z</dc:date>
    </item>
  </channel>
</rss>

