<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Source File Delimiter Capture in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277931#M53546</link>
    <description>&lt;P&gt;The template in DB is like this:&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;File_Name&lt;/TH&gt;&lt;TH&gt;Header_Info&lt;/TH&gt;&lt;TH&gt;Delimiter_Type&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ABD.txt&lt;/TD&gt;&lt;TD&gt;Y&lt;/TD&gt;&lt;TD&gt;;&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;XYZ.txt&lt;/TD&gt;&lt;TD&gt;Y&lt;/TD&gt;&lt;TD&gt;|&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The source file, XYZ.txt, looks like this:&lt;/P&gt;
&lt;P&gt;Name | ID | City&lt;/P&gt;
&lt;P&gt;-----------------------&lt;/P&gt;
&lt;P&gt;Krishu |10 | Bangalore&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have to capture the delimiter used in the source file (I don't know in advance which delimiter it uses) and compare it with the template's Delimiter_Type (as above). If they match, I should process the file further; otherwise, I should reject the source file.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance&lt;/P&gt;
&lt;P&gt;Krishu&lt;/P&gt;</description>
    <pubDate>Thu, 29 Nov 2018 06:17:27 GMT</pubDate>
    <dc:creator>krishu</dc:creator>
    <dc:date>2018-11-29T06:17:27Z</dc:date>
    <item>
      <title>Source File Delimiter Capture</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277928#M53543</link>
      <description>&lt;P&gt;Hi Team&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Could you please help me with the scenario below?&lt;/P&gt;
&lt;P&gt;How can I capture the delimiter used in a source file (semicolon, pipe, or comma) and compare it with the existing template in the database?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks in advance&lt;/P&gt; 
&lt;P&gt;Krishu&lt;/P&gt;</description>
      <pubDate>Tue, 27 Nov 2018 12:08:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277928#M53543</guid>
      <dc:creator>krishu</dc:creator>
      <dc:date>2018-11-27T12:08:23Z</dc:date>
    </item>
    <item>
      <title>Re: Source File Delimiter Capture</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277929#M53544</link>
      <description>&lt;P&gt;If you know the structure (because you have a template), it is simple:&lt;/P&gt;&lt;P&gt;read the first line of the file and check what is between two columns.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, in the template you have&lt;/P&gt;&lt;P&gt;id;name;phone&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and in the file&lt;/P&gt;&lt;P&gt;id,name,phone&lt;/P&gt;&lt;P&gt;All you need to do is check the 3rd character, the one right after the first column name.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Of course, this is simplified logic; the real logic could be different.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Nov 2018 12:20:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277929#M53544</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2018-11-27T12:20:55Z</dc:date>
    </item>
    <item>
      <title>Re: Source File Delimiter Capture</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277930#M53545</link>
      <description>&lt;P&gt;That's an interesting problem. If the delimiter can only be one of several things (e.g. comma, semi-colon, etc.), and the data itself doesn't contain a lot of possible delimiters in close sequence, then you could read the first several lines and count the number of commas, etc. in each. If the first 10 lines each contain 20 commas, but only 3 semi-colons and zero pipes, then your delimiter is probably the comma.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
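&lt;P&gt;Here is a minimal Java sketch of the counting idea above, which could live in a Talend routine. The class DelimiterSniffer, its method names and the candidate list are hypothetical, not part of Talend. It also assumes a stricter rule than the one described: a candidate only qualifies if it appears the same non-zero number of times on every sampled line, and ties go to the higher count.&lt;/P&gt;

```java
// Sketch only: class name, method names and candidate list are illustrative assumptions.
final class DelimiterSniffer {

    // Candidate delimiters mentioned in this thread: comma, semicolon, pipe.
    static final char[] CANDIDATES = {',', ';', '|'};

    // Counts how many times ch occurs in line.
    static int count(String line, char ch) {
        int n = 0;
        for (char c : line.toCharArray()) {
            if (c == ch) {
                n++;
            }
        }
        return n;
    }

    // Returns the candidate that occurs the same non-zero number of times in
    // every sampled line; if several qualify, the higher per-line count wins.
    // Returns '\0' when no candidate qualifies.
    static char detect(String[] sampleLines) {
        char best = '\0';
        int bestCount = 0;
        if (sampleLines.length == 0) {
            return best;
        }
        for (char candidate : CANDIDATES) {
            int expected = count(sampleLines[0], candidate);
            boolean consistent = expected > 0;
            for (String line : sampleLines) {
                if (count(line, candidate) != expected) {
                    consistent = false;
                }
            }
            if (consistent) {
                if (expected > bestCount) {
                    best = candidate;
                    bestCount = expected;
                }
            }
        }
        return best;
    }

    // True when the detected delimiter equals the delimiter stored in the template.
    static boolean matchesTemplate(String[] sampleLines, String templateDelimiter) {
        char detected = detect(sampleLines);
        if (detected == '\0') {
            return false;
        }
        return templateDelimiter.equals(String.valueOf(detected));
    }

    public static void main(String[] args) {
        // The second line contains a stray comma inside the data.
        String[] sample = {"id;name;phone", "1;Ann, Lee;555", "2;Bob;556"};
        System.out.println(detect(sample));               // prints ;
        System.out.println(matchesTemplate(sample, ";")); // prints true
        System.out.println(matchesTemplate(sample, "|")); // prints false
    }
}
```

&lt;P&gt;The stray comma in the second sample line does not win because the header line has no commas at all. Quoted fields that contain the real delimiter would break the equal-count rule, so treat this as a starting point rather than a finished check.&lt;/P&gt;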
&lt;P&gt;The more potential delimiters you have in the actual data, the more rows you need to read to be sure you've found the actual delimiter; even so, if you read 1000 rows, and every single one has exactly 20 commas, then the probability of the comma *not* being the delimiter is vanishingly small (I'm tempted to estimate it based on the relative density of each possible delimiter, but it's late, and I've had a long day).&lt;/P&gt;</description>
      <pubDate>Wed, 28 Nov 2018 02:52:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277930#M53545</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-11-28T02:52:23Z</dc:date>
    </item>
    <item>
      <title>Re: Source File Delimiter Capture</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277931#M53546</link>
      <description>&lt;P&gt;The template in DB is like this:&lt;/P&gt;
&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;File_Name&lt;/TH&gt;&lt;TH&gt;Header_Info&lt;/TH&gt;&lt;TH&gt;Delimiter_Type&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;ABD.txt&lt;/TD&gt;&lt;TD&gt;Y&lt;/TD&gt;&lt;TD&gt;;&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;XYZ.txt&lt;/TD&gt;&lt;TD&gt;Y&lt;/TD&gt;&lt;TD&gt;|&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The source file, XYZ.txt, looks like this:&lt;/P&gt;
&lt;P&gt;Name | ID | City&lt;/P&gt;
&lt;P&gt;-----------------------&lt;/P&gt;
&lt;P&gt;Krishu |10 | Bangalore&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have to capture the delimiter used in the source file (I don't know in advance which delimiter it uses) and compare it with the template's Delimiter_Type (as above). If they match, I should process the file further; otherwise, I should reject the source file.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance&lt;/P&gt;
&lt;P&gt;Krishu&lt;/P&gt;</description>
      <pubDate>Thu, 29 Nov 2018 06:17:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Source-File-Delimiter-Capture/m-p/2277931#M53546</guid>
      <dc:creator>krishu</dc:creator>
      <dc:date>2018-11-29T06:17:27Z</dc:date>
    </item>
  </channel>
</rss>

