<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [resolved] Ignoring repeated delimiters in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345719#M113202</link>
    <description>Hi,&lt;BR /&gt;I have a column value in excel with multiple delimiters. I was to split string based on specific delimiter.&lt;BR /&gt;Example: Below value in one column&lt;BR /&gt;AAA.BBB_B.CCC&lt;BR /&gt;AAA.BBB.CCC&lt;BR /&gt;AAA.BBB_B&lt;BR /&gt;AAA..CCC&lt;BR /&gt;Result:&lt;BR /&gt;Column1|Column2|Column3&lt;BR /&gt;AAA|BBB_B|CCC&lt;BR /&gt;AAA|BBB|CCC&lt;BR /&gt;AAA|BBB_B&lt;BR /&gt;AAA||CCC&lt;BR /&gt;Regards,&lt;BR /&gt;Sathiyapriya</description>
    <pubDate>Tue, 28 Jun 2016 12:06:19 GMT</pubDate>
    <dc:creator>_AnonymousUser</dc:creator>
    <dc:date>2016-06-28T12:06:19Z</dc:date>
    <item>
      <title>[resolved] Ignoring repeated delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345715#M113198</link>
      <description>HI all,&lt;BR /&gt;(I must be having a "bad google search day" because I can't find anything that I'm looking for but I don't believe that I'm the first person with the following requirement !).&lt;BR /&gt;I have a file containing data of the following sort:&lt;BR /&gt;abc  100  def&lt;BR /&gt;ab     97     x&lt;BR /&gt;I'm trying to split each record into three fields using tExtractDelimitedFields using a blank space as the delimiter. However, where there are multiple spaces (i.e. delimiters) between two consecutive fields then my output is not what I want.&lt;BR /&gt;Using the example above I'll get the following output (as displayed by tLogRow):&lt;BR /&gt;|abc|100|def&lt;BR /&gt;|ab|||97|||x&lt;BR /&gt;Note that in the second record there are two output columns between the 'ab' and the '97', I only want one.&lt;BR /&gt;MS Excel has a useful "treat multiple delimiters as one" option. Does anything like that exist for the tExtractDelimitedFields component ?&lt;BR /&gt;Is there something better to use ?&lt;BR /&gt;I'm using TOS 3.2.3.r35442 generating Java code.&lt;BR /&gt;Cheers,&lt;BR /&gt;Dave</description>
      <pubDate>Sat, 16 Nov 2024 13:28:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345715#M113198</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T13:28:16Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Ignoring repeated delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345716#M113199</link>
      <description>Hello 
&lt;BR /&gt;Need to create a custom code to delete the Redundant " ". 
&lt;BR /&gt;in.csv:
&lt;BR /&gt;
&lt;B&gt;&lt;BR /&gt;abc 100 def&lt;BR /&gt;ab 97 x&lt;BR /&gt;ab 33 s&lt;BR /&gt;&lt;/B&gt;
&lt;BR /&gt;Go to repository--&amp;gt;routines and create a new routine, call f10515
&lt;BR /&gt;
&lt;PRE&gt;// template routine Java&lt;BR /&gt;package routines;&lt;BR /&gt;public class f10515 {&lt;BR /&gt;	public static String deleteRedundantSpace(String line) {&lt;BR /&gt;		line = line.trim();&lt;BR /&gt;		String newLine = "";&lt;BR /&gt;		if (line.contains(" ")) {&lt;BR /&gt;			String[] s = line.split(" ");&lt;BR /&gt;			for (int i = 0; i &amp;lt; s.length; i++) {&lt;BR /&gt;				if (!s&lt;I&gt;.equals("")) {&lt;BR /&gt;					newLine = newLine + " " + s&lt;I&gt;;&lt;BR /&gt;				}&lt;BR /&gt;			}&lt;BR /&gt;		}&lt;BR /&gt;		return newLine.trim();&lt;BR /&gt;	}&lt;BR /&gt;}&lt;/I&gt;&lt;/I&gt;&lt;/PRE&gt;
&lt;BR /&gt;Result:
&lt;BR /&gt;
&lt;PRE&gt;Starting job forum10515 at 14:23 08/04/2010.&lt;BR /&gt; connecting to socket on port 3948&lt;BR /&gt; connected&lt;BR /&gt;.---------+----------+----------.&lt;BR /&gt;|           tLogRow_1           |&lt;BR /&gt;|=--------+----------+---------=|&lt;BR /&gt;|newColumn|newColumn1|newColumn2|&lt;BR /&gt;|=--------+----------+---------=|&lt;BR /&gt;|abc      |100       |def       |&lt;BR /&gt;|ab       |97        |x         |&lt;BR /&gt;|ab       |33        |s         |&lt;BR /&gt;'---------+----------+----------'&lt;BR /&gt; disconnected&lt;BR /&gt;Job forum10515 ended at 14:23 08/04/2010. &lt;/PRE&gt;
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
      <pubDate>Thu, 08 Apr 2010 07:27:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345716#M113199</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-04-08T07:27:22Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Ignoring repeated delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345717#M113200</link>
      <description>Hi shong,
&lt;BR /&gt;Many thanks for that, it would have taken me a long time to get to that idea.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Dave
&lt;BR /&gt;P.S. Sorry for the delayed response, I've been onsite with a customer.</description>
      <pubDate>Fri, 09 Apr 2010 14:22:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345717#M113200</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-04-09T14:22:40Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Ignoring repeated delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345718#M113201</link>
      <description>I just did this by specifying my Field Separator as "\\s+" telling the regex to match one or more whitespace characters. 
&lt;BR /&gt;Is there any way to get the final field to grab whatever is left? My final field is an error string of variable length which includes many spaces between the words in the error. I do not want to define a large number of fields and then splice them back together.</description>
      <pubDate>Mon, 26 Apr 2010 22:57:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345718#M113201</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2010-04-26T22:57:55Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Ignoring repeated delimiters</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345719#M113202</link>
      <description>Hi,&lt;BR /&gt;I have a column value in excel with multiple delimiters. I was to split string based on specific delimiter.&lt;BR /&gt;Example: Below value in one column&lt;BR /&gt;AAA.BBB_B.CCC&lt;BR /&gt;AAA.BBB.CCC&lt;BR /&gt;AAA.BBB_B&lt;BR /&gt;AAA..CCC&lt;BR /&gt;Result:&lt;BR /&gt;Column1|Column2|Column3&lt;BR /&gt;AAA|BBB_B|CCC&lt;BR /&gt;AAA|BBB|CCC&lt;BR /&gt;AAA|BBB_B&lt;BR /&gt;AAA||CCC&lt;BR /&gt;Regards,&lt;BR /&gt;Sathiyapriya</description>
      <pubDate>Tue, 28 Jun 2016 12:06:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Ignoring-repeated-delimiters/m-p/2345719#M113202</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-06-28T12:06:19Z</dc:date>
    </item>
  </channel>
</rss>

