<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problem with tExtractRegexFields component in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364096#M127891</link>
    <description>Hi,
&lt;BR /&gt;Thanks for your answer.
&lt;BR /&gt;My regex split my line in 19 fields, eg :
&lt;BR /&gt; line = 08/01/2012:23:59:59 +0100 192.168.90.17 "" "GET /templates/toto&amp;amp;typeUnivers=R HTTP/1.1" 200 344 "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; GTB7.2; SLCC1; .NET CLR 2.0.50727; Media Center PC 5.0; InfoPath.2; .NET CLR 3.5.30729; .NET CLR 3.0.30618; .NET4.0C)]" "
&lt;A href="http://www.xxx.com/MAG000036.swf" rel="nofollow noopener noreferrer"&gt;http://www.xxx.com/MAG000036.swf&lt;/A&gt;" external 
&lt;A href="http://www.xxx.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.xxx.com&lt;/A&gt; "192.168.57.107" 27 "234B2789A03896013D3A12A0652D0DF9" PRODUCTION_EXTERNAL_ORI 192.168.2.230:8080 "." "vide"
&lt;BR /&gt;And after the split i'll do have :
&lt;BR /&gt;field1=
&lt;BR /&gt;field2=08/01/2012:23:59:59
&lt;BR /&gt;Field3=+0100
&lt;BR /&gt;field4=""
&lt;BR /&gt;field5="GET /templates/toto&amp;amp;typeUnivers=R HTTP/1.1"
&lt;BR /&gt;...
&lt;BR /&gt;field19="vide"
&lt;BR /&gt;Each line is in CSV format with whitespace delimiter and character " optionnaly enclose string.
&lt;BR /&gt;Regards,</description>
    <pubDate>Tue, 10 Jan 2012 10:34:47 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-01-10T10:34:47Z</dc:date>
    <item>
      <title>Problem with tExtractRegexFields component</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364094#M127889</link>
      <description>Hi,
&lt;BR /&gt;I use TOS 5.0.1.
&lt;BR /&gt;I put tExtractRegexFields on a job and for 1 line it split on several lines instead of several fields ...
&lt;BR /&gt;My regexp : "\"(+?)\"\\s?|(+)\\s?|\\s"
&lt;BR /&gt;Data to split (3 lines) : 
&lt;BR /&gt; 08/01/2012:23:59:59 +0100 192.168.90.17 "" "GET /templates/toto&amp;amp;typeUnivers=R HTTP/1.1" 200 344 "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; GTB7.2; SLCC1; .NET CLR 2.0.50727; Media Center PC 5.0; InfoPath.2; .NET CLR 3.5.30729; .NET CLR 3.0.30618; .NET4.0C)]" "
&lt;A href="http://www.xxx.com/MAG000036.swf" target="_blank" rel="nofollow noopener noreferrer"&gt;http://www.xxx.com/MAG000036.swf&lt;/A&gt;" external 
&lt;A href="http://www.xxx.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.xxx.com&lt;/A&gt; "192.168.57.107" 27 "234B2789A03896013D3A12A0652D0DF9" PRODUCTION_EXTERNAL_ORI 192.168.2.230:8080 "." "vide"
&lt;BR /&gt; 09/01/2012:00:00:00 +0100 192.168.250.202 "" "GET /templates/onglets-home-univers.png HTTP/1.1" 200 12102 "Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_5; fr-fr) AppleWebKit/533.19.4 (KHTML, like Gecko) Version/5.0.3 Safari/533.19.4]" "
&lt;A href="https://www.xxx.com/creationCompteClient" target="_blank" rel="nofollow noopener noreferrer"&gt;https://www.xxx.com/creationCompteClient&lt;/A&gt;" external secure.xxx.com "192.168.250.202" 9 "750BCE11C09241CDADC7C6893E0CD5E9" PRODUCTION_EXTERNAL_SECURE 192.168.242.230:8080 "." "vide" 
&lt;BR /&gt; 09/01/2012:00:00:00 +0100 192.168.235.141 "" "GET /repository/Parapharmacie_4.png/image_w397_h320 HTTP/1.1" 200 133367 "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.2; Trident/4.0; SLCC1; .NET CLR 1.1.4325; .NET CLR 2.0.50727; .NET CLR 3.0.30729; .NET CLR 3.5.30707)]" "
&lt;A href="http://www.xxx.com/infospratiques" target="_blank" rel="nofollow noopener noreferrer"&gt;http://www.xxx.com/infospratiques&lt;/A&gt;" external 
&lt;A href="http://www.xxx.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.xxx.com&lt;/A&gt; "192.168.17.80" 127 "96C3D22C6B031DD66F09D16A8AA529F1" PRODUCTION_EXTERNAL_ORI 192.168.242.230:8080 "." "vide" 
&lt;BR /&gt;It split in 19 fields but i have 19 lines by input line ...
&lt;BR /&gt;Regards,</description>
      <pubDate>Sat, 16 Nov 2024 12:28:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364094#M127889</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:28:07Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with tExtractRegexFields component</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364095#M127890</link>
      <description>Hi
&lt;BR /&gt;I hava reproduced your job and i get 27 lines as the following image.
&lt;BR /&gt;Please show me more details about 'split in 19 fields'.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Pedro</description>
      <pubDate>Tue, 10 Jan 2012 07:03:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364095#M127890</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-10T07:03:00Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with tExtractRegexFields component</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364096#M127891</link>
      <description>Hi,
&lt;BR /&gt;Thanks for your answer.
&lt;BR /&gt;My regex split my line in 19 fields, eg :
&lt;BR /&gt; line = 08/01/2012:23:59:59 +0100 192.168.90.17 "" "GET /templates/toto&amp;amp;typeUnivers=R HTTP/1.1" 200 344 "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; GTB7.2; SLCC1; .NET CLR 2.0.50727; Media Center PC 5.0; InfoPath.2; .NET CLR 3.5.30729; .NET CLR 3.0.30618; .NET4.0C)]" "
&lt;A href="http://www.xxx.com/MAG000036.swf" rel="nofollow noopener noreferrer"&gt;http://www.xxx.com/MAG000036.swf&lt;/A&gt;" external 
&lt;A href="http://www.xxx.com" target="_blank" rel="nofollow noopener noreferrer"&gt;www.xxx.com&lt;/A&gt; "192.168.57.107" 27 "234B2789A03896013D3A12A0652D0DF9" PRODUCTION_EXTERNAL_ORI 192.168.2.230:8080 "." "vide"
&lt;BR /&gt;And after the split i'll do have :
&lt;BR /&gt;field1=
&lt;BR /&gt;field2=08/01/2012:23:59:59
&lt;BR /&gt;Field3=+0100
&lt;BR /&gt;field4=""
&lt;BR /&gt;field5="GET /templates/toto&amp;amp;typeUnivers=R HTTP/1.1"
&lt;BR /&gt;...
&lt;BR /&gt;field19="vide"
&lt;BR /&gt;Each line is in CSV format with whitespace delimiter and character " optionnaly enclose string.
&lt;BR /&gt;Regards,</description>
      <pubDate>Tue, 10 Jan 2012 10:34:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problem-with-tExtractRegexFields-component/m-p/2364096#M127891</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-10T10:34:47Z</dc:date>
    </item>
  </channel>
</rss>

