<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How Can I Change the Batch Size in Data Quality</title>
    <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272050#M2745</link>
    <description>Hi Everyone, 
&lt;BR /&gt;I want to create regex job to using below query. I need only columns name 
&lt;BR /&gt;Please help how to create regex job 
&lt;BR /&gt;Please tell me which transformations i have to use.. if possible please share job screen short.. 
&lt;BR /&gt;Thanks a lot.. 
&lt;BR /&gt;Regular Expression Query:- 
&lt;BR /&gt;^({7,15})\s # client_ip 
&lt;BR /&gt;-\s # unused IDENT field 
&lt;BR /&gt;-\s # unused USER field 
&lt;BR /&gt;\\d{4}))\] 
&lt;BR /&gt; # request time :HH:mm:ss -0800 
&lt;BR /&gt;\s"(GET|POST)\s # HTTP verb 
&lt;BR /&gt;(*) # HTTP URI 
&lt;BR /&gt;\sHTTP/1\."\s # HTTP version 
&lt;BR /&gt;(\d{3})\s # HTTP status code 
&lt;BR /&gt;(\d+)\s # bytes returned 
&lt;BR /&gt;"(+)"\s # referrer field 
&lt;BR /&gt;" # User agent parsing, always quoted. 
&lt;BR /&gt;"? # Sometimes if the user spoofs the user_agent, they incorrectly quote it. 
&lt;BR /&gt;( # The UA string 
&lt;BR /&gt; *? # Uninteresting bits 
&lt;BR /&gt; (?: 
&lt;BR /&gt; (?: 
&lt;BR /&gt; rv: # Beginning of the gecko engine version token 
&lt;BR /&gt; (?={3,15}[]) # ensure version string size 
&lt;BR /&gt; ( # Whole gecko version 
&lt;BR /&gt; (\d{1,2}) # version_component_major 
&lt;BR /&gt; \.(\d{1,2}{0,8}) # version_component_minor 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_a 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_b 
&lt;BR /&gt; ) 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt; | 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt;) # End of UA string 
&lt;BR /&gt;"? 
&lt;BR /&gt;"</description>
    <pubDate>Tue, 05 Feb 2013 06:31:09 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-02-05T06:31:09Z</dc:date>
    <item>
      <title>How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272046#M2741</link>
      <description>Bonjour,
&lt;BR /&gt;Is there a way to change the "row batch size" of the records being processed when I run a DQ Analysis Report? When processing, TOS DQ seems to process roughly around 5000 records at a time.
&lt;BR /&gt;We find it difficult to run a column analysis when we have a 500000 rows in a table.
&lt;BR /&gt;Any idea how to improve our experience?
&lt;BR /&gt;Thanks.
&lt;BR /&gt;PM</description>
      <pubDate>Tue, 30 Oct 2012 19:45:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272046#M2741</guid>
      <dc:creator>ptremblay</dc:creator>
      <dc:date>2012-10-30T19:45:26Z</dc:date>
    </item>
    <item>
      <title>Re: How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272047#M2742</link>
      <description>If anyone know the answer Please provide me. 
&lt;BR /&gt;1) Talend takes more disk space during the execution of the job for the ETL process? 
&lt;BR /&gt;2) Talend creates files somewhere that increase the disk space in time? 
&lt;BR /&gt;3) Do you have some suggestions regarding the disk space reservation or a best practice? 
&lt;BR /&gt;------------------------------------------------------------------------- 
&lt;BR /&gt;The Job configuration uses only the following component: 
&lt;BR /&gt;tFileList 
&lt;BR /&gt;tFileinputdelimited 
&lt;BR /&gt;tMap 
&lt;BR /&gt;tAggregateRow 
&lt;BR /&gt;tOracleInput 
&lt;BR /&gt;tUnite 
&lt;BR /&gt; 
&lt;BR /&gt;Thanks, 
&lt;BR /&gt;Girish</description>
      <pubDate>Wed, 31 Oct 2012 07:25:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272047#M2742</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-10-31T07:25:13Z</dc:date>
    </item>
    <item>
      <title>Re: How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272048#M2743</link>
      <description>Hi Everyone, 
&lt;BR /&gt;Please find the below query.. I want only columns name can u please help me to how can i create Regular Expression job 
&lt;BR /&gt;Regular Expression Query:- 
&lt;BR /&gt;^({7,15})\s # client_ip 
&lt;BR /&gt;-\s # unused IDENT field 
&lt;BR /&gt;-\s # unused USER field 
&lt;BR /&gt;\\d{4}))\] 
&lt;BR /&gt; # request time :HH:mm:ss -0800 
&lt;BR /&gt;\s"(GET|POST)\s # HTTP verb 
&lt;BR /&gt;(*) # HTTP URI 
&lt;BR /&gt;\sHTTP/1\."\s # HTTP version 
&lt;BR /&gt;(\d{3})\s # HTTP status code 
&lt;BR /&gt;(\d+)\s # bytes returned 
&lt;BR /&gt;"(+)"\s # referrer field 
&lt;BR /&gt;" # User agent parsing, always quoted. 
&lt;BR /&gt;"? # Sometimes if the user spoofs the user_agent, they incorrectly quote it. 
&lt;BR /&gt;( # The UA string 
&lt;BR /&gt; *? # Uninteresting bits 
&lt;BR /&gt; (?: 
&lt;BR /&gt; (?: 
&lt;BR /&gt; rv: # Beginning of the gecko engine version token 
&lt;BR /&gt; (?={3,15}) # ensure version string size 
&lt;BR /&gt; ( # Whole gecko version 
&lt;BR /&gt; (\d{1,2}) # version_component_major 
&lt;BR /&gt; \.(\d{1,2}{0,8}) # version_component_minor 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_a 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_b 
&lt;BR /&gt; ) 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt; | 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt;) # End of UA string 
&lt;BR /&gt;"? 
&lt;BR /&gt;" 
&lt;BR /&gt;================================= 
&lt;BR /&gt; 
&lt;BR /&gt;Capture Group Fields: Add the following all with Type String 
&lt;BR /&gt;i.client_ip 
&lt;BR /&gt;ii.full_request_date 
&lt;BR /&gt;iii.day 
&lt;BR /&gt;iv.month 
&lt;BR /&gt;v.year 
&lt;BR /&gt;vi.hour 
&lt;BR /&gt;vii.minute 
&lt;BR /&gt;viii.second 
&lt;BR /&gt;ix.timezone 
&lt;BR /&gt;x.http_verb 
&lt;BR /&gt;xi.uri 
&lt;BR /&gt;xii.http_status_code 
&lt;BR /&gt;xiii.bytes_returned 
&lt;BR /&gt;xiv.referrer 
&lt;BR /&gt;xv.user_agent 
&lt;BR /&gt;xvi.firefox_gecko_version 
&lt;BR /&gt;xvii.firefox_gecko_version_major 
&lt;BR /&gt;xviii.firefox_gecko_version_minor 
&lt;BR /&gt;xix.firefox_gecko_version_a 
&lt;BR /&gt;xx.firefox_gecko_version_b</description>
      <pubDate>Mon, 04 Feb 2013 10:21:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272048#M2743</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-04T10:21:47Z</dc:date>
    </item>
    <item>
      <title>Re: How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272049#M2744</link>
      <description>@ptremblay, column set analysis require indeed a lot of memory when there too many distinct rows. 
&lt;BR /&gt;One way to avoid memory issue is to not store the data in the analysis file (there is an option in the analysis editor for that). 
&lt;BR /&gt;Another way to avoid crashes is to fine tune the memory in the preference page "profiling&amp;gt;analysis tuning" 
&lt;BR /&gt;If you are executing column analyses (not column set analysis), then some indicators also require some memory (either from the DBMS server side or from the studio if you are using the java engine). 
&lt;BR /&gt;Please, provide more details about what you're doing. 
&lt;BR /&gt;@Grirish_Shiva, please open another thread in our TOS for DQ forum for your question as it's not related to the data profiling product.</description>
      <pubDate>Mon, 04 Feb 2013 13:25:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272049#M2744</guid>
      <dc:creator>Sebastiao_Qlik</dc:creator>
      <dc:date>2013-02-04T13:25:00Z</dc:date>
    </item>
    <item>
      <title>Re: How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272050#M2745</link>
      <description>Hi Everyone, 
&lt;BR /&gt;I want to create regex job to using below query. I need only columns name 
&lt;BR /&gt;Please help how to create regex job 
&lt;BR /&gt;Please tell me which transformations i have to use.. if possible please share job screen short.. 
&lt;BR /&gt;Thanks a lot.. 
&lt;BR /&gt;Regular Expression Query:- 
&lt;BR /&gt;^({7,15})\s # client_ip 
&lt;BR /&gt;-\s # unused IDENT field 
&lt;BR /&gt;-\s # unused USER field 
&lt;BR /&gt;\\d{4}))\] 
&lt;BR /&gt; # request time :HH:mm:ss -0800 
&lt;BR /&gt;\s"(GET|POST)\s # HTTP verb 
&lt;BR /&gt;(*) # HTTP URI 
&lt;BR /&gt;\sHTTP/1\."\s # HTTP version 
&lt;BR /&gt;(\d{3})\s # HTTP status code 
&lt;BR /&gt;(\d+)\s # bytes returned 
&lt;BR /&gt;"(+)"\s # referrer field 
&lt;BR /&gt;" # User agent parsing, always quoted. 
&lt;BR /&gt;"? # Sometimes if the user spoofs the user_agent, they incorrectly quote it. 
&lt;BR /&gt;( # The UA string 
&lt;BR /&gt; *? # Uninteresting bits 
&lt;BR /&gt; (?: 
&lt;BR /&gt; (?: 
&lt;BR /&gt; rv: # Beginning of the gecko engine version token 
&lt;BR /&gt; (?={3,15}[]) # ensure version string size 
&lt;BR /&gt; ( # Whole gecko version 
&lt;BR /&gt; (\d{1,2}) # version_component_major 
&lt;BR /&gt; \.(\d{1,2}{0,8}) # version_component_minor 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_a 
&lt;BR /&gt; (?:\.(\d{1,2}{0,8}))? # version_component_b 
&lt;BR /&gt; ) 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt; | 
&lt;BR /&gt; * # More uninteresting bits 
&lt;BR /&gt; ) 
&lt;BR /&gt;) # End of UA string 
&lt;BR /&gt;"? 
&lt;BR /&gt;"</description>
      <pubDate>Tue, 05 Feb 2013 06:31:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272050#M2745</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-05T06:31:09Z</dc:date>
    </item>
    <item>
      <title>Re: How Can I Change the Batch Size</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272051#M2746</link>
      <description>Hi, Grirish_Shiva 
&lt;BR /&gt;Please create a new topic for your regex job so that more professional guys in forum will see your requirement and help you asap, Thanks a lot. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Tue, 05 Feb 2013 06:56:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-Can-I-Change-the-Batch-Size/m-p/2272051#M2746</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-05T06:56:27Z</dc:date>
    </item>
  </channel>
</rss>

