<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How does DataPrepRun in a Standard job works? in Data Quality</title>
    <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269190#M2437</link>
    <description>&lt;P&gt;Thanks for your reply!, just to be sure I got it right&lt;/P&gt;
&lt;P&gt;if I use a tDataPrepRun component to use a recipe built on Data Preparation Cloud and the job using that component is being executed in an onPrem remote engine then for every flow of data it will be uploaded to Talend cloud, run the recipe and download the results.&lt;/P&gt;
&lt;P&gt;Is that right?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If that's right then it would explain the performance difference with the local regEx&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Damian&lt;/P&gt;</description>
    <pubDate>Wed, 30 Jan 2019 14:25:12 GMT</pubDate>
    <dc:creator>dbeltritti</dc:creator>
    <dc:date>2019-01-30T14:25:12Z</dc:date>
    <item>
      <title>How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269187#M2434</link>
      <description>&lt;P&gt;We've Talend Cloud Data Preparation and I created a recipe based on a dataset that I loaded as test file which is basically doing a regEx replace in a column.&lt;/P&gt;
&lt;P&gt;Then I've a standard job that runs on a remote engine onprem that basically reads several files and applies the data preparation recipe using a tDataPrepRun component.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I tested performance using the tDataPrepRun component and a standard regEx tReplace component and the later is like 4 times faster than using the tDataPrepRun.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So I'm wondering how the Data Preparation recipe is applied? Does it downloads the recipe everytime from the cloud and apply it locally in the RunTime engine? or does it upload the data to Talend Cloud, applies the recipe and then downloads it back?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If anyone knows the details please let me know&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 06:43:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269187#M2434</guid>
      <dc:creator>dbeltritti</dc:creator>
      <dc:date>2024-11-16T06:43:45Z</dc:date>
    </item>
    <item>
      <title>Re: How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269188#M2435</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;I have raised the query to Product Team using JIRA Ticket and below is the link for your reference.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;A href="https://jira.talendforge.org/browse/TDP-6730" target="_blank" rel="nofollow noopener noreferrer"&gt;https://jira.talendforge.org/browse/TDP-6730&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;Regarding your last query, the data is never loaded to Talend Cloud as Talend Cloud is fetching only metadata information to control the job. All the other details will be processed directly from your remote engine.&lt;/P&gt; 
&lt;P&gt;&lt;BR /&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt; 
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jan 2019 04:25:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269188#M2435</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-30T04:25:46Z</dc:date>
    </item>
    <item>
      <title>Re: How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269189#M2436</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;When it comes to tDataPrepRun, the following documentation page should answer your question:&amp;nbsp;&lt;A href="https://help.talend.com/reader/rGfDn9c_Qjv5~4P5XcYKbw/tClZKcGIQ9tfYAAOSeeg7w.%C2%A0Short" target="_blank" rel="nofollow noopener noreferrer noopener noreferrer"&gt;https://help.talend.com/reader/rGfDn9c_Qjv5~4P5XcYKbw/tClZKcGIQ9tfYAAOSeeg7w.&amp;nbsp;Short&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;version:&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;For DI jobs, the processing is performed on the Data Prep server&lt;/LI&gt; 
 &lt;LI&gt;For Big Data jobs, the processing is performed on the cluster&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;We plan to align the DI behavior to the Big Data one, but there is no confirmed ETA yet. There is no difference between on-prem and Cloud, btw. Same principles apply.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;As a side note, the runtime used when running a preparation directly from the UI is described here: &lt;A href="https://help.talend.com/reader/94sQcluQTA3Bds1QWGANTw/49m3unRzMnXJnX7tIv79mA" target="_blank" rel="nofollow noopener noreferrer noopener noreferrer"&gt;https://help.talend.com/reader/94sQcluQTA3Bds1QWGANTw/49m3unRzMnXJnX7tIv79mA&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Cheers,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Gwendal&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jan 2019 08:59:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269189#M2436</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-30T08:59:00Z</dc:date>
    </item>
    <item>
      <title>Re: How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269190#M2437</link>
      <description>&lt;P&gt;Thanks for your reply!, just to be sure I got it right&lt;/P&gt;
&lt;P&gt;if I use a tDataPrepRun component to use a recipe built on Data Preparation Cloud and the job using that component is being executed in an onPrem remote engine then for every flow of data it will be uploaded to Talend cloud, run the recipe and download the results.&lt;/P&gt;
&lt;P&gt;Is that right?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If that's right then it would explain the performance difference with the local regEx&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Damian&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jan 2019 14:25:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269190#M2437</guid>
      <dc:creator>dbeltritti</dc:creator>
      <dc:date>2019-01-30T14:25:12Z</dc:date>
    </item>
    <item>
      <title>Re: How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269191#M2438</link>
      <description>Thanks for creating the ticket!, really appreciate it</description>
      <pubDate>Wed, 30 Jan 2019 14:25:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269191#M2438</guid>
      <dc:creator>dbeltritti</dc:creator>
      <dc:date>2019-01-30T14:25:39Z</dc:date>
    </item>
    <item>
      <title>Re: How does DataPrepRun in a Standard job works?</title>
      <link>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269192#M2439</link>
      <description>&lt;P&gt;Yes, that is correct. And yes, that fully explains the performance discrepancy ... and why we want to review the way DI jobs work with tDataPrepRun to mimic Big Data jobs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Gwendal&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jan 2019 15:54:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/How-does-DataPrepRun-in-a-Standard-job-works/m-p/2269192#M2439</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-30T15:54:35Z</dc:date>
    </item>
  </channel>
</rss>

