<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Dynamic Job, Single job for multiple files, Multiple Schema, etc. in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218756#M13827</link>
    <description>&lt;P&gt;Hi Shong,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thank you very much for quick response. Few reasons I mentioned to the client - more generic job you try to design -&lt;/P&gt; 
&lt;P&gt;1) Overall design becomes complex and difficult to maintain&lt;/P&gt; 
&lt;P&gt;2) Testing such jobs also becomes difficult&lt;/P&gt; 
&lt;P&gt;3) Massive configuration table / file means you need to train people to provide accurate information to the job&lt;/P&gt; 
&lt;P&gt;4) If such job breaks down - debugging also becomes challenging&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks,&lt;/P&gt; 
&lt;P&gt;Nishad Joshi.&lt;/P&gt;</description>
    <pubDate>Fri, 24 Apr 2020 17:50:18 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2020-04-24T17:50:18Z</dc:date>
    <item>
      <title>Dynamic Job, Single job for multiple files, Multiple Schema, etc.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218754#M13825</link>
      <description>&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;Hi Everyone,&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;One of the questions I get whenever I visit any client for a Talend project / assignment is &lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;Can I have one single job which can process multiple files with different schema, format, etc. ? I should be able to carry out following transformations for example - &lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;1) Read Data from File&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;2) Sort based on column(s)&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;3) Filter records on some condition(s)&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;4) Aggregate data&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;5) Store it in individual table ( each file will have separate table )&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;&lt;STRONG&gt;Which file to process, which column to use for sorting, condition to filter data, columns to aggregate data - all this information should be able to pass to Talend job through a configuration file / table.&lt;/STRONG&gt; &lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;Those who have worked on other ETL tools like Pentaho, Ab intio will know this is very much possible through these tools. In case of Petaho - metadata injection feature allows you achieve this. &lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;I understand - there is a feature available in the form of dynamic schema ( Enterprise Edition ). But it does not really allow to implement the use case mentioned above. &lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;I've had hard time to explain this to one of my client. But on a second thought - it appears feature like this would be useful in situations where multiple source files need to go through set number of transformations. &lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2" face="tahoma,arial,helvetica,sans-serif"&gt;Therefore just want to understand what community members and Talend team think about this? &lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2020 20:11:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218754#M13825</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-04-21T20:11:25Z</dc:date>
    </item>
    <item>
      <title>Re: Dynamic Job, Single job for multiple files, Multiple Schema, etc.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218755#M13826</link>
      <description>Hi 
&lt;BR /&gt;In Talend, there are different component to read each type of file format, and the columns to be sorted, to be aggregated should be defined at design time, can't pass these information at runtime, so it is impossible achieve this use case with a single generic job. 
&lt;BR /&gt;
&lt;BR /&gt;Regards
&lt;BR /&gt;Shong</description>
      <pubDate>Fri, 24 Apr 2020 08:18:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218755#M13826</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-04-24T08:18:00Z</dc:date>
    </item>
    <item>
      <title>Re: Dynamic Job, Single job for multiple files, Multiple Schema, etc.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218756#M13827</link>
      <description>&lt;P&gt;Hi Shong,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thank you very much for quick response. Few reasons I mentioned to the client - more generic job you try to design -&lt;/P&gt; 
&lt;P&gt;1) Overall design becomes complex and difficult to maintain&lt;/P&gt; 
&lt;P&gt;2) Testing such jobs also becomes difficult&lt;/P&gt; 
&lt;P&gt;3) Massive configuration table / file means you need to train people to provide accurate information to the job&lt;/P&gt; 
&lt;P&gt;4) If such job breaks down - debugging also becomes challenging&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks,&lt;/P&gt; 
&lt;P&gt;Nishad Joshi.&lt;/P&gt;</description>
      <pubDate>Fri, 24 Apr 2020 17:50:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Dynamic-Job-Single-job-for-multiple-files-Multiple-Schema-etc/m-p/2218756#M13827</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-04-24T17:50:18Z</dc:date>
    </item>
  </channel>
</rss>

