<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Data validation against dynamic schema in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297210#M69801</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;I need to validate data which come from csv files against dynamic schema and I am using Talend 5.6 Enterprise version. I read a number of posts on Talend Community and found that tComplianceCheck cannot be used in this case as it does not support dynamic schema. Can you please advise how this can be done in Talend ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My initial approach is to create 2 separate jobs: e.g. Job1 - create schema dynamically from a schema definition file (csv), then pass this schema definition to Job2, which will loop through a number of csv files, read each one and compare/validate each record against the schema defined in Job1. My question is how I can pass a dynamic schema to Job2 to do the validation ? Is it a feasible solution ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The problem is the schema definitions are not known in advance so I cannot create it when designing the job. Users of my application would define the schema using a schema definition file (could be csv/excel) and provide the csv files to be validated against that particular schema&amp;nbsp;at runtime.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Look forward to your suggestions/advice on this topic.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks !&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 08:53:54 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T08:53:54Z</dc:date>
    <item>
      <title>Data validation against dynamic schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297210#M69801</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;I need to validate data which come from csv files against dynamic schema and I am using Talend 5.6 Enterprise version. I read a number of posts on Talend Community and found that tComplianceCheck cannot be used in this case as it does not support dynamic schema. Can you please advise how this can be done in Talend ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My initial approach is to create 2 separate jobs: e.g. Job1 - create schema dynamically from a schema definition file (csv), then pass this schema definition to Job2, which will loop through a number of csv files, read each one and compare/validate each record against the schema defined in Job1. My question is how I can pass a dynamic schema to Job2 to do the validation ? Is it a feasible solution ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The problem is the schema definitions are not known in advance so I cannot create it when designing the job. Users of my application would define the schema using a schema definition file (could be csv/excel) and provide the csv files to be validated against that particular schema&amp;nbsp;at runtime.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Look forward to your suggestions/advice on this topic.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks !&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 08:53:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297210#M69801</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T08:53:54Z</dc:date>
    </item>
    <item>
      <title>Re: Data validation against dynamic schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297211#M69802</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Could you please elaborate your case with an example with input and expected output values?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Best regards&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Sabrina&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Jan 2018 03:31:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297211#M69802</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-01-08T03:31:51Z</dc:date>
    </item>
    <item>
      <title>Re: Data validation against dynamic schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297212#M69803</link>
      <description>&lt;P&gt;Thanks for your reply. I need to validate a number of feed files against a schema definition file, which are both defined at runtime. For each set of feed files, there will be a corresponding schema definition file. For example, the schema definition file looks like below:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-left" image-alt="schema-definition-file.PNG" style="width: 400px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009LsFo.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156957iEF0BEECA35D60095/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009LsFo.png" alt="0683p000009LsFo.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The&amp;nbsp;feed files to be validated&amp;nbsp;(can come in any of the following&amp;nbsp;extensions - .dat/.data/.csv/.xlsx) have no headers and&amp;nbsp;data in feed are something like:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;111101020000001|1|&lt;BR /&gt;111201030000002|1|&lt;BR /&gt;111301030000003|1|&lt;BR /&gt;111401030000004|5678|&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;After validation, the expected output would be:&lt;/P&gt; 
&lt;P&gt;111401030000004|5678|&amp;nbsp;&amp;nbsp; ----&amp;nbsp;failed, length exceeded&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;What I want to achieve is something similar to what tComplianceCheck component does. I built a&amp;nbsp;test project using tComplianceCheck component as attached.&amp;nbsp;The difference&amp;nbsp;is that the schema is dynamic, which I am not sure how to do.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have thought about another approach to this problem, which is reading feed files using tFileInputDelimited, get metadata (datatype, length) from Talend and compare this with schema definition.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can you please let me know how this requirement could be achieved in Talend ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BR /&gt;&lt;A href="https://community.qlik.com/legacyfs/online/tlnd_dw_files/0683p000009Lrh8"&gt;TalendJob.zip&lt;/A&gt;</description>
      <pubDate>Mon, 08 Jan 2018 15:38:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297212#M69803</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-01-08T15:38:30Z</dc:date>
    </item>
    <item>
      <title>Re: Data validation against dynamic schema</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297213#M69804</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;So far, tSchemaComplianceCheck component doesn't support for dynamic schema.&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jan 2018 10:53:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-validation-against-dynamic-schema/m-p/2297213#M69804</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-01-09T10:53:43Z</dc:date>
    </item>
  </channel>
</rss>

