<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Validate Schema of Entire File - Not Just Row by Row in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342059#M109962</link>
    <description>&lt;P&gt;Thank you for your response.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We did look into using the components you mentioned.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, our ultimate solution was using the &lt;B&gt;tFileInputRegex &lt;/B&gt;component and then splitting off using the &lt;B&gt;main &lt;/B&gt;connector and the &lt;B&gt;reject &lt;/B&gt;connector and incrementing a global variable counter when a bad match is found.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We branch off from that subjob with an &lt;B&gt;onSubjobOk &lt;/B&gt;and the check the global variable and proceed with an &lt;B&gt;if &lt;/B&gt;connector.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't know if that is the most efficient workflow, but it does seem to be working correctly and reliably.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Fri, 13 Jan 2023 15:03:51 GMT</pubDate>
    <dc:creator>ml1662663516</dc:creator>
    <dc:date>2023-01-13T15:03:51Z</dc:date>
    <item>
      <title>Validate Schema of Entire File - Not Just Row by Row</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342057#M109960</link>
      <description>&lt;P&gt;We are reading in a file using &lt;B&gt;tFileInputDelimited &lt;/B&gt;and are validating schema column formats using &lt;B&gt;tSchemaComplianceCheck&lt;/B&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We see we can use &lt;B&gt;main &lt;/B&gt;and &lt;B&gt;reject &lt;/B&gt;to check each row of the file.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, we do not want to processes only the good rows.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We would like to set the job up to only process the entire file if there are no bad rows. Kind of an all-or-nothing type scenario.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Would anybody have any ideas on the components and flow we could use to perform such a scenario?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 22:12:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342057#M109960</guid>
      <dc:creator>ml1662663516</dc:creator>
      <dc:date>2024-11-15T22:12:14Z</dc:date>
    </item>
    <item>
      <title>Re: Validate Schema of Entire File - Not Just Row by Row</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342058#M109961</link>
      <description>&lt;P&gt;There is no component (that I am aware of....new components arrive all the time) to check the whole file. But what you could do here is carry on checking line by line and store the output in memory. If you have any rejected rows, you will not proceed and work on the file. If you do not get any rejected rows, then you would process the data you have kept in memory. &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To do this, you could use the tHSQLDB components (https://help.talend.com/r/en-US/7.3/hsqldb/hsqldb) or the Hash components (tHashInput - https://help.talend.com/r/en-US/8.0/technical/thashinput and tHashOutput - https://help.talend.com/r/en-US/8.0/technical/thashoutput).&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jan 2023 16:43:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342058#M109961</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-01-12T16:43:50Z</dc:date>
    </item>
    <item>
      <title>Re: Validate Schema of Entire File - Not Just Row by Row</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342059#M109962</link>
      <description>&lt;P&gt;Thank you for your response.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We did look into using the components you mentioned.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, our ultimate solution was using the &lt;B&gt;tFileInputRegex &lt;/B&gt;component and then splitting off using the &lt;B&gt;main &lt;/B&gt;connector and the &lt;B&gt;reject &lt;/B&gt;connector and incrementing a global variable counter when a bad match is found.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We branch off from that subjob with an &lt;B&gt;onSubjobOk &lt;/B&gt;and the check the global variable and proceed with an &lt;B&gt;if &lt;/B&gt;connector.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't know if that is the most efficient workflow, but it does seem to be working correctly and reliably.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2023 15:03:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342059#M109962</guid>
      <dc:creator>ml1662663516</dc:creator>
      <dc:date>2023-01-13T15:03:51Z</dc:date>
    </item>
    <item>
      <title>Re: Validate Schema of Entire File - Not Just Row by Row</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342060#M109963</link>
      <description>&lt;P&gt;So long as it suits your requirements &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2023 16:47:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Validate-Schema-of-Entire-File-Not-Just-Row-by-Row/m-p/2342060#M109963</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-01-13T16:47:07Z</dc:date>
    </item>
  </channel>
</rss>

