<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problems to handle dirty data in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338066#M106342</link>
    <description>&lt;P&gt;Can you send us more information on this? We will need to see an example of the data that you are using (an example, not private data) and a couple of screenshots of your job, so that we can see what you are doing at the moment.&lt;/P&gt;</description>
    <pubDate>Fri, 09 Dec 2022 19:20:14 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2022-12-09T19:20:14Z</dc:date>
    <item>
      <title>Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338065#M106341</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;i'm trying to handle a dirty data that i have in a csv file. I have an integer column but some values are "n.a."&lt;/P&gt;&lt;P&gt;i've tried to handle this error in a tmap but i couldn't do that.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 22:16:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338065#M106341</guid>
      <dc:creator>lcupito</dc:creator>
      <dc:date>2024-11-15T22:16:58Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338066#M106342</link>
      <description>&lt;P&gt;Can you send us more information on this? We will need to see an example of the data that you are using (an example, not private data) and a couple of screenshots of your job, so that we can see what you are doing at the moment.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Dec 2022 19:20:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338066#M106342</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-09T19:20:14Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338067#M106343</link>
      <description>&lt;P&gt;Yes, this is sometimes a mess. In your case I suggest to use as column type of the csv source column String as type.&lt;/P&gt;&lt;P&gt;You can use the component tConvertType and on its output set the actual integer columns as Integer typed.&lt;/P&gt;&lt;P&gt;Take care the tConvertType is not set to Die on Error and you can also lead the mistaken values to a reject flow.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Another solution is using a routine. &lt;/P&gt;&lt;P&gt;I have created a lot of them and you can use the source code as you like. E.g. in your use case you can use the NumberUtil and here the method getFailSaveInt(...) in the tMap to convert the String fail-save into a Number.&lt;/P&gt;&lt;P&gt;Take a look in this project and here in the source folder src/routines: https://github.com/jlolling/talend_routines&lt;/P&gt;&lt;P&gt;To install a routine create a new routine with the same name you see here and after successfully create (with a dummy method) replace the whole content with the content of the source code in this project.&lt;/P&gt;</description>
      <pubDate>Sun, 11 Dec 2022 22:23:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338067#M106343</guid>
      <dc:creator>jlolling</dc:creator>
      <dc:date>2022-12-11T22:23:28Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338068#M106344</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;this is the job&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000aFPD9AAO.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137433i53A781AE83CFB228/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000aFPD9AAO.jpg" alt="0695b00000aFPD9AAO.jpg" /&gt;&lt;/span&gt;the tmap structure&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000aFPDJAA4.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/146869iFCE9E3AA57D58F3F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000aFPDJAA4.png" alt="0695b00000aFPDJAA4.png" /&gt;&lt;/span&gt;dirty data i have &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000aFPDOAA4.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156000i6AEC1772421C11AE/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000aFPDOAA4.jpg" alt="0695b00000aFPDOAA4.jpg" /&gt;&lt;/span&gt;the errors&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0695b00000aFPRBAA4.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/140140iC92B20DA0245AA10/image-size/large?v=v2&amp;amp;px=999" role="button" title="0695b00000aFPRBAA4.jpg" alt="0695b00000aFPRBAA4.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 12 Dec 2022 09:10:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338068#M106344</guid>
      <dc:creator>lcupito</dc:creator>
      <dc:date>2022-12-12T09:10:30Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338069#M106345</link>
      <description>&lt;P&gt;Ok, I would put right after the tFileInputDelimited a tConvertType component and setup the problematic column as String typed and the outgoing column of the tConvertType as integer column.&lt;/P&gt;</description>
      <pubDate>Mon, 12 Dec 2022 10:04:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338069#M106345</guid>
      <dc:creator>jlolling</dc:creator>
      <dc:date>2022-12-12T10:04:04Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338070#M106346</link>
      <description>&lt;P&gt;Or you install the routines I have described and use them in the expression of the output.&lt;/P&gt;</description>
      <pubDate>Mon, 12 Dec 2022 10:09:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338070#M106346</guid>
      <dc:creator>jlolling</dc:creator>
      <dc:date>2022-12-12T10:09:16Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338071#M106347</link>
      <description>&lt;P&gt;@Luca Cupitò​&amp;nbsp;it looks like @Jan Lolling​&amp;nbsp;got here first. I'd do exactly the same as he suggested. He deserves a best answer for this &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 12 Dec 2022 13:08:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338071#M106347</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-12T13:08:46Z</dc:date>
    </item>
    <item>
      <title>Re: Problems to handle dirty data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338072#M106348</link>
      <description>&lt;P&gt;Maybe you can use tReplace between tFileInputDelimited and tMap and replace “n/a” value by a valid value.  Sample as 0.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;the integer field must be defined as a string in the tFileInputDelimited, for uses it  into the tReplace component, and convert it after to integer when do you need it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;best regards&lt;/P&gt;</description>
      <pubDate>Wed, 14 Dec 2022 22:43:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problems-to-handle-dirty-data/m-p/2338072#M106348</guid>
      <dc:creator>RJLC</dc:creator>
      <dc:date>2022-12-14T22:43:38Z</dc:date>
    </item>
  </channel>
</rss>

