<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Split out transactions in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687648#M670556</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have now developed a code to cleanse an Excel file that we get from a customer where there are often several item_numbers in one line (there is supposed to be only one per line of course).&lt;/P&gt;&lt;P&gt;The issue is, I have yet run it on only one list - of five. They are all the same format, but in the second one, the records look even more chaotic.&lt;/P&gt;&lt;P&gt;I will attach a sample - in short, the customer has packed the complete description into the item_nr_field and I cannot think of any way to identify which of these X lines (in one line) is/are relevant - in the sample, only the 1st is relevant, all the others are like a description.&lt;/P&gt;&lt;P&gt;I can't see that I have anything to go on here.&lt;/P&gt;&lt;P&gt;I have already thought of just outputting those as "post_processing" so that someone can have a look at them and process them manually - but even for that, I would need to find a way to identify that these are records my code cannot handle.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for any ideas - my own are spent for the time being.&lt;/P&gt;&lt;P&gt;Best regards,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;DataNibbler&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 09 Oct 2014 09:26:24 GMT</pubDate>
    <dc:creator>datanibbler</dc:creator>
    <dc:date>2014-10-09T09:26:24Z</dc:date>
    <item>
      <title>Split out transactions</title>
      <link>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687648#M670556</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have now developed a code to cleanse an Excel file that we get from a customer where there are often several item_numbers in one line (there is supposed to be only one per line of course).&lt;/P&gt;&lt;P&gt;The issue is, I have yet run it on only one list - of five. They are all the same format, but in the second one, the records look even more chaotic.&lt;/P&gt;&lt;P&gt;I will attach a sample - in short, the customer has packed the complete description into the item_nr_field and I cannot think of any way to identify which of these X lines (in one line) is/are relevant - in the sample, only the 1st is relevant, all the others are like a description.&lt;/P&gt;&lt;P&gt;I can't see that I have anything to go on here.&lt;/P&gt;&lt;P&gt;I have already thought of just outputting those as "post_processing" so that someone can have a look at them and process them manually - but even for that, I would need to find a way to identify that these are records my code cannot handle.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for any ideas - my own are spent for the time being.&lt;/P&gt;&lt;P&gt;Best regards,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;DataNibbler&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Oct 2014 09:26:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687648#M670556</guid>
      <dc:creator>datanibbler</dc:creator>
      <dc:date>2014-10-09T09:26:24Z</dc:date>
    </item>
    <item>
      <title>Re: Split out transactions</title>
      <link>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687649#M670558</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi DataNibbler,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;generally you need an approach like &lt;A href="https://community.qlik.com/thread/136886"&gt;Separating records&lt;/A&gt; but in this case you split firstly the records per subfield without any cleansing and checked in a second step the content for "LFS-Nr." and/or further chars to identity if this record is garbage or not and in a third step could be the cleasing from the valid record follow.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- Marcus&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Oct 2014 09:47:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687649#M670558</guid>
      <dc:creator>marcus_sommer</dc:creator>
      <dc:date>2014-10-09T09:47:48Z</dc:date>
    </item>
    <item>
      <title>Re: Split out transactions</title>
      <link>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687650#M670559</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hi!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Well, for now I can make it - I go by the nr. of letters. The maximum that can usually be expected to be there in a "clean" record is 6 ("Stueck" - pcs.) - so whenever there's more than 6, I say that record is garbage and needs post_processing.&lt;/P&gt;&lt;P&gt;However - you know the one about the race between developers and nature ... There's no limit to the creativity of users ...&lt;/P&gt;&lt;P&gt;Best regards,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;DataNibbler&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 09 Oct 2014 11:37:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Split-out-transactions/m-p/687650#M670559</guid>
      <dc:creator>datanibbler</dc:creator>
      <dc:date>2014-10-09T11:37:28Z</dc:date>
    </item>
  </channel>
</rss>

