<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Loading a lot of data incremental with deletion (1 bill rows +) in App Development</title>
    <link>https://community.qlik.com/t5/App-Development/Loading-a-lot-of-data-incremental-with-deletion-1-bill-rows/m-p/1377677#M31211</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The fastest way to get data into qlik is the use of optimized qvd-loads. This meant you will need to store the data as qvd-files and only the new/deleted data/projects will be queried from the database and also stored as qvd. For this each project need a unique ID. I think the various links to incremental loads and optimized loadings exists() here will be quite useful for you:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.qlik.com/docs/DOC-9038"&gt;Advanced topics for creating a qlik datamodel&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- Marcus&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 08 Sep 2017 12:09:48 GMT</pubDate>
    <dc:creator>marcus_sommer</dc:creator>
    <dc:date>2017-09-08T12:09:48Z</dc:date>
    <item>
      <title>Loading a lot of data incremental with deletion (1 bill rows +)</title>
      <link>https://community.qlik.com/t5/App-Development/Loading-a-lot-of-data-incremental-with-deletion-1-bill-rows/m-p/1377676#M31210</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I am currently working on an application for Qlik sense server, however I got some technical issues / architecture to figure out.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We got an application today, where our customers can create a new project, set different parameters,&amp;nbsp; calculate the project and get loads of statistical data as output. The output consists of X datasets.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 13.3333px;"&gt;The sum of the output for each project can be between 2-10 million rows - extremely detailed data.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 13.3333px;"&gt;After a project is ran to completion in the application, it has to be in qlik sense within minutes. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The application got approx 1500 projects&amp;nbsp; today, it grows with 10-15 new projects each day.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Each customer can "recalculate" any project: The old data in qlik has to be replaced by the new &lt;BR /&gt;Each customer can delete any project at a given time: The project has to be deleted in Qlik sense &lt;BR /&gt;New project data must be in Qlik sense within minutes.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;I have created a service for extracting all the data from our application, the problem is how I load this data into qlik sense, taken into account that projects can get new data or be deleted.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have tried MongoDB and it works fine with the whole incremental process, including deletion of changed/deleted projects, but as the size of the collections grows the load time is very slow.&amp;nbsp; I have tried the Qlik connector and the &lt;SPAN style="font-size: 13.3333px;"&gt;official &lt;/SPAN&gt; connector. The official connector was a bit faster than the Qlik Mongo Connector. &lt;SPAN style="font-size: 13.3333px;"&gt;Load 200-300k rows / sec.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have also tried with flat files, which is fast, but I have no way to delete projects here. Load 4 mill rows / sec.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What is the best strategy to implement? The data can get into billions rows quite easily, and the data can be changed/deleted anytime by the customer.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Would it be better to "split" the data into different apps for each customer? So each load would be less data?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 08 Sep 2017 09:30:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/App-Development/Loading-a-lot-of-data-incremental-with-deletion-1-bill-rows/m-p/1377676#M31210</guid>
      <dc:creator />
      <dc:date>2017-09-08T09:30:07Z</dc:date>
    </item>
    <item>
      <title>Re: Loading a lot of data incremental with deletion (1 bill rows +)</title>
      <link>https://community.qlik.com/t5/App-Development/Loading-a-lot-of-data-incremental-with-deletion-1-bill-rows/m-p/1377677#M31211</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The fastest way to get data into qlik is the use of optimized qvd-loads. This meant you will need to store the data as qvd-files and only the new/deleted data/projects will be queried from the database and also stored as qvd. For this each project need a unique ID. I think the various links to incremental loads and optimized loadings exists() here will be quite useful for you:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.qlik.com/docs/DOC-9038"&gt;Advanced topics for creating a qlik datamodel&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- Marcus&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 08 Sep 2017 12:09:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/App-Development/Loading-a-lot-of-data-incremental-with-deletion-1-bill-rows/m-p/1377677#M31211</guid>
      <dc:creator>marcus_sommer</dc:creator>
      <dc:date>2017-09-08T12:09:48Z</dc:date>
    </item>
  </channel>
</rss>

