Skip to main content
Announcements
NEW: Seamless Public Data Sharing with Qlik's New Anonymous Access Capability: TELL ME MORE!
cancel
Showing results for 
Search instead for 
Did you mean: 
Not applicable

growing data size with RAND()

Hi.

I was trying to reload Data from my qvw-file(150Mb) with Binary for making a new Demo.qvw.

I was decided to recalculate Data values with *Rand(), but after reloading my new Demo.qvw have +130Mb size.

Numbers of rows(24 000 000) and tables the same. Structure the same. Only fields value differ.

I tested on the other files, results the same.

Why?

BINARY [Data.qvw];

table_1_demo:

NoConcatenate

LOAD

Country,

Dept,

Value*Rand() as Value

Resident table_1;

DROP table table_1;

1 Solution

Accepted Solutions
Gysbert_Wassenaar

Rand() returns random values. Sometimes a value will occur more often than other times. That means the number of unique values will be different every time you reload. The number of unique values in a field determines how well that field can be compressed. A good randomizer will create nearly only unique values, so compression will be very low. See this blog post: Symbol Tables and Bit-Stuffed Pointers

Also, chart objects and background images are not loaded with a binary load. That may explain more of the size difference.


talk is cheap, supply exceeds demand

View solution in original post

2 Replies
Gysbert_Wassenaar

Rand() returns random values. Sometimes a value will occur more often than other times. That means the number of unique values will be different every time you reload. The number of unique values in a field determines how well that field can be compressed. A good randomizer will create nearly only unique values, so compression will be very low. See this blog post: Symbol Tables and Bit-Stuffed Pointers

Also, chart objects and background images are not loaded with a binary load. That may explain more of the size difference.


talk is cheap, supply exceeds demand
Not applicable
Author

Thanks.