Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 
Not applicable

Qlik sense scripting: use of codepage & delimiter

Dears,

Very silly questions, but hey, still an apprentice 🙂

In scripting, for loads from excel files, I'm used to the below to identify my source data

FROM

                  [$(vL.PipelineFile)]

               (ooxml, embedded labels, header is 9 lines, table is [Incoming Pipeline])

               where not isnull(CRM );

ow I see for a part of our script:

FROM

            [$(vL.ExternalFundFile)]

            (txt, codepage is 1252, embedded labels, delimiter is ';', msq)

As I don't understand what this does, I can't figger out how to simply add a column from the source file to my load in Qlik.

Can somebody explain what the above means? Thanks a lot

4 Replies
beck_bakytbek
Master
Master

Hi Camille,

check this:Delimiter ‒ QlikView

i hope that helps

Beck

Not applicable
Author

Hi, thanks but no, it's not helping.

What is codepage? What does it do?

calexandru
Partner - Contributor III
Partner - Contributor III

Hi there Camille,

Provided that you find Beck's clarification in regards to delimiters satisfactory let's then try and tackle the codepage notion. You might think of them similar to character sets.


Strictly speaking a code page is a table of characters and corresponding numerical representation. This provides the means of mapping the characters to their binary values. Character sets on the other hand are just a collection of symbols without a method of encoding applied.

In simple terms the codepage bit tells Qlik which 'map' to use in order to properly interpret the characters contained in your file.

Please see below an eloquent example:

Unicode is best described at www.unicode.org. The Unicode Standard says "The Unicode Standard is the universal character encoding scheme for written characters and text.  It defines a consistent way of encoding multilingual text that enables the exchange of text data internationally..."  Basically its kind of a enormous character set that encompasses all of the other characters.  It’s encoded in UTF-8, UTF-16 or UTF-32.  All 3 UTF encodings are representations of the same set of characters.  Windows and .Net (and many other systems) use Unicode natively, and it’s the natural preferred encoding for .Net or Windows applications.

source and further reading here.

I also found this image to be helpful:

bknrf005.gif

Hope this helps!

Cheers,

Alex

Not applicable
Author

thanks Alexandru!