Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us to spark ideas for how to put the latest capabilities into action. Register here!
cancel
Showing results for 
Search instead for 
Did you mean: 
nicolas_martin
Partner - Creator II
Partner - Creator II

Problem with UTF-8 "fixed record" files

Hello,

I have data in a text file, encoded in UTF-8 (w/o BOM), with fixed records.

When the file contains a special character, this one is considered as 2 characters, and all the following data is parsed wrong (with a shift).

This file:

BRAND           MODEL              DATE    VALUE  

Audi            A3                 20140101abcdefgh

Audi            A4                 20140202abcdefgh

Audi            Coupé              20140303abcdefgh

loaded with QlikView:

Data:

LOAD @1:16 AS BRAND,

     @17:35 AS MODEL,

     @36:43 AS DATE,

     @44:n AS VALUE

FROM

test.csv

(fix, utf8,header is 1 lines);

will give me a wrong DATE for the last record: "2014030" instead of "20140303", because the "é" of "Coupé" will count as 2 characters.

And it's VALUE will be "3abcdefgh" (with a "3" that should not be there).

If I convert the same file in ANSI, I don't have the problem.

(please, don't answer me "so, convert the file in ANSI" )

0 Replies