Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik and ServiceNow Partner to Bring Trusted Enterprise Context into AI-Powered Workflows. Learn More!
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Special characters

Hi,

 

I have a Delimited file with an UTF-8 encodage, but I have few characters as "é", "à"... wich are a black diamond with the character ? inside.

And a can't choose this case as condition in a tFilterRow.

 

Do you have an idea to resolve this issue ? Can you help me, please?

 

Best regards

Labels (2)
1 Solution

Accepted Solutions
TRF
Champion II
Champion II

No, you can just forget the preview (it's just a preview!) and set the Encoding as ISO-8859-15 in tFileInputDelimited (Advanced settings).

Should work.

View solution in original post

9 Replies
Anonymous
Not applicable
Author

Are you needing to get rid of the characters, or select based on them? I think Talend has a function to strip non-standard characters from text strings. To select on them, try using \uxxx and the character's Unicode value.

David
vboppudi
Partner - Creator III
Partner - Creator III

Hi,

 

Please provide some sample source data and expected output.

 

Regards,

 

Anonymous
Not applicable
Author

I don't have time to do that, but if you create a tMap component and look at the functions available under String Handling, you'll see what I'm referring to.

David
TRF
Champion II
Champion II

Hi,
Are you sure the source file is UTF-8 encoded?
When do see the "black diamond with the character ? inside"?

Can you share an extract of the input file?

Anonymous
Not applicable
Author

Hello,

 

This is my job :

tFileList_1 :

directory : "D:/SPC/FichienSensibilite"

files : "*.dat"

tFileInputDelimited_1 :

flow's name :

((Strint)globalMap.get("tFileList_1_CURRENT_FILEPATH"))

tFilterRow_1 :

 

condition

Freq    Vide    Vaut   "Fréquence=1000Hz"

Run IF :

condition :

((Integer)globalMap.get("tFilterRow_1_NB_LINE_OK")) > 0

tFileCopy_1 :

 

file's name :

((String)globalMap.get("tFile_1_CURRENT_FILEPATH"))

directory : "D:/SPC/test"

 

 

For my tFileInputDelimited_1 component, I use a metadata delimited file wich is form :

Colonne0

Date=20151202

Fréquence=1000Hz

Temps=50µs

 

Information

 

Colonne0 is a String type.

I want to search "Fréquence=1000Hz" and "Temps=50µs" in the tFilterRow_1 component, but "é" and "µ" are change by black diamonds with the character ? inside in my metadata preview.

My encode is UTF-8.

 

Best regards

BastienM

 

PS : My job is in French

TRF
Champion II
Champion II

Once again, are you sure of the file encoding?

Can you open the file using Notepad++ then, menu Encode and you should have something like that:

0683p000009LvBY.png

I just made the test with a CSV file with this encoding, the preview is OK.

If I change the file encoding to ANSI (using Notepad++) and save the file, preview is not OK as you can see here:

0683p000009Lw8O.png

Hope this helps.

Anonymous
Not applicable
Author

Hi TRF,

 

It's good for my metatdata, it was encode in ANSI so I changed its. Now I'm a new issue, it's that all input files are encode in ANSI, so what exit a solution to change all files in UTF-8 encode with an automatic process?

 

Best regards

BastienM

TRF
Champion II
Champion II

No, you can just forget the preview (it's just a preview!) and set the Encoding as ISO-8859-15 in tFileInputDelimited (Advanced settings).

Should work.

Anonymous
Not applicable
Author

Hi,

 

It's a success, thank you.

 

Best regards

BastienM