Skip to main content
Announcements
Global Transformation Awards! Applications are now open. Submit Entry
cancel
Showing results for 
Search instead for 
Did you mean: 
jdlo_25
Contributor
Contributor

[resolved] Parsing String (removing unicode 0xb)

Hi
Can anyone help me with this ? I'm trying to remove a unicode character (0xb) in a string(JSON) and input it in tExtractJSONfield. Is there any component that can help me do this. Thanks 
Regards,
John 
 
Labels (3)
3 Replies
Anonymous
Not applicable

Hi,
Please take a look at tReplace component, this component is used to cleanse all files before further processing.
What does your input json file with a unicode character (0xb) look like?
Best regards
Sabrina
jdlo_25
Contributor
Contributor
Author

Hello Sabrina,
Good Day !
Thank you for your suggestion. Will update you if tReplace can do the cleansing, I'm looking at doing the advance mode using the regex pattern . Hoping it will work. Thanks a lot 0683p000009MACn.png
Sorry there is an error uploading the file
jdlo_25
Contributor
Contributor
Author

Hi Sabrina, 
Good Day 
As you advised I use the tReplace to parse the json string and used the advanced setting to replace the unicode character oxb. I used this regex pattern "\\u000B" to replace the unicode  and I also tried this pattern "\u000B" but it seems that it doesn't work because I get an error on the tExtractJSONField:
Error on line 4 of document  : An invalid XML character (Unicode: 0xb) was found in the element content of the document. Nested exception: An invalid XML character (Unicode: 0xb) was found in the element content of the document. - Line: 0
Am I using the wrong regex pattern for the unicode (0xb) ?
Best Regards 
John