Anonymous
Not applicable

invalid XML character (unicode: 0xB) while reading file

Hello!

I have a job that is triggered by a webhook and loads an XML source file into a SQL table when a user uploads the XML file. The job looks like this: 0683p000009M613.jpg

Now, for one of the files, I get this error while running the job:

0683p000009M618.jpg

How can I have these invalid characters replaced? By using tReplace?

 

Regards

PV


Accepted Solutions
Anonymous
Not applicable
Author

Hi @priyadarshiniv 

 

In your job, before processing the file with tFileInputXML, you can cleanse it by reading the XML with tFileInputFullRow -> tReplace -> tFileOutputDelimited (cleansed file). Connect this cleansing subjob via OnSubjobOk to your main processing subjob, where tFileInputXML points to the cleansed file created by the cleansing subjob.

 

Thanks and Regards,

Subhadip


13 Replies
Anonymous
Not applicable
Author

Hi Priya,

 

     In short, if my understanding is right, you are having an issue because of unwanted characters in your input file.

 

     What is the allowed character set in your input file? From your screenshot, I can see that it is one of the European languages (apologies for not recognizing which one). If you could share a problematic data sample and the expected output, we can try some experiments with it.

 

Warm Regards,
Nikhil Thampi

Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved 🙂

Anonymous
Not applicable
Author

Hello @nthampi 

In the screenshot you can see the character. But the error I get is this:

org.xml.sax.SAXParseException; lineNumber: 1633168; An invalid XML character (Unicode: 0xc) was found in the element content of the document.
 
Regards
Priya
Anonymous
Not applicable
Author

Here is what I am getting in my file: ff (form feed, ctrl-l)

 

\x{C}    ff    form feed    ctrl-l    014    12    0xC
Anonymous
Not applicable
Author

Hi,

 

In tReplace's advanced mode, you can replace using a regex pattern.

 

Please check the attachment: 0683p000009M62L.jpg
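
Since the attached screenshot is not reproduced here, the following is only a minimal Java sketch of what such a regex replacement might look like. It assumes the goal is to drop the control characters that XML 1.0 forbids (which include the 0xB and 0xC reported in this thread); the class and method names are hypothetical, and the exact pattern used in the screenshot is not known.

```java
import java.util.regex.Pattern;

class XmlControlChars {
    // Control characters below 0x20 that XML 1.0 does not allow:
    // everything except tab (0x09), line feed (0x0A) and carriage return (0x0D).
    static final Pattern INVALID = Pattern.compile("[\\x00-\\x08\\x0B\\x0C\\x0E-\\x1F]");

    // Returns the input with all forbidden control characters removed.
    static String strip(String line) {
        return INVALID.matcher(line).replaceAll("");
    }

    public static void main(String[] args) {
        // A vertical tab (0x0B) and a form feed (0x0C) hidden in element content.
        String dirty = "<name>Jo\u000Bhn\f</name>";
        System.out.println(strip(dirty));
    }
}
```

In tReplace, the character class inside `Pattern.compile(...)` would presumably go into the search field with an empty replacement string, but check that against the component's own documentation.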

 

Thanks and Regards,

Subhadip

Anonymous
Not applicable
Author

@subhadip13 Thank you for your reply. Could you post a picture of the whole job? I want to know where to place this tReplace, because my problem is that the XML file is really very big.

Anonymous
Not applicable
Author

Hi @priyadarshiniv 

 

Please read the file line by line as a full string using tFileInputFullRow -> tReplace -> tFileOutputDelimited for the cleansed file.

 

This flow will not use a lot of memory, as it operates line by line.
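
As a rough illustration of why a line-by-line pass stays cheap on memory, here is a plain Java sketch of the same idea outside Talend. It is not the code Talend generates; the class name, the UTF-8 encoding and the control-character pattern are all assumptions made for the example.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.regex.Pattern;

class StreamingCleanser {
    // Assumed pattern: control characters forbidden by XML 1.0.
    static final Pattern INVALID = Pattern.compile("[\\x00-\\x08\\x0B\\x0C\\x0E-\\x1F]");

    // Copies source to target one line at a time, so only the current
    // line is held in memory regardless of the file size.
    static void cleanse(Path source, Path target) throws IOException {
        try (BufferedReader in = Files.newBufferedReader(source, StandardCharsets.UTF_8);
             BufferedWriter out = Files.newBufferedWriter(target, StandardCharsets.UTF_8)) {
            String line;
            while ((line = in.readLine()) != null) {
                out.write(INVALID.matcher(line).replaceAll(""));
                out.newLine(); // line endings are normalised to the platform default
            }
        }
    }
}
```

Note that this only keeps memory low if the file really contains line breaks; a very large XML document written on a single line would still be read as one string.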

 

Thanks and regards,

Subhadip

Anonymous
Not applicable
Author

Hello @subhadip13 

Thank you again for your reply. But my issue is that the file is uploaded via an application to a SharePoint location, and this XML job is triggered via a webhook. So when a user uploads the file, it triggers this job. In this case, how can tFileOutputDelimited be used? Where would I store the output file?

 

Regards

Priya

Anonymous
Not applicable
Author

Hi @priyadarshiniv 

 

In your job, before processing the file with tFileInputXML, you can cleanse it by reading the XML with tFileInputFullRow -> tReplace -> tFileOutputDelimited (cleansed file). Connect this cleansing subjob via OnSubjobOk to your main processing subjob, where tFileInputXML points to the cleansed file created by the cleansing subjob.
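
The two-stage flow described here can be sketched in plain Java as follows. This is only an illustration under stated assumptions: the cleansed copy is written to a temporary file (one possible location, not something confirmed in this thread), the input is assumed to be UTF-8, and the class, method and pattern are hypothetical stand-ins for the Talend components.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.regex.Pattern;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

class CleanseThenParse {
    // Assumed pattern: control characters forbidden by XML 1.0.
    static final Pattern INVALID = Pattern.compile("[\\x00-\\x08\\x0B\\x0C\\x0E-\\x1F]");

    // Stage 1 writes a cleansed copy; stage 2 parses that copy instead of the upload.
    static int countElements(Path uploaded) throws Exception {
        Path cleansed = Files.createTempFile("cleansed-", ".xml"); // assumed location
        try {
            // Stage 1: the "cleansing subjob", line by line.
            try (BufferedReader in = Files.newBufferedReader(uploaded, StandardCharsets.UTF_8);
                 BufferedWriter out = Files.newBufferedWriter(cleansed, StandardCharsets.UTF_8)) {
                String line;
                while ((line = in.readLine()) != null) {
                    out.write(INVALID.matcher(line).replaceAll(""));
                    out.newLine();
                }
            }
            // Stage 2: the "main subjob", which only ever sees the cleansed file.
            int[] count = {0};
            SAXParserFactory.newInstance().newSAXParser().parse(cleansed.toFile(), new DefaultHandler() {
                @Override
                public void startElement(String uri, String localName, String qName, Attributes attributes) {
                    count[0]++; // stand-in for the real row processing
                }
            });
            return count[0];
        } finally {
            Files.deleteIfExists(cleansed); // the intermediate file is not needed afterwards
        }
    }
}
```

The point of the sketch is the ordering: the parser is started only after the cleansed file is complete, which is what the OnSubjobOk link expresses in the job.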

 

Thanks and Regards,

Subhadip

Anonymous
Not applicable
Author

Yes, I got the flow, but in tFileOutputDelimited, don't I need to specify the path for the output file? Since I am loading the XML from SharePoint, I don't know where to store the tFileOutputDelimited output.

 

Regards

Priya