MongoInput (Json) to Hive (text) using tExtractJSON giving an error.
Hi Experts,
I'm trying to load HIVE tables (text - Flat) format from MongoDB (JSON format). I have used tExtractJSONFields component, but getting an error "
Error on line 1 of document : The content of elements must consist of well-formed character data or markup. Nested exception: The content of elements must consist of well-formed character data or markup.". Searched talendforge forum for help, but didn't help much.
Here is my mapping
tMongoInput --> tExtractJsonField ---> tlogRow --->tHDFSOutput ---> tHIVECreateTable ---> tHIVELoad
Any help is much appreciated.
Regards
cmadaka
Hi, Did you check your json source file to see if it is well-formed (Json Validator JSONLint)? How did you set your tExtractJsonField component? Could you please also paste your json source file here(a sample content is OK)? Best regards Sabrina
Thanks Sabrina. tExtractJsonField was set-up using
Read by : XPath and
JSON Field : triggerFired,
Loop XPath Query : ""
Here is the sample JSON record. If i remove the FiringDate field in the subdocument, job seems to run fine. other wise i get an error. Don't have any issue with insertDate and createtimestamp date field.
{
"_id" : ObjectId("22349j3242394-234"),
"insertDate" : ISODate("2014-06-30T04:00:00.000Z"),
"Identifier" : "Raw Data",
"status" : "Accepted",
"userId" : "93837574849",
"createdTimestamp" : ISODate("2015-07-10T22:01:44.000Z"),
"triggerFired" :
}