
Contributor
2015-07-13
10:47 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
MongoInput (Json) to Hive (text) using tExtractJSON giving an error.
Hi Experts,
I'm trying to load HIVE tables (text - Flat) format from MongoDB (JSON format). I have used tExtractJSONFields component, but getting an error " Error on line 1 of document : The content of elements must consist of well-formed character data or markup. Nested exception: The content of elements must consist of well-formed character data or markup.". Searched talendforge forum for help, but didn't help much.
Here is my mapping
tMongoInput --> tExtractJsonField ---> tlogRow --->tHDFSOutput ---> tHIVECreateTable ---> tHIVELoad
Any help is much appreciated.
Regards
cmadaka
I'm trying to load HIVE tables (text - Flat) format from MongoDB (JSON format). I have used tExtractJSONFields component, but getting an error " Error on line 1 of document : The content of elements must consist of well-formed character data or markup. Nested exception: The content of elements must consist of well-formed character data or markup.". Searched talendforge forum for help, but didn't help much.
Here is my mapping
tMongoInput --> tExtractJsonField ---> tlogRow --->tHDFSOutput ---> tHIVECreateTable ---> tHIVELoad
Any help is much appreciated.
Regards
cmadaka
215 Views
2 Replies

Anonymous
Not applicable
2015-07-16
07:14 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Did you check your json source file to see if it is well-formed (Json Validator JSONLint)?
How did you set your tExtractJsonField component? Could you please also paste your json source file here(a sample content is OK)?
Best regards
Sabrina
Did you check your json source file to see if it is well-formed (Json Validator JSONLint)?
How did you set your tExtractJsonField component? Could you please also paste your json source file here(a sample content is OK)?
Best regards
Sabrina
215 Views

Contributor
2015-07-16
10:21 AM
Author
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thanks Sabrina. tExtractJsonField was set-up using
Read by : XPath and
JSON Field : triggerFired,
Loop XPath Query : ""
Here is the sample JSON record. If i remove the FiringDate field in the subdocument, job seems to run fine. other wise i get an error. Don't have any issue with insertDate and createtimestamp date field.
{
"_id" : ObjectId("22349j3242394-234"),
"insertDate" : ISODate("2014-06-30T04:00:00.000Z"),
"Identifier" : "Raw Data",
"status" : "Accepted",
"userId" : "93837574849",
"createdTimestamp" : ISODate("2015-07-10T22:01:44.000Z"),
"triggerFired" :
}
Read by : XPath and
JSON Field : triggerFired,
Loop XPath Query : ""
Here is the sample JSON record. If i remove the FiringDate field in the subdocument, job seems to run fine. other wise i get an error. Don't have any issue with insertDate and createtimestamp date field.
{
"_id" : ObjectId("22349j3242394-234"),
"insertDate" : ISODate("2014-06-30T04:00:00.000Z"),
"Identifier" : "Raw Data",
"status" : "Accepted",
"userId" : "93837574849",
"createdTimestamp" : ISODate("2015-07-10T22:01:44.000Z"),
"triggerFired" :
}
215 Views
