Skip to main content
Announcements
Qlik Connect 2025: 3 days of full immersion in data, analytics, and AI. May 13-15 | Orlando, FL: Learn More

tFileInputXML Component does not extract element values within a Spark Job

No ratings
cancel
Showing results for 
Search instead for 
Did you mean: 
TalendSolutionExpert
Contributor II
Contributor II

tFileInputXML Component does not extract element values within a Spark Job

Last Update:

Feb 9, 2024 1:22:49 PM

Updated By:

Jamie_Gregory

Created date:

Apr 1, 2021 6:15:32 AM

Talend Version (Required)      6.3.1

Summary

 
Additional Versions 
Product (Required)Big Data
Component (Required)Studio Spark
Problem Description

A Talend 6.3.1 Spark Job contains a tFileInputXML component to extract XML element values (for instance, here, ID) within element (Incident) that has an Attribute (Active) from an XML document:

 

<Incident Active="true">
<ID>Incident2017</ID>
<AssignmentGroup>FoundationTeam</AssignmentGroup>
<CommentsCount>0</CommentsCount>
<CompanyName>My Company</CompanyName>
..
</Incident>

 

The expected behavior is that tFileInputXML component extracts the Incident2017 value for the ID element. The problem is that the element values extracted by the tFileInputXML component are null values when executing a Spark Job.

 

When you remove the Active attribute of the Incident element, then the element values (here, Incident2017, FoundationTeam, and My Company) can be extracted correctly with the tFileInputXML component in a Spark job.

 

This issue does not occur when executing tFileInputXML component within a Standard Job.

Problem root cause 
Solution or WorkaroundThis issue is fixed with Talend 6.4.1 and 6.3.2 versions. For Talend 6.2.1, the issue is solved by applying patch Patch_20170522_TPS-1949_v1-6.3.1.zip.
JIRA ticket numberTBD-4903
Version history
Last update:
‎2024-02-09 01:22 PM
Updated by: