A Talend 6.3.1 Spark Job uses a tFileInputXML component to extract element values (for example, ID) from an element (Incident) that carries an attribute (Active) in an XML document:

<Incident Active="true">
<ID>Incident2017</ID>
<AssignmentGroup>FoundationTeam</AssignmentGroup>
<CommentsCount>0</CommentsCount>
<CompanyName>My Company</CompanyName>
..
</Incident>

The expected behavior is that the tFileInputXML component extracts the value Incident2017 for the ID element. Instead, when the Job runs as a Spark Job, every element value extracted by the tFileInputXML component is null. If you remove the Active attribute from the Incident element, the same Spark Job extracts the element values (here, Incident2017, FoundationTeam, and My Company) correctly. The issue does not occur when the tFileInputXML component runs in a Standard Job.
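
To confirm the values the component should return, the sample can be checked outside Talend with a standard XML parser. The following is a minimal sketch in Python using only the standard library. It is not how tFileInputXML works internally, and the names extract_fields, FIELDS, WITH_ATTRIBUTE, and WITHOUT_ATTRIBUTE are hypothetical helpers introduced for illustration. The elided part of the sample (the ".." line) is left out, so only the four elements shown above are read.

```python
import xml.etree.ElementTree as ET

# Sample from the article, limited to the elements that are actually shown.
WITH_ATTRIBUTE = """<Incident Active="true">
<ID>Incident2017</ID>
<AssignmentGroup>FoundationTeam</AssignmentGroup>
<CommentsCount>0</CommentsCount>
<CompanyName>My Company</CompanyName>
</Incident>"""

# Same document with the Active attribute removed, for comparison.
WITHOUT_ATTRIBUTE = WITH_ATTRIBUTE.replace(' Active="true"', "")

# Child elements to read from the Incident element (illustrative list).
FIELDS = ["ID", "AssignmentGroup", "CommentsCount", "CompanyName"]


def extract_fields(xml_text):
    """Return a dict mapping each name in FIELDS to its element text."""
    root = ET.fromstring(xml_text)  # root is the Incident element
    return {name: root.findtext(name) for name in FIELDS}


with_attr = extract_fields(WITH_ATTRIBUTE)
without_attr = extract_fields(WITHOUT_ATTRIBUTE)

print(with_attr)
# An attribute on the parent element should not change the child values.
print(with_attr == without_attr)
```

Both variants yield the same child values here, which matches the expected behavior described above: the Active attribute on Incident should have no effect on the extracted element values.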