<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Issues with tExtractJSONFields Component xPath v jsonPath in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321134#M91202</link>
    <description>Good Morning All, 
&lt;BR /&gt;I'm having some issues processing data from an API. 
&lt;BR /&gt;I make a call to a URL for example: 
&lt;BR /&gt; 
&lt;A href="https://webresint.zendesk.com/api/v2/incremental/tickets.json?include=metric_sets%2Ccomment_count&amp;amp;start_time=1437728629" target="_blank" rel="nofollow noopener noreferrer"&gt;https://webresint.zendesk.com/api/v2/incremental/tickets.json?include=metric_sets%2Ccomment_count&amp;amp;start_time=1437728629&lt;/A&gt; 
&lt;BR /&gt;and i get back a json object. 
&lt;BR /&gt;I make the call using a tRESTClient component, and parse the json using a tExtractJSONFields component. &amp;nbsp;If i stop here i do not get an error. 
&lt;BR /&gt;If i try to connect the tExtractJSONFields component to anything else (tMap or tLogRow for example) i receive the error: 
&lt;BR /&gt;Invalid white space character (0x1) in text to output (in xml 1.1, could output as a character entity) 
&lt;BR /&gt;I've tried all the debug run options and traces and log catchers, but this is the only message i ever see. 
&lt;BR /&gt;In case it is important, the tExtractJSONFields component is configured using XPath. &amp;nbsp;Also worth noting that when i call a different timeframe the job works perfectly to i know its a data issue. 
&lt;BR /&gt;The problem being that as this is an external company providing the data i cannot 'fix' the data, so rather i need to implement something within Talend to overcome the issue. 
&lt;BR /&gt;I have googled extensively and i canot find anything Talend specific for this. &amp;nbsp;The best i have seen is to use a regex function to remove the foreign characters, but this doesn't really seem to be a solution - rather a workaround. 
&lt;BR /&gt;I am using TOS 5.6. 
&lt;BR /&gt;Thanks in advance, 
&lt;BR /&gt;Scott 
&lt;BR /&gt;Job Screenshots: 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6W.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/146922iB4BC102AD12D0971/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6W.png" alt="0683p000009MB6W.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAmZ.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156762i3A240BDDED3E563C/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAmZ.png" alt="0683p000009MAmZ.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6b.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150517i86242290DF8B0D94/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6b.png" alt="0683p000009MB6b.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAta.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137986i913D397F1D13DCF9/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAta.png" alt="0683p000009MAta.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6g.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/132517iFA139BEA825C7E1B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6g.png" alt="0683p000009MB6g.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6l.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/152858i0938D7C6DFA1E148/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6l.png" alt="0683p000009MB6l.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6v.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/153428i111901697550E885/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6v.png" alt="0683p000009MB6v.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAuO.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143416iA827887EEB6AFB93/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAuO.png" alt="0683p000009MAuO.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAuO.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143416iA827887EEB6AFB93/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAuO.png" alt="0683p000009MAuO.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB75.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/155263iC10B3D6A362EB488/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB75.png" alt="0683p000009MB75.png" /&gt;&lt;/span&gt;</description>
    <pubDate>Fri, 22 Jul 2016 09:53:22 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-07-22T09:53:22Z</dc:date>
    <item>
      <title>Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321134#M91202</link>
      <description>Good Morning All, 
&lt;BR /&gt;I'm having some issues processing data from an API. 
&lt;BR /&gt;I make a call to a URL for example: 
&lt;BR /&gt; 
&lt;A href="https://webresint.zendesk.com/api/v2/incremental/tickets.json?include=metric_sets%2Ccomment_count&amp;amp;start_time=1437728629" target="_blank" rel="nofollow noopener noreferrer"&gt;https://webresint.zendesk.com/api/v2/incremental/tickets.json?include=metric_sets%2Ccomment_count&amp;amp;start_time=1437728629&lt;/A&gt; 
&lt;BR /&gt;and i get back a json object. 
&lt;BR /&gt;I make the call using a tRESTClient component, and parse the json using a tExtractJSONFields component. &amp;nbsp;If i stop here i do not get an error. 
&lt;BR /&gt;If i try to connect the tExtractJSONFields component to anything else (tMap or tLogRow for example) i receive the error: 
&lt;BR /&gt;Invalid white space character (0x1) in text to output (in xml 1.1, could output as a character entity) 
&lt;BR /&gt;I've tried all the debug run options and traces and log catchers, but this is the only message i ever see. 
&lt;BR /&gt;In case it is important, the tExtractJSONFields component is configured using XPath. &amp;nbsp;Also worth noting that when i call a different timeframe the job works perfectly to i know its a data issue. 
&lt;BR /&gt;The problem being that as this is an external company providing the data i cannot 'fix' the data, so rather i need to implement something within Talend to overcome the issue. 
&lt;BR /&gt;I have googled extensively and i canot find anything Talend specific for this. &amp;nbsp;The best i have seen is to use a regex function to remove the foreign characters, but this doesn't really seem to be a solution - rather a workaround. 
&lt;BR /&gt;I am using TOS 5.6. 
&lt;BR /&gt;Thanks in advance, 
&lt;BR /&gt;Scott 
&lt;BR /&gt;Job Screenshots: 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6W.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/146922iB4BC102AD12D0971/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6W.png" alt="0683p000009MB6W.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAmZ.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156762i3A240BDDED3E563C/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAmZ.png" alt="0683p000009MAmZ.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6b.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150517i86242290DF8B0D94/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6b.png" alt="0683p000009MB6b.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAta.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137986i913D397F1D13DCF9/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAta.png" alt="0683p000009MAta.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6g.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/132517iFA139BEA825C7E1B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6g.png" alt="0683p000009MB6g.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6l.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/152858i0938D7C6DFA1E148/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6l.png" alt="0683p000009MB6l.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB6v.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/153428i111901697550E885/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB6v.png" alt="0683p000009MB6v.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAuO.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143416iA827887EEB6AFB93/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAuO.png" alt="0683p000009MAuO.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MAuO.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143416iA827887EEB6AFB93/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MAuO.png" alt="0683p000009MAuO.png" /&gt;&lt;/span&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MB75.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/155263iC10B3D6A362EB488/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MB75.png" alt="0683p000009MB75.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Fri, 22 Jul 2016 09:53:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321134#M91202</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-22T09:53:22Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321135#M91203</link>
      <description>Thought it worth mentioning: 
&lt;BR /&gt;I have tested processing the same json streams in other tools and loading them into the database without any issues. 
&lt;BR /&gt;Therefore i do not believe this to be a general data quality issue but rather a Talend Handling Issue. 
&lt;BR /&gt;My research suggests that the tExtractJSONFields component is parsing the data from the json and compiling it into an xml output using jaxen-1.1.1.jar. 
&lt;BR /&gt;I believe that this version of jaxen uses an older xml protocol and i should be using a newer version. 
&lt;BR /&gt;i have located and downloaded jaxen-1.1.4.jar and added this into Talend - but the Modules pane does not appear to have updated. 
&lt;BR /&gt;not quite sure where to go from here as this is clearly Talend related but its not apparent that there is a way to circumvent this. 
&lt;BR /&gt;I dont want to have to move away from Talend after investing significant time in getting it to this point in our organization.</description>
      <pubDate>Fri, 22 Jul 2016 16:04:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321135#M91203</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-22T16:04:04Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321136#M91204</link>
      <description>Could really do with help on this please...</description>
      <pubDate>Mon, 25 Jul 2016 08:35:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321136#M91204</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-25T08:35:54Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321137#M91205</link>
      <description>so if i change the tExtractJSONFields component from xPath to jsonPath the error goes away...
&lt;BR /&gt;This changes my question...
&lt;BR /&gt;so a sample json is:
&lt;BR /&gt;{
&lt;BR /&gt;&amp;nbsp; tickets: ,
&lt;BR /&gt;&amp;nbsp; Count: 1000,
&lt;BR /&gt;&amp;nbsp; next_page:
&lt;A href="https://webresint.zendesk.com/api/v2/incremental/tickets.json?start_time=1434370040" target="_blank" rel="nofollow noopener noreferrer"&gt;"https://webresint.zendesk.com/api/v2/incremental/tickets.json?start_time=1434370040"&lt;/A&gt;,
&lt;BR /&gt;&amp;nbsp; end_time: 1434370040
&lt;BR /&gt;}
&lt;BR /&gt;Using xPath i was able to set the Loop Path as tickets but use the ../count query to refer to a higher level.
&lt;BR /&gt;Using jsonPath if i set the Loop query to $.tickets.* then i cannot access the count value according to&amp;nbsp;
&lt;A href="http://goessner.net/articles/JsonPath/" rel="nofollow noopener noreferrer"&gt;http://goessner.net/articles/JsonPath/&lt;/A&gt; as the parent operator is not applicable to jsonPath.
&lt;BR /&gt;I have tried using the json query $.count but this does not return a value for me.
&lt;BR /&gt;And of course if i change the Loop path to $.* i only get the first ticket in the array and not all of them...
&lt;BR /&gt;The big issue i have with this is using other tools i can get this to work without any issues - i don't understand why this is so difficult in Talend...</description>
      <pubDate>Mon, 25 Jul 2016 10:08:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321137#M91205</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-25T10:08:01Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321138#M91206</link>
      <description>still struggling with this and need assistance</description>
      <pubDate>Mon, 25 Jul 2016 13:31:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321138#M91206</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-25T13:31:00Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with tExtractJSONFields Component xPath v jsonPath</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321139#M91207</link>
      <description>ive even tried upgrading to v6.2, but this upgrade has broken everything (see other ticket).
&lt;BR /&gt;Guess i have no choice but to move to a different tool - very disappointing</description>
      <pubDate>Tue, 26 Jul 2016 08:08:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Issues-with-tExtractJSONFields-Component-xPath-v-jsonPath/m-p/2321139#M91207</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-26T08:08:58Z</dc:date>
    </item>
  </channel>
</rss>

