<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Temp path for spark jars missing a slash? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Temp-path-for-spark-jars-missing-a-slash/m-p/2376990#M138979</link>
    <description>I'm trying to run a Spark batch job on an EMR cluster, using Data Fabric 6.1.1 
&lt;BR /&gt;Jobs are being started on the cluster, but failing with an error like 
&lt;BR /&gt; 
&lt;PRE&gt;java.io.FileNotFoundException: File file:/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar&lt;/PRE&gt; 
&lt;BR /&gt;The job name is reads3 in this case. From the debug logs, I can see it trying to pass this file to the cluster as a&amp;nbsp;-Dspark.jars= argument (along with several other files in this directory): 
&lt;BR /&gt; 
&lt;PRE&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; SPARK_YARN_CACHE_FILES -&amp;gt; file:/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar#__spark__.jar&lt;BR /&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; resources:&lt;BR /&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; __spark__.jar -&amp;gt; resource { scheme: "file" port: -1 file: "/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar" } size: 183166734 timestamp: 1467949359000 type: FILE visibility: PRIVATE&lt;/PRE&gt; 
&lt;BR /&gt;The key point here appears to be that it's expecting these directories: 
&lt;BR /&gt; 
&lt;PRE&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib&lt;BR /&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/reads3&lt;/PRE&gt; 
&lt;BR /&gt;But I find the following directories have been created (though they are mostly empty): 
&lt;BR /&gt; 
&lt;PRE&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3lib&lt;BR /&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3reads3&lt;/PRE&gt; 
&lt;BR /&gt;It looks like Talend is not putting a trailing slash on the temporary directory name before appending the subdirectory, but I'm not sure where I would go to fix this. It's possible that it then also fails to decompress the archive there, but the directory names seem like the first issue. I tried creating the directory and extracting the archive by hand: 
&lt;BR /&gt; 
&lt;PRE&gt;mkdir ./Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/&lt;BR /&gt;tar -xC ./Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/ -f ./workspace/.Java/target/reads3_0_1.tar.gz&lt;/PRE&gt; 
&lt;BR /&gt;This puts the required jar file in the right place. I then ran the job again, but it looks like Talend removes this directory when it builds, though it leaves the reads3lib and reads3reads3 directories.</description>
    <pubDate>Sat, 16 Nov 2024 10:31:33 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T10:31:33Z</dc:date>
    <item>
      <title>Temp path for spark jars missing a slash?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Temp-path-for-spark-jars-missing-a-slash/m-p/2376990#M138979</link>
      <description>I'm trying to run a Spark batch job on an EMR cluster, using Data Fabric 6.1.1 
&lt;BR /&gt;Jobs are being started on the cluster, but failing with an error like 
&lt;BR /&gt; 
&lt;PRE&gt;java.io.FileNotFoundException: File file:/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar&lt;/PRE&gt; 
&lt;BR /&gt;The job name is reads3 in this case. From the debug logs, I can see it trying to pass this file to the cluster as a&amp;nbsp;-Dspark.jars= argument (along with several other files in this directory): 
&lt;BR /&gt; 
&lt;PRE&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; SPARK_YARN_CACHE_FILES -&amp;gt; file:/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar#__spark__.jar&lt;BR /&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; resources:&lt;BR /&gt;: org.apache.spark.deploy.yarn.Client - &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; __spark__.jar -&amp;gt; resource { scheme: "file" port: -1 file: "/...&amp;lt;local_path_to_Talend&amp;gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib/talend-spark-assembly-1.4.1-hadoop2.6.0-amzn-0.jar" } size: 183166734 timestamp: 1467949359000 type: FILE visibility: PRIVATE&lt;/PRE&gt; 
&lt;BR /&gt;The key point here appears to be that it's expecting these directories: 
&lt;BR /&gt; 
&lt;PRE&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/lib&lt;BR /&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/reads3&lt;/PRE&gt; 
&lt;BR /&gt;But I find the following directories have been created (though they are mostly empty): 
&lt;BR /&gt; 
&lt;PRE&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3lib&lt;BR /&gt;.../Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3reads3&lt;/PRE&gt; 
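&lt;BR /&gt;The two listings above differ only by a missing path separator. As a rough sketch of how that could arise (the variable names are hypothetical and not taken from Talend's code), plain string concatenation reproduces the names I'm seeing: 

```shell
# Hypothetical names for illustration only; not taken from Talend's code.
job_dir="temp/reads3"   # per-job temp directory, stored without a trailing slash
sub_dir="lib"           # subdirectory the jars are expected in

# Plain concatenation drops the separator and yields temp/reads3lib
broken_path="${job_dir}${sub_dir}"

# Inserting the separator explicitly yields temp/reads3/lib
expected_path="${job_dir}/${sub_dir}"

echo "created:  ${broken_path}"
echo "expected: ${expected_path}"
```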
&lt;BR /&gt;It looks like Talend is not putting a trailing slash on the temporary directory name before appending the subdirectory, but I'm not sure where I would go to fix this. It's possible that it then also fails to decompress the archive there, but the directory names seem like the first issue. I tried creating the directory and extracting the archive by hand: 
&lt;BR /&gt; 
&lt;PRE&gt;mkdir ./Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/&lt;BR /&gt;tar -xC ./Talend-Studio-macosx-cocoa.app/Contents/MacOS/temp/reads3/ -f ./workspace/.Java/target/reads3_0_1.tar.gz&lt;/PRE&gt; 
&lt;BR /&gt;This puts the required jar file in the right place. I then ran the job again, but it looks like Talend removes this directory when it builds, though it leaves the reads3lib and reads3reads3 directories.</description>
      <pubDate>Sat, 16 Nov 2024 10:31:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Temp-path-for-spark-jars-missing-a-slash/m-p/2376990#M138979</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:31:33Z</dc:date>
    </item>
    <item>
      <title>Re: Temp path for spark jars missing a slash?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Temp-path-for-spark-jars-missing-a-slash/m-p/2376991#M138980</link>
      <description>Hi&amp;nbsp;
&lt;FONT color="#5b5b5d"&gt;&lt;FONT size="2"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Mark.Nettle,&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Is your network running well?&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT color="#5b5b5d"&gt;&lt;FONT size="2"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Could you please report a ticket on Talend Support Portal for your&amp;nbsp;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/FONT&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Data Fabric 6.1.1 product. In this way, we can give you a remote assistance on this issue to see if it is a bug through support cycle with priority.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Sabrina&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Mon, 11 Jul 2016 04:19:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Temp-path-for-spark-jars-missing-a-slash/m-p/2376991#M138980</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-07-11T04:19:58Z</dc:date>
    </item>
  </channel>
</rss>

