Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik and ServiceNow Partner to Bring Trusted Enterprise Context into AI-Powered Workflows. Learn More!
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Not able to schedule big-data jobs(e.g.PIG,HIVE,M/R) using OOZIE

Hi,
I am very new to Talend. Is there a way to schedule Hadoop Jobs (e.g. HIVE, PIG etc.) using OOZIE plugin.
Can you please show me a example.
Regards,
Shouvanik

Labels (1)
49 Replies
Anonymous
Not applicable
Author

Hi,
For community version(Talend Open Studio for BigData), we support for OOZIE.
As a newbie, I encourage you to read the online manual of How+to+run+a+Job+on+a+remote+HDFS+server with four chapters How+to+set+HDFS+connection+details, How+to+run+a+Job+on+the+HDFS+server, How+to+schedule+the+executions+of+a+Job and How+to+monitor+Job+execution+status. Hope it is helpful for you.
Best regards
Sabrina
Anonymous
Not applicable
Author

Hi,
Are these tutorials for community edition? I have a job. I just want to schedule it using OOZIE. What are the steps. It will be good if you can give me the steps. I could not follow from the links you had shared.
Regards,
Shouvanik
Anonymous
Not applicable
Author

Hi,
It is community edition tutorials on talend help center. The references provide tips and tricks to teach you how to schedule Hadoop Jobs (e.g. HIVE, PIG etc.) using OOZIE, what's more, it is for free.
You can log in Talend Help Center to see the related references using your forum account. If you cannot log in, please read the notice forum Upcoming Maintenance.
Best regards
Sabrina
Anonymous
Not applicable
Author

Hi Sabrina,
Do we always have to create a connection to HDFS before scheduling a job? Can't we schedule any Talend job using OOZIE.
Like, I have a example job, which reads data from a fixed format input file and puts into HBASE.
Next, I export the job. Now, in OOZIE tab when I am clicking on run, I get the following error.

Deploying job to Hadoop...
Deployment failed!
Can not access Hadoop File System with user user!
Protocol org.apache.hadoop.hdfs.protocol.ClientProtocol version mismatch. (client = 61, server = 63)

Please help
Regards,
Shouvanik
Anonymous
Not applicable
Author

Hi
Before you can run or schedule executions of a Job on an HDFS server, you need first to define the HDFS connection details either in the Oozie scheduler view or in the studio preference settings, and specify the path where your Job will be deployed. see How to set HDFS connection details.
Shong
Anonymous
Not applicable
Author

Hi,
Please help me out with the above HDFS issue.
I am using Talend 5.2 Open Studio for Big data and its a community edition. It does not have all the features.
Please help me
Thanks,
Shouvanik
Anonymous
Not applicable
Author

Hi,
You need to open the View which is not open by default in the Studio.
Here is the procedure to open the Talend Oozie view :
In your Menu = Window > Show view > Talend Oozie.
Then the View is open and appears beside your RUN view; you can set up your connection to Oozie and then run.
(A valid PATH on your hadoop cluster need to exist).
Regards;
Christophe
Anonymous
Not applicable
Author

Hi Shouvanik,
To make it more clear, attach the screenshot in forum.
Best regards
Sabrina
0683p000009MAI2.jpg
Anonymous
Not applicable
Author

Hi,
@cantoine: Opening Talend OOZIE view window is not a problem here. The issue is: I want to schedule and run a Talend job using OOZIE workflow scheduler plugin provided with Talend Open studio (screenshot of the product attached.).
Is it possible to do that? Why do I have to connect to HDFS always remotely in order to do that.
@sabrina: I am attaching the screenshot of the job, which I created from the example given. Can I schedule it with OOZIE?

Moreover, when I am trying to schedule a job and click on RUN button, I get the following error.
Deploying job to Hadoop...
Deployment failed!
Can not access Hadoop File System with user root!
Protocol org.apache.hadoop.hdfs.protocol.ClientProtocol version mismatch. (client = 61, server = 63)
What should be going inside the field for PATH inside OOZIE view?
I suspect that the hadoop version running locally in my machine is of a incompatible version with respect to the version required by Talend.
Can you please tell me what Hadoop version is compatible with Talend Hadoop client version?

Please help me.
Regards,
Shouvanik
0683p000009MA54.jpg 0683p000009MAI7.jpg