gt0731
Contributor III

Greenplum - gpload process error

We have a small Greenplum cluster and are trying to run a merge operation using the tGreenplumGPLoad component, but the job fails with the error shown below.
Environment details (OS):

Talend server - Windows Server 2012
Greenplum cluster - CentOS 7
Hadoop cluster - CentOS 7







The job fails with the following error:
Exception in thread "Thread-1" java.lang.RuntimeException: Cannot run program "gpload": CreateProcess error=2, The system cannot find the file specified
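
The message above says that Windows could not find an executable named gpload when the job tried to start it. One way to confirm this outside of Talend is to check whether gpload resolves on the search path of the machine that runs the job. The following is a minimal sketch, not part of the original job; the function name `locate_tool` and the optional `extra_dirs` argument are hypothetical, and it only assumes that the loader client, if installed, places a gpload launcher in some directory.

```python
import os
import shutil


def locate_tool(name, extra_dirs=()):
    """Return the full path of `name` if it is found, else None.

    `extra_dirs` (hypothetical helper argument) lists directories to
    search before the ones already on PATH, e.g. a loader client's
    bin directory.
    """
    search_path = os.pathsep.join([*extra_dirs, os.environ.get("PATH", "")])
    # shutil.which applies the platform's lookup rules (PATHEXT on Windows).
    return shutil.which(name, path=search_path)


if __name__ == "__main__":
    found = locate_tool("gpload")
    if found is None:
        print("gpload is not on PATH for this user/session")
    else:
        print("gpload resolved to:", found)
```

If this prints that gpload is not on PATH when run on the Talend server, the Java error is expected, because the job asks the operating system to launch a program it cannot locate.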

Screenshot of the error (attached):
0683p000009MDUP.png

Job flow settings on the tGreenplumGPLoad component:
0683p000009MDHj.png
0683p000009MDUU.png
The gpfdist program is running on the Greenplum master host:
$ ps -A | grep gpfdist
20071 pts/0    00:00:00 gpfdist
$


Do I need to copy the file from the local Windows machine on which the Talend job runs to the remote Linux server that hosts the Greenplum master? Any suggestions on my current data flow would be a great help.
Current data flow:
                                       tgreenplumconnection
                                      |
Read from SQL server -->hdfs -->tmap-->tgreenplumGPload -->tgreenplumCommit
Q1: How do I get the source HDFS data into the directory served by gpfdist on the Greenplum side, so that the gpload merge operation can use it? We cannot use gphdfs because the goal is a gpload merge operation. Please suggest any alternative way to do this.

I checked that gpload runs successfully when executed directly on the Greenplum server:
$ gpload -f gpload.yml
2017-02-25 20:20:48|INFO|gpload session started 2017-02-25 20:20:48
2017-02-25 20:20:48|INFO|started gpfdist -p 8081 -P 8082 -f "/home/gpadmin/demo/gp_RevenueReport_stg0.txt" -t 30
2017-02-25 20:20:48|INFO|running time: 0.20 seconds
2017-02-25 20:20:48|INFO|rows Inserted          = 0
2017-02-25 20:20:48|INFO|rows Updated           = 3
2017-02-25 20:20:48|INFO|data formatting errors = 0
2017-02-25 20:20:48|INFO|gpload succeeded
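
The log above uses a pipe-delimited `timestamp|level|message` layout, with the row counters written as `name = value`. As a minimal sketch of how such output could be checked programmatically after a run, the following parses the lines shown in this post. It assumes only the layout visible above; the function name `parse_gpload_log` is hypothetical, and other gpload versions may log differently.

```python
SAMPLE_LOG = """\
2017-02-25 20:20:48|INFO|gpload session started 2017-02-25 20:20:48
2017-02-25 20:20:48|INFO|running time: 0.20 seconds
2017-02-25 20:20:48|INFO|rows Inserted          = 0
2017-02-25 20:20:48|INFO|rows Updated           = 3
2017-02-25 20:20:48|INFO|data formatting errors = 0
2017-02-25 20:20:48|INFO|gpload succeeded
"""


def parse_gpload_log(text):
    """Extract the success flag and integer counters from gpload output."""
    summary = {"succeeded": False, "counters": {}}
    for line in text.splitlines():
        # Split into timestamp, level, message; skip lines in another shape.
        parts = line.strip().split("|", 2)
        if len(parts) != 3:
            continue
        message = parts[2].strip()
        if message == "gpload succeeded":
            summary["succeeded"] = True
        elif "=" in message:
            key, _, value = message.partition("=")
            value = value.strip()
            if value.isdigit():
                summary["counters"][key.strip()] = int(value)
    return summary


if __name__ == "__main__":
    print(parse_gpload_log(SAMPLE_LOG))
```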

Main cause:
The Greenplum database server (Linux) is remote to the Talend ETL server (Windows), so the error occurs when I run the job from the Windows server. Also, I am not able to configure the tGreenplumGPLoad component.




Any help would be much appreciated. Thanks in advance.
11 Replies
Anonymous
Not applicable

Hello,

Did you install the greenplum-loader client successfully on the Studio machine, and did you execute the job with the gpload path specified in the component properties?

Best regards

Sabrina
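
To illustrate what specifying the gpload path amounts to, here is a minimal sketch that launches gpload by an explicit full path instead of relying on PATH, using the same `-f` control-file option seen in the original post. The function names and any example paths are hypothetical; the actual install location and launcher file name depend on the loader client package.

```python
import subprocess
from pathlib import Path


def build_gpload_command(gpload_path, control_file):
    """Build the argument list for `gpload -f <control file>`."""
    return [str(gpload_path), "-f", str(control_file)]


def run_gpload(gpload_path, control_file):
    """Run gpload from an explicit location and return the finished process.

    Raises FileNotFoundError early, with a clear message, if the given
    path does not point to an existing file.
    """
    if not Path(gpload_path).is_file():
        raise FileNotFoundError(f"gpload launcher not found at: {gpload_path}")
    return subprocess.run(
        build_gpload_command(gpload_path, control_file),
        capture_output=True,  # keep stdout/stderr for inspection
        text=True,
        check=False,  # let the caller decide how to treat a non-zero exit
    )
```

Checking the path before launching turns the opaque "cannot find the file specified" failure into a message that names the exact location that was tried.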

Anonymous
Not applicable

Hi Sabrina,

 

I don't know about the Greenplum loader installation for Talend.

If you know, could you share more details and a download link?

If the drivers were missing in Talend, I would expect the Greenplum loader component to be marked in red, but I am not seeing any error on that component in Talend.

 

Thanks

Ramesh