Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

How to stream ORACLE data to AWS S3 ?

How to stream ORACLE data to AWS S3

 

Hi All,

 

My company is doing POC on using Talend to load data to AWS (S3,Redshift).

I am completely new to Talend  and I am looking for a possibility to stream Oracle transactional data to S3.

 

Could anyone advices best method (product) for doing this with Talend ( any of products currently available from Talend)?

 

Regards,

 

Wojtek

Labels (4)
3 Replies
vapukov
Master II
Master II

easy

 

first of all as new - install Talend and import Demo project, You will have a lot of examples (not for S3, but this is just details)

 

in simplest case, You will need only 4 component:

- tOracleInput

- tMap

- tCSVOutputDelimited

- tS3Put

as much more complicated logic --> as much more complicated would be final Job or Project

 

check series of articles - Talend Best Practice (parts 1-4)
https://www.talend.com/blog/2017/05/05/data-model-design-best-practices-part-1/

Anonymous
Not applicable
Author

 

Hi Vapukov,

 

First of all thanks for your answer.

I already did some tutorials but I still consider myself as a total beginner so I will definitely follow up on those best practices you send me.

 

As I understand those components will build kind of pull mechanism (with delta detection using tMap) while I need data to be pushed asap change occurs in source db like in case of Oracle CDC.

 

Probably I can build flow with bellow logic:

In the loop read CDC components save to CSV file and and send changes to S3 but I wonder if there are better ways of doing this using Talend?

 

Regards, 

 

Wojtek

vapukov
Master II
Master II

Probably I can build flow with bellow logic:
In the loop read CDC components save to CSV file and and send changes to S3 but I wonder if there are better ways of doing this using Talend?

not so easy

If data only new (incremental loading) - yes, and not only by CDC

 

but CDC mean not only new data, but as well - UPDATES and DELETE, so logic would be more complicated