Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hi,
I have a requirement of loading around 10Million data into my vertica database With certain rules as below:
1. One record can span to multiple rows and they are identified using segments. Every record start at segment 0001 and ends at segment 0005. segment 0001 contains record for Table say A, segment 0002 contains record for Table B , segment 0003 for Table C, segment 0004 for Table D and 0005 for Table E.
2. using Col1, Col2 and current date (format yyyymmdd), I need to form a new value say PaymentID but this information is only in segment 003 or segment 0005
Since I am new to this tool, my approach to solve the problem is very naive. What I am trying to make it professional, fast and acceptable is to load all data in Staging/Temp table in vertica DB with a new col - paymentid (Already Done). Then use ELT components like tVerticaInput,tELTVerticaMap and tELTVerticaOutput or simple verticainput,join,map,verticaoutput to check and update the payment id for one record which falls between segment 0001 to 0005.
Problem is I am not able to use tVerticaInput to join with multiple tJoins. My question here to all the experts are:
1. What do you suggest the better approach to solve this problem?
2. How is it possible to use tVerticaInput to join with multiple tJoins components?
3. How do I monitor the progress of a job I mean is it possible to log the progress of the job in some table?
Appreciate all the help and suggestion.
Onkar
A1CG0000000053975482 00331 0002 00331Hmsinfortel PO Box245875
A2CG0000000053975482 0033100331.4.-902200605.110115-145-19 0003 00331DOUGLAS MICHEAL
A3CG0000000053975482 0033100331.4.-902200605.1110115-145-19 00041 000198941
A4CG0000000053975482 00331 0005 028 MAXIMUM NUMBER