Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hello,
In a file that the loading is almost 30 minutes i have been asked to add two tables.
¿there is a way to make the loading smaller with just a few lines so i can see if the loading i am making is correct?
thank you
if you want to verify the script without syntax errors then use Debug with limited records
if already data model is created and you want to add one more extra table you can choose Add Load/Partial Reload
Add load:
Simply put, this statement blindly appends data from one table to the data of another table, having similar signature, during partial reload. It does not check for any duplicate. Hence, ADD LOAD or ADD SELECT is usually follwed by distinct or a proper where clause.
e.g.
LOAD OrderID, OrderAmt from Order_May.csv;
ADD LOAD OrderID, OrderAmt from Order_June.csv;
This will simply concate data from Order_June to Order_May. But OrderID might be duplicated. Hence, this statement can be properly shaped to remove duplicate data as:
LOAD OrderID, OrderAmt from Order_May.csv;
ADD LOAD OrderID, OrderAmt from Order_June.csv Where Not Exists(OrderID);
The best way to do this is to use the 'debug' option in the edit script dialog, this way you can set it to load only a few lines
if you want to verify the script without syntax errors then use Debug with limited records
if already data model is created and you want to add one more extra table you can choose Add Load/Partial Reload
Add load:
Simply put, this statement blindly appends data from one table to the data of another table, having similar signature, during partial reload. It does not check for any duplicate. Hence, ADD LOAD or ADD SELECT is usually follwed by distinct or a proper where clause.
e.g.
LOAD OrderID, OrderAmt from Order_May.csv;
ADD LOAD OrderID, OrderAmt from Order_June.csv;
This will simply concate data from Order_June to Order_May. But OrderID might be duplicated. Hence, this statement can be properly shaped to remove duplicate data as:
LOAD OrderID, OrderAmt from Order_May.csv;
ADD LOAD OrderID, OrderAmt from Order_June.csv Where Not Exists(OrderID);