FirstValue should have a lower RAM memory consumption
Hello all,
FirstValue is a very useful aggregation function to take the first result of your data set. However, I had today a big issue with this function.
Context : 256 GB RAM Server with 16 CPU 67 millions row First Value on 8 fields (going from 67 millions row to 60 millions row)
The reload takes more than half of RAM !!
here is my code :
NoConcatenate
FACT_Tx_Article_Histo :
LOAD
article_reference,
article_date,
text(FirstValue(ddj)) as ddj,
num(FirstValue(pmp)) as pmp,
num(FirstValue(taux_depreciation)) as taux_depreciation,
text(FirstValue(enjeu_commercial)) as enjeu_commercial,
//v1.3.0.0 - ajout de 5 champs
text(FirstValue(modele_code)) as modele_code,
text(FirstValue(article_ean13)) as article_ean13, //ok
text(FirstValue(modele_description)) as modele_description, //ok
num(FirstValue(article_pcb)) as article_pcb,
text(FirstValue(article_statut)) as article_statut,
num(FirstValue(supplier_type)) as supplier_type //plante si on le met en place sur l'ensemble de la volumétrie mais pas sur un échantillon... pourquoi?
Resident TMP_FACT_Tx_Article_Histo_Step1
group by
article_date,
article_reference
order by article_date,article_reference
,supplier_type
;
It seems to have a threshold effect since if I don't take supplier_type on, it works perfectly fine ( num type field with only two value 1 and 2) .
Best regards,
Simon
Bi Consultant (Dataviz & Dataprep) @ Business & Decision
NOTE: Upon clicking this link 2 tabs may open - please feel free to close the one with a login page. If you only see 1 tab with the login page, please try clicking this link first: Authenticate me! then try the link above again. Ensure pop-up blocker is off.