Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
I uploaded a file with ~10 million records to Cloud Data Preparation. It finds the patterns of a column based on 10,000 records. There may be many more patterns in the file, given the sample size. (I'm willing to wait.)
The docs mention a config file, which includes parameters such as:
dataset.records.limit=10000
dataset.local.file.size.limit=2000000000
How do I get access to those parameters in the cloud?
Hi,
Today, this parameter is not exposed in the Cloud version of Data Preparation. But you can rely on "fetch more" to have access to more patterns. See the product documentation for that part: https://help.talend.com/reader/jErhAENS5HA9L8lGuHSmsA/ZQ_OY~Bc3yVJ1izp9OgntQ
And just to give you a sense of what's coming in the roadmap, we plan to make some enhancements on that front end of this year/beg. of next year with more options for the sampling configuration.
Regards,
Patrick