Count of rows where a String column value exceeds threshold length
While profiling a delimited file before importing it into a database, I set an upper threshold on the maximum length indicator for a String column. The results show that at least one value exceeds the threshold, but how can I get a picture of the real size of the problem? That is, how can I get, in Talend Open Profiler, a count of all the rows where the column value is longer than my threshold?
Hi Alan,
You would need to define a Java user-defined indicator that counts these rows. A parameter table can be used to pass the threshold value to the indicator.
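To give an idea of the counting logic such an indicator would need, here is a minimal sketch in plain Java. It is not the Talend indicator API: the class name LengthThresholdCounter and both method names are hypothetical, and the code does not show how a user-defined indicator is registered or how it reads the parameter table. It also assumes a simple delimited file with no quoted fields containing the delimiter.

```java
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper class, not part of Talend; it only illustrates the counting rule.
final class LengthThresholdCounter {

    private LengthThresholdCounter() {
    }

    // Counts the non-null values whose length is strictly greater than the threshold.
    static long countLongerThan(Iterable<String> values, int threshold) {
        long count = 0;
        for (String value : values) {
            if (value != null && value.length() > threshold) {
                count++;
            }
        }
        return count;
    }

    // Counts the rows whose field at columnIndex (0-based) is longer than the threshold.
    // Assumption: fields are split naively on the delimiter, so quoted fields that
    // contain the delimiter are not handled. Rows with too few fields are skipped.
    static long countRowsExceeding(List<String> rows, String delimiter,
                                   int columnIndex, int threshold) {
        // Quote the delimiter so characters such as '|' are not treated as regex syntax.
        String splitPattern = Pattern.quote(delimiter);
        long count = 0;
        for (String row : rows) {
            // The -1 limit keeps trailing empty fields.
            String[] fields = row.split(splitPattern, -1);
            if (columnIndex < fields.length && fields[columnIndex].length() > threshold) {
                count++;
            }
        }
        return count;
    }
}
```

For example, calling LengthThresholdCounter.countRowsExceeding(rows, ";", 1, 50) would count the rows whose second field is longer than 50 characters. In a real indicator, the threshold would come from the parameter table rather than being passed in directly.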
Please see the documentation for how to do that.
Let me know if you need help.
Once your indicator is defined, you can share it on
http://www.talendforge.org/exchange/index.php so that you can easily retrieve it in any future release.