Hi Jerry
I'm not sure how far you've gotten with your use-case, but it could be an example with Talend Presales might need to get involved. With TOS4BigData you need to use Pig and Pig UDFs, or one of the existing components. With Talend Enterprise Big Data however we have a number of extensions (through MapReduce and custom functions), which might be required for your immediate need.
Cheers
We have done some POCs using Hadoop/Talend with Hive and MR using Talend. Some of the use cases are:
-Slow Changing Dimensions (SCD)
-Change Data capture
-Merge Statements
-Lookups
-Draining Data from Queues and parsing complex xmls etc.
- Logfiles and Error Handling
- Data Pattern Matching across historical transactions.
-Aggregations
etc.
For most of the cases Hive is used. For some of them MapReduce code is used. In general Hive is found to be easier to support