Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hi Talend experts
I am reaching out to you for your help.
I have got words like this:
Bachelor of Business with a major in Accounting and Finance
Bachelor Of Business (Marketing)
I need to extract Accounting and Finance and Marketing value from second row.
What's the best way to do this?
Thanks
Harshal.
Hi Harshal,
Unfortunately, there is no direct way of doing this task. You will have to parse the data after Key words "Bachelor of Business" and then trim out the unwanted data. But the data after that can come in various format according to your use case.
with a major in Accounting and Finance
Accounting and Finance
Accounting + Finance
(Accounting and Finance)
I believe you are trying to read a free text column. So effectively it all depends on customer's imagination about writing the values. Some times they may even write in Acronyms like Phy, Chem etc. So the combinations is infinite. You will have to use some matching algorithm like Jaro Wrinkler to match the input data with standardized versions using Talend Matching components.
You can send all suspected and failed matches for a manual reference so that data can be parsed in both automatic and manual fashion.
Warm Regards,
Nikhil Thampi
Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved 🙂