Parikhharshal
Creator III

Extract string

Hi Talend experts

 

I am reaching out to you for your help.

 

I have values like these:

 

Bachelor of Business with a major in Accounting and Finance

Bachelor Of Business (Marketing)

 

I need to extract the values "Accounting and Finance" and "Marketing" from these rows.

 

What's the best way to do this?

 

Thanks

Harshal.

1 Reply
Anonymous
Not applicable

Hi Harshal,

 

Unfortunately, there is no direct way to do this. You will have to parse the text that follows the keywords "Bachelor of Business" and then trim out the unwanted parts. However, the text after the keywords can come in various formats depending on your use case, for example:

 

with a major in Accounting and Finance

Accounting and Finance

Accounting + Finance

(Accounting and Finance)
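
For the variants listed above, the "parse after the keywords, then trim" approach could be sketched in plain Java along these lines. This is only an illustration under stated assumptions: the class name MajorExtractor and the method extractMajor are hypothetical, the prefix is assumed to always be "Bachelor of Business" (in any letter case), and "+" is assumed to mean "and". It will not cope with free text outside these patterns.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Talend: strips the known prefix and tidies the rest.
class MajorExtractor {

    // Assumed prefix: "Bachelor of Business", optionally followed by "with a major in".
    // CASE_INSENSITIVE so that "Bachelor Of Business" matches as well.
    private static final Pattern PREFIX = Pattern.compile(
            "^\\s*Bachelor\\s+of\\s+Business\\b\\s*(?:with\\s+a\\s+major\\s+in\\b)?\\s*",
            Pattern.CASE_INSENSITIVE);

    // Returns the text after the prefix, or null when the prefix is missing
    // or nothing follows it (such rows can be routed to manual review).
    static String extractMajor(String degree) {
        if (degree == null) {
            return null;
        }
        Matcher m = PREFIX.matcher(degree);
        if (!m.find()) {
            return null;
        }
        String rest = degree.substring(m.end()).trim();

        // Remove one pair of surrounding parentheses, e.g. "(Marketing)".
        if (rest.startsWith("(") && rest.endsWith(")")) {
            rest = rest.substring(1, rest.length() - 1).trim();
        }

        // Assumption: "Accounting + Finance" means "Accounting and Finance".
        rest = rest.replaceAll("\\s*\\+\\s*", " and ");

        return rest.isEmpty() ? null : rest;
    }
}
```

In a Talend job, a static method like this could live in a user routine and be called from a tMap expression, for example MajorExtractor.extractMajor(row1.degree), where row1.degree is a placeholder column name.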

 

I believe you are trying to read a free-text column, so effectively it all depends on how the customer chooses to write the values. Sometimes they may even use abbreviations such as Phy or Chem, so the number of possible combinations is effectively unlimited. You will have to use a matching algorithm such as Jaro-Winkler to match the input data against standardized versions, using the Talend matching components.
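
To give a feel for what Jaro-Winkler matching against standardized values does, here is a minimal plain-Java sketch. It is not the implementation used by the Talend matching components; the class name FuzzyMajorMatcher, the method names, and the threshold are all assumptions made for illustration, with the usual prefix scale of 0.1 and a prefix length capped at 4.

```java
import java.util.List;

// Hypothetical illustration of Jaro-Winkler matching; not Talend's own code.
class FuzzyMajorMatcher {

    // Jaro similarity in [0, 1]: based on matching characters and transpositions.
    static double jaro(String s1, String s2) {
        if (s1.equals(s2)) return 1.0;
        int len1 = s1.length(), len2 = s2.length();
        if (len1 == 0 || len2 == 0) return 0.0;

        // Characters count as matching only within this distance of each other.
        int window = Math.max(0, Math.max(len1, len2) / 2 - 1);
        boolean[] matched1 = new boolean[len1];
        boolean[] matched2 = new boolean[len2];
        int matches = 0;
        for (int i = 0; i < len1; i++) {
            int lo = Math.max(0, i - window);
            int hi = Math.min(len2 - 1, i + window);
            for (int j = lo; j <= hi; j++) {
                if (!matched2[j] && s1.charAt(i) == s2.charAt(j)) {
                    matched1[i] = true;
                    matched2[j] = true;
                    matches++;
                    break;
                }
            }
        }
        if (matches == 0) return 0.0;

        // Count matched characters that appear in a different order.
        int outOfOrder = 0;
        int k = 0;
        for (int i = 0; i < len1; i++) {
            if (!matched1[i]) continue;
            while (!matched2[k]) k++;
            if (s1.charAt(i) != s2.charAt(k)) outOfOrder++;
            k++;
        }
        double m = matches;
        return (m / len1 + m / len2 + (m - outOfOrder / 2.0) / m) / 3.0;
    }

    // Jaro-Winkler: boosts the Jaro score for a shared prefix of up to 4 characters.
    static double jaroWinkler(String s1, String s2) {
        double j = jaro(s1, s2);
        int prefix = 0;
        int max = Math.min(4, Math.min(s1.length(), s2.length()));
        while (prefix < max && s1.charAt(prefix) == s2.charAt(prefix)) prefix++;
        return j + prefix * 0.1 * (1.0 - j);
    }

    // Returns the best-scoring standard value, or null when no candidate reaches
    // the threshold (such rows would go to manual review).
    static String bestMatch(String input, List<String> standards, double threshold) {
        String best = null;
        double bestScore = threshold;
        for (String candidate : standards) {
            double score = jaroWinkler(input.toLowerCase(), candidate.toLowerCase());
            if (score >= bestScore) {
                bestScore = score;
                best = candidate;
            }
        }
        return best;
    }
}
```

For example, with the standardized list "Accounting and Finance", "Marketing", "Physics", "Chemistry" and an assumed threshold of 0.85, the misspelled input "Marketting" resolves to "Marketing", while an input with no characters in common with any candidate returns null. Short abbreviations such as Phy may score below a strict threshold, so the cut-off needs tuning against real data.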

 

You can send all suspect and failed matches for manual review, so that the data is handled partly automatically and partly manually.

 

Warm Regards,
Nikhil Thampi
