talend_consumer
Contributor

tMap wildcard join

Dear All,

 

I have a fairly straightforward requirement. I need to allocate the nearest delivery centre based on each customer's postcode. I can generate two files:

customers.csv

Name,Address,PostCode

Joe Bloggs, Some Road in Reading, RG1 4XX

Tim John, Some Road in London, NW10 1AA

 

Delivery.csv

PostCode, Centre

RG, Reading

NW, London

EC,London

 

I want to match rows where the PostCode from customers.csv contains the PostCode from Delivery.csv, i.e. a '%PostCode%' wildcard match.

 

Many Thanks

 

6 Replies
Anonymous
Not applicable

Hi,

 

Identifying the nearest delivery centre with just a %POST_CODE% match on the customer postcode is a pretty risky proposition. The match may well be wrong in many cases, and we would be reinventing the wheel when mature SaaS solutions are already available.
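To make the matching risk concrete, here is a minimal Java sketch comparing a "contains" check (the equivalent of '%PostCode%') with a prefix check. The class name, method names and the sample postcode are made up for illustration; they are not taken from the thread's data.

```java
class ContainsVsPrefix {

    // Equivalent of a '%code%' wildcard: true if the code appears anywhere.
    static boolean containsMatch(String postCode, String areaCode) {
        return postCode.contains(areaCode);
    }

    // Stricter check: true only if the postcode begins with the code.
    static boolean prefixMatch(String postCode, String areaCode) {
        return postCode.startsWith(areaCode);
    }

    public static void main(String[] args) {
        String postCode = "EC1A 1RG"; // made-up sample value
        String areaCode = "RG";       // Reading area code from Delivery.csv

        System.out.println(containsMatch(postCode, areaCode)); // true
        System.out.println(prefixMatch(postCode, areaCode));   // false
    }
}
```

The "contains" check matches the trailing "RG", so this sample would be paired with the Reading centre as well as the EC one, whereas the prefix check is not fooled by it.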

0683p000009M3FH.png

 

In a nutshell, I would suggest using the standardization and matching features of one of the components above, depending on your comfort level. They use Royal Mail's Standardized Address File as their source and report accuracy percentages when comparing against the input addresses.

 

Another problem: even if the postal address matches, what guarantees that you are delivering to exactly the right person if the match is a hard match on postcode alone? There are many common names in the UK, such as Adam Smith, and people with the same name may live on the same street.

 

Then there is the scenario where a father and son share the same name, distinguished only by a Junior or II suffix. Another is a divorced husband and wife still living at the same location. If your address match is wrong and a letter goes to the wrong member of a divorced couple, you could end up paying a hefty GDPR fine 🙂

 

So my suggestion is to use the address standardization features of the Talend Data Quality components rather than building something yourself. Otherwise you will put a lot of effort into reinventing the wheel, while those components already do this with mature algorithms in a plug-and-play fashion.

 

So do you still want to try this yourself?

 

Warm Regards,
Nikhil Thampi

Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved 🙂

talend_consumer
Contributor
Author

I appreciate your advice. However, all our delivery centres have a postcode list, i.e. just the first two letters. So I am really looking for a wildcard match in tMap or similar...

 

Many Thanks

akumar2301
Specialist II

You could do something simple if you are sure that the first two characters of the postal code contain the region code. Please add a null check to your expression (I missed that).

0683p000009M3Fb.jpg
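The screenshot's contents are not available here, so the following is only a sketch of what such a key expression could look like, with the null check included. The class and method names are hypothetical. It assumes the first two characters are the join key and that values may carry stray spaces, as the sample rows in customers.csv do after each comma.

```java
import java.util.Locale;

class PostCodeKeys {

    /**
     * Returns the first two characters of the postcode, trimmed and
     * upper-cased, or null when the input is null or too short to hold a key.
     */
    static String areaKey(String postCode) {
        if (postCode == null) {
            return null; // null check, so the join expression cannot throw
        }
        String trimmed = postCode.trim();
        if (trimmed.length() < 2) {
            return null;
        }
        return trimmed.substring(0, 2).toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        System.out.println(areaKey(" RG1 4XX")); // RG
        System.out.println(areaKey("NW10 1AA")); // NW
        System.out.println(areaKey(null));       // null
    }
}
```

If a helper like this were placed in a user routine, the tMap join key on the customer side could call it on the PostCode column and be joined by equality to the PostCode column of Delivery.csv. Rows that yield a null key simply find no match.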

talend_consumer
Contributor
Author

Thanks Abhishek,

That is my fallback plan. Is there any way to do a wildcard match, as the next-phase load will have 3-character postcodes? Alternatively, I will upload them to a staging database and use a wildcard match in the join. However, I wanted to see whether Talend has anything available to do this.

 

Appreciate your answer.
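One way to handle a mix of two- and three-character codes without a database is a longest-prefix lookup in plain Java, which could live in a routine or a tJavaRow. The sketch below is an illustration only: it assumes that "3 digit post-codes" means three-character prefixes, and the class name, method name and the "RG1" entry are hypothetical.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

class PrefixLookup {

    /**
     * Tries the longest prefix first (up to maxPrefixLength characters) and
     * falls back to shorter ones. Returns null when nothing matches.
     */
    static String findCentre(String postCode, Map<String, String> centresByPrefix,
                             int maxPrefixLength) {
        if (postCode == null) {
            return null;
        }
        String cleaned = postCode.trim().toUpperCase(Locale.ROOT);
        int longest = Math.min(maxPrefixLength, cleaned.length());
        for (int len = longest; len >= 1; len--) {
            String centre = centresByPrefix.get(cleaned.substring(0, len));
            if (centre != null) {
                return centre;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Map<String, String> centres = new HashMap<>();
        centres.put("RG", "Reading");
        centres.put("NW", "London");
        centres.put("EC", "London");
        centres.put("RG1", "Reading Central"); // hypothetical three-character entry

        System.out.println(findCentre(" RG1 4XX", centres, 3)); // Reading Central
        System.out.println(findCentre("RG2 0AA", centres, 3));  // Reading
        System.out.println(findCentre("ZZ1 1AA", centres, 3));  // null
    }
}
```

Note that a three-character prefix such as "RG1" would also match a postcode starting "RG10", so prefixes ending in a digit may need an extra check on the character that follows.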

 

 

 

Anonymous
Not applicable

Hi,

 

That will be a good fallback method, but add the necessary caveats about the possible data-match risk to your design documents and development handover guide so that the issue does not snowball back to you later.

 

If the project's architect decides this is the best method, then the onus of handling any data problems or GDPR issues should ultimately rest with him or her. They should not shift the blame onto you or the testing team for any setback.

 

Warm Regards,
Nikhil Thampi


talend_consumer
Contributor
Author

Appreciate it, Nikhil.

 

Thanks