Skip to main content
Announcements
A fresh, new look for the Data Integration & Quality forums and navigation! Read more about what's changed.
cancel
Showing results for 
Search instead for 
Did you mean: 
AAA999
Contributor III

Validating International Email and Phone Number

I am new to Talend and would like to know if there is any component that can be used to validate International Email and Phone number?

I went through the document and found tVerifyEmail but it is not that helpful for international check. 

Has anyone used this concept in there project? If so please provide the steps.

 

Thanks a Lot.

Labels (2)
13 Replies
TRF
Champion II

You may use a regex if you want to validate email format https://howtodoinjava.com/regex/java-regex-validate-email-address/
For a deeper analysis and validation, you need to use a service validation. See this link for example https://www.accuwebhosting.com/blog/top-10-bulk-email-list-verification-validation-services-compared...
AAA999
Contributor III
Author

Thanks! could you please provide me the job design for Email and Phone and the components?

Thanks a lot

TRF
Champion II

You should try by yourself or you will never learn how-to.

Try, fail, retry, ask, retry, win - this is the best way.

AAA999
Contributor III
Author

You are right. I have worked with Infa and Infosphere etl tools. Talend is new and due to time crunch needed the solution.

Thanks a lot.

 

AAA999
Contributor III
Author

Could you at least let me know what components would be good? And how the o/p comes back, does it additional o/p coumns the way it does in INFA and InforSphere?

TRF
Champion II

Can't understand the 2nd part of your question.

For the rest the components you need depend of the way and/or service you choose for email validation.

If you just want to validate the email format, a little routine is enough.

You can use this one for example:

package routines;
import java.util.regex.*;
public class checkEmail {
    public static boolean isEmailValid(String email) {
    	String regex = "^[\\w!#$%&'*+/=?`{|}~^-]+(?:\\.[\\w!#$%&'*+/=?`{|}~^-]+)*@(?:[a-zA-Z0-9-]+\\.)+[a-zA-Z]{2,6}$";
    	Pattern pattern = Pattern.compile(regex);
    	Matcher matcher = pattern.matcher(email);
    	return matcher.matches();
    }
}

If you decide to call an external service for in deep validation, the components depend on the service constraints.

Internet components such as tHttpRequest, tREST or tSOAP will probably be usefull.

 

AAA999
Contributor III
Author

my second question"

 

In infosphere they have a in-build component for Address and Name Standardization for USA.

ex: there are 6 columns Addr1,Addr2,City,State,Zip,Name.

When the data is passed through the Standardized component it adds additional cols along with the input cols.

o/p column

6 input cols + additional cols as Address_Formatted, Name_Formatted, Input_Pattern....

Does Talend component behave in similar way.

Thanks

 

AAA999
Contributor III
Author

Still not resolved

Anonymous
Not applicable

Easiest way to validate and email email address is through email verification service. If there are fewer emails that you need to verify, you can use single validation feature or if ther is a list of email records in 100s or 1000s batch email verification can be used. https://bulkemailverification.com is an unbiased directory of leading email list cleaning services that can come really come handy in selecting the right tools as per your need.