<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data standardization and cleansing using talend open studio in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Data-standardization-and-cleansing-using-talend-open-studio/m-p/2248106#M33067</link>
    <description>Hi, 
&lt;BR /&gt;Talend provides data quality (DQ) components such as 
&lt;A href="https://help.talend.com/search/all?query=tStandardizePhoneNumber&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tStandardizePhoneNumber&lt;/A&gt; and 
&lt;A href="https://help.talend.com/search/all?query=tRecordMatching&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tRecordMatching&lt;/A&gt;, along with some name parsing routines that ship with TDQ. Check out the DataQuality category in the expression builder to pull out first name, last name, title, etc. 
&lt;BR /&gt;Please take a look at the Talend Data Quality product: 
&lt;BR /&gt; 
&lt;A href="http://www.talend.com/products/data-quality" target="_blank" rel="nofollow noopener noreferrer"&gt;http://www.talend.com/products/data-quality&lt;/A&gt; 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
    <pubDate>Mon, 09 Nov 2015 03:45:37 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2015-11-09T03:45:37Z</dc:date>
    <item>
      <title>Data standardization and cleansing using talend open studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-standardization-and-cleansing-using-talend-open-studio/m-p/2248105#M33066</link>
      <description>Hi Guys! 
&lt;BR /&gt;I am quite new to Talend and I have run into a bit of a problem.&amp;nbsp; 
&lt;BR /&gt;I have two tables, both CSV files: 
&lt;BR /&gt;Customer.csv and Billing.csv 
&lt;BR /&gt;Customer.csv Table columns: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Customer_ID|Title|First_Name|Last_Name|Status|Email|Date_of_Birth 
&lt;BR /&gt;Billing.csv Table columns: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; Customer_ID|Phone_No|Address_Line_1|City|Region|Country|Zip 
&lt;BR /&gt; 
&lt;BR /&gt;I have a task to standardize the data in these tables. 
&lt;BR /&gt;In the Customer.csv table, the 
&lt;B&gt;accepted&lt;/B&gt; formats for the "Title" column are the following: 1.)&amp;nbsp;Mr. 2.) Mrs. and 3.) Ms. 
&lt;BR /&gt;For the Billing.csv table, the 
&lt;B&gt;accepted&lt;/B&gt; format for the "Phone_No" column is: XXX-XXX-XXXX 
&lt;BR /&gt;My question is whether it's possible to standardize and cleanse the data using Talend Open Studio for Data Integration (not the enterprise edition of Talend, and not Talend Studio for Data Quality either). And if it's not possible, how can I filter the data&amp;nbsp;so that rows that don't follow the right format do not pass (a little bit of error handling)? Is it possible to standardize data using tMap or tJava? 
&lt;BR /&gt;Anyway, any kind of help would be greatly appreciated. 
&lt;BR /&gt;Sincerely, Locke</description>
      <pubDate>Sat, 16 Nov 2024 10:57:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-standardization-and-cleansing-using-talend-open-studio/m-p/2248105#M33066</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:57:41Z</dc:date>
    </item>
    <item>
      <title>Re: Data standardization and cleansing using talend open studio</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-standardization-and-cleansing-using-talend-open-studio/m-p/2248106#M33067</link>
      <description>Hi, 
&lt;BR /&gt;Talend provides data quality (DQ) components such as 
&lt;A href="https://help.talend.com/search/all?query=tStandardizePhoneNumber&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tStandardizePhoneNumber&lt;/A&gt; and 
&lt;A href="https://help.talend.com/search/all?query=tRecordMatching&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tRecordMatching&lt;/A&gt;, along with some name parsing routines that ship with TDQ. Check out the DataQuality category in the expression builder to pull out first name, last name, title, etc. 
&lt;BR /&gt;Please take a look at the Talend Data Quality product: 
&lt;BR /&gt; 
&lt;A href="http://www.talend.com/products/data-quality" target="_blank" rel="nofollow noopener noreferrer"&gt;http://www.talend.com/products/data-quality&lt;/A&gt; 
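&lt;BR /&gt;If those components are not available in your edition, one option is a small user routine (plain Java) called from a tMap expression. The following is only a rough sketch, not an official Talend API: the class name StandardizeSketch and both method names are made up for illustration, and it assumes 10-digit phone numbers and that only Mr., Mrs. and Ms. are valid titles. Each method returns null for a value it cannot standardize, so a tMap filter could route those rows to a reject output. 

```java
import java.util.Locale;

// Hypothetical user routine; the class and method names are illustrative only.
public class StandardizeSketch {

    // Maps common spellings of a title to "Mr.", "Mrs." or "Ms.".
    // Returns null when the value is missing or not one of the three titles.
    public static String standardizeTitle(String raw) {
        if (raw == null) {
            return null;
        }
        // Drop dots and surrounding spaces, then compare case-insensitively.
        String key = raw.trim().replace(".", "").toLowerCase(Locale.ROOT);
        switch (key) {
            case "mr":
                return "Mr.";
            case "mrs":
                return "Mrs.";
            case "ms":
                return "Ms.";
            default:
                return null;
        }
    }

    // Formats a phone number as XXX-XXX-XXXX.
    // Assumption: a valid number has exactly 10 digits once every
    // non-digit character has been removed. Returns null otherwise.
    public static String standardizePhone(String raw) {
        if (raw == null) {
            return null;
        }
        String digits = raw.replaceAll("[^0-9]", "");
        if (digits.length() != 10) {
            return null;
        }
        return digits.substring(0, 3) + "-"
                + digits.substring(3, 6) + "-"
                + digits.substring(6);
    }
}
```

&lt;BR /&gt;In tMap, an output expression could then look like StandardizeSketch.standardizePhone(row1.Phone_No), where row1 stands for whatever your input flow is named. 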
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Mon, 09 Nov 2015 03:45:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-standardization-and-cleansing-using-talend-open-studio/m-p/2248106#M33067</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2015-11-09T03:45:37Z</dc:date>
    </item>
  </channel>
</rss>

