<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fragmentation PK on UUID for SQL Server in Talend Data Catalog</title>
    <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338438#M1741</link>
    <description>You can rule out char(36) right away. 
&lt;BR /&gt;It takes too long to load the data, and SQL Server does not handle it well as a PK type. 
&lt;BR /&gt;Moreover, the fragmentation stays the same.</description>
    <pubDate>Tue, 12 Apr 2016 10:53:01 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-04-12T10:53:01Z</dc:date>
    <item>
      <title>Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338435#M1738</link>
      <description>Hi everybody. 
&lt;BR /&gt;In a customer MDM project, we run Talend MDM 5.6.1 on SQL Server 2008 R2. 
&lt;BR /&gt;After starting with autoIncrement, we decided to move to UUIDs generated in the job, so the primary key columns are of type nvarchar(255). 
&lt;BR /&gt;I have run into the following problem: fragmentation is very high on every PK (99%). 
&lt;BR /&gt;The first big load writes around 15 million records into each entity. 
&lt;BR /&gt;The weekly loads will write 200,000 rows. 
&lt;BR /&gt;I know that we can run an index reorganization, but doing it once a week would be too much. 
&lt;BR /&gt; 
&lt;BR /&gt;I tried changing the type to uniqueidentifier in one entity, but the result is the same.</description>
      <pubDate>Sat, 16 Nov 2024 10:42:11 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338435#M1738</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T10:42:11Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338436#M1739</link>
      <description>Hello,&lt;BR /&gt;the right data type for a UUID is either char(36) (this is a UUID: 0862164E-235D-4DD3-9964-905F471EFD06) or a database-internal type that the DBMS vendor recommends for this.&lt;BR /&gt;If you have problems with a corrupted PK, I think you are in the wrong forum; you should ask in a SQL Server forum.&lt;BR /&gt;&lt;BR /&gt;Read here: &lt;A href="http://stackoverflow.com/questions/4503501/microsoft-sql-server-2008-99-fragmentation-on-non-clustered-non-unique-index" target="_blank" rel="nofollow noopener noreferrer"&gt;stackoverflow.com/questions/4503501/microsoft-sql-server-2008-99-fragmentation-on-non-clustered-non-unique-index&lt;/A&gt;&lt;BR /&gt;or here:&amp;nbsp; &lt;A href="http://www.sqlskills.com/blogs/paul/can-guid-cluster-keys-cause-non-clustered-index-fragmentation/" target="_blank" rel="nofollow noopener noreferrer"&gt;www.sqlskills.com/blogs/paul/can-guid-cluster-keys-cause-non-clustered-index-fragmentation/&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Anyway, AFAIK you first load the data into a table, then create the index. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Regards&lt;BR /&gt;dj</description>
      <pubDate>Fri, 08 Apr 2016 13:29:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338436#M1739</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-08T13:29:10Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338437#M1740</link>
      <description>Hi,
&lt;BR /&gt;Maybe I should try char(36).
&lt;BR /&gt;But sorry, I think you have misunderstood my problem.
&lt;BR /&gt;We launched the MDM project on SQL Server, as recommended by Talend itself.
&lt;BR /&gt;We started with autoIncrement (a best practice at my customer), but in MDM, autoincrement is managed through a single table, so adding data in parallel created a bottleneck.
&lt;BR /&gt;So we tried UUIDs instead: we changed the type of every PK, and loading is now much better.
&lt;BR /&gt;Last but not least, when we generate UUIDs and inject them, the data is fragmented right away.
&lt;BR /&gt;I have no problem with the PK itself, just bad performance due to fragmentation.
&lt;BR /&gt;Maybe you are right that it's a SQL Server topic, but it is a typical case when you run MDM on SQL Server and want to use UUIDs.
&lt;BR /&gt;The goal is to find the best type or a way to order the UUIDs.
&lt;BR /&gt;In the future we will inject 100,000 new records a week; we can't sort the UUIDs before injection or reorganize the index every day... too heavy.</description>
      <pubDate>Tue, 12 Apr 2016 08:57:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338437#M1740</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-12T08:57:06Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338438#M1741</link>
      <description>You can rule out char(36) right away. 
&lt;BR /&gt;It takes too long to load the data, and SQL Server does not handle it well as a PK type. 
&lt;BR /&gt;Moreover, the fragmentation stays the same.</description>
      <pubDate>Tue, 12 Apr 2016 10:53:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338438#M1741</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-12T10:53:01Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338439#M1742</link>
      <description>Please post the CREATE TABLE statement, including the PK creation, here.</description>
      <pubDate>Tue, 12 Apr 2016 12:51:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338439#M1742</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-12T12:51:00Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338440#M1743</link>
      <description>Why not, but I don't see the point: Talend MDM generates the model itself, and therefore the table.
&lt;BR /&gt;You just choose string in the MDM model, and it creates the table in SQL Server.
&lt;BR /&gt;By default, a string PK becomes nvarchar(255) CLUSTERED.</description>
      <pubDate>Tue, 12 Apr 2016 15:31:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338440#M1743</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-12T15:31:07Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338441#M1744</link>
      <description>First: I don't run MDM, so if you ask for help ...&lt;BR /&gt;&lt;BR /&gt;Second: a UUID is random data, and random data is not good (IMHO the worst)&amp;nbsp;for clustered indexes under heavy loads.&lt;BR /&gt;A clustered index is ONLY good when you append data; otherwise the leaf pages of the table are reorganized all the time.&lt;BR /&gt;Change this; if it is internal to MDM, open a case&amp;nbsp;with Talend.</description>
      <pubDate>Wed, 13 Apr 2016 18:02:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338441#M1744</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-13T18:02:45Z</dc:date>
    </item>
    <item>
      <title>Re: Fragmentation PK on UUID for SQL Server</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338442#M1745</link>
      <description>Hi,
&lt;BR /&gt;One more piece of information: Talend MDM itself generates a PK named "x_talend_id" when a sub-table (a complex type in an entity) is created.
&lt;BR /&gt;This x_talend_id is a UUID by default and, amazingly, this UUID is sorted, so there is no fragmentation!
&lt;BR /&gt;
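&lt;BR /&gt;A minimal sketch of what a "sorted" UUID could look like when the key is generated in the job. This is an assumption for illustration, not how Talend builds x_talend_id. The hypothetical helper ordered_uuid puts a millisecond timestamp in the first 12 hex digits, so in a string column such as nvarchar(255) new keys compare after older ones and land at the end of the clustered index. The result is UUID-shaped but not a standard RFC 4122 UUID, and it would probably not help in a uniqueidentifier column, which, as far as I know, SQL Server compares starting from the last byte group.

```python
import os
import time


def ordered_uuid(now_ms=None):
    """Return a UUID-shaped string that sorts by creation time (hypothetical helper)."""
    if now_ms is None:
        now_ms = int(time.time() * 1000)
    # 48-bit millisecond timestamp first, so string order follows creation order.
    head = format(now_ms % 2 ** 48, "012x")
    # 80 random bits keep keys unique across parallel loaders.
    tail = os.urandom(10).hex()
    raw = head + tail
    # Use the usual 8-4-4-4-12 grouping so the value still fits char(36).
    return "-".join([raw[0:8], raw[8:12], raw[12:16], raw[16:20], raw[20:32]])
```

&lt;BR /&gt;Keys created in the same millisecond are still in random order relative to each other, but they all fall at the tail of the index instead of being spread across it.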
&lt;BR /&gt;To conclude, I know that Talend has a solution for this... I will try to open an issue.</description>
      <pubDate>Wed, 20 Apr 2016 09:29:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fragmentation-PK-on-UUID-for-SQL-Server/m-p/2338442#M1745</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-20T09:29:30Z</dc:date>
    </item>
  </channel>
</rss>

