Please start any new threads on our new site at https://forums.sqlteam.com. We've got lots of great SQL Server experts to answer whatever question you can come up with.

 All Forums
 SQL Server 2005 Forums
 SQL Server Administration (2005)
 How to Remove Dupicate Data....

Author  Topic 

hirani_prashant
Yak Posting Veteran

93 Posts

Posted - 2008-03-24 : 01:11:46
Hello All,

I do have redundancy in data for a particaular table.
Can any one please guide me how to remove duplicate data.

Thanks
Prashant

harsh_athalye
Master Smack Fu Yak Hacker

5581 Posts

Posted - 2008-03-24 : 04:57:35
See this: http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=6256

Harsh Athalye
India.
"The IMPOSSIBLE is often UNTRIED"
Go to Top of Page

jackv
Master Smack Fu Yak Hacker

2179 Posts

Posted - 2008-03-24 : 13:43:02
This one gives a good explanation:
http://www.sql-server-performance.com/articles/dev/dv_delete_duplicates_p1.aspx

Jack Vamvas
--------------------
Search IT jobs from multiple sources- http://www.ITjobfeed.com
Go to Top of Page

LOOKUP_BI
Constraint Violating Yak Guru

295 Posts

Posted - 2008-03-24 : 14:03:15
You can write a Query to find exact matching rows based on the columns you would like to compare the rows againts.Place these rows[duplicates] into a separate table and run SSIS[ fuzzy grouping transformation] on the table.[Use this method only if you have large number of rows, example up to max of 800 000 rows, as Fuzzy performs badly once number of rows increase above 800 000 rows]

If you have minimum number of rows.You can create a SSIS package and directly drag the table to a Fuzzy Grouping Transformation.This process would split the incoming rows into duplicate and unique table.This would not remove both duplicate rows[but rather keep 1].

http://www.sql-server-performance.com/article_print.aspx?id=1002&type=art
Go to Top of Page
   

- Advertisement -