| Author |
Topic |
|
hirani_prashant
Yak Posting Veteran
93 Posts |
Posted - 2008-03-24 : 01:11:46
|
| Hello all, I have redundant data in a particular table. Can anyone please guide me on how to remove the duplicate data? Thanks, Prashant |
|
|
harsh_athalye
Master Smack Fu Yak Hacker
5581 Posts |
|
|
jackv
Master Smack Fu Yak Hacker
2179 Posts |
|
|
LOOKUP_BI
Constraint Violating Yak Guru
295 Posts |
Posted - 2008-03-24 : 14:03:15
|
| You can write a query to find exactly matching rows, based on the columns you want to compare the rows against. Place these duplicate rows into a separate table and run the SSIS Fuzzy Grouping transformation on that table. (Use this method only if you have a large number of rows, for example up to a maximum of 800,000, because Fuzzy Grouping performs badly once the row count rises above 800,000.) If you have only a small number of rows, you can create an SSIS package and connect the table directly to a Fuzzy Grouping transformation. This process splits the incoming rows into a duplicate table and a unique table. It does not remove every copy of a duplicated row; it keeps one. http://www.sql-server-performance.com/article_print.aspx?id=1002&type=art |
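For the exact-match step described above, here is a rough sketch of what such a query could look like. It is not from the original reply: the `customers` table, its columns, and the sample rows are all made up for illustration, and it uses Python's built-in `sqlite3` module as a stand-in so it can run anywhere. The `GROUP BY ... HAVING COUNT(*) > 1` pattern itself is standard SQL and should carry over to SQL Server with little change. The fuzzy grouping part is not reproduced here.

```python
import sqlite3

# Hypothetical table and column names; SQLite stands in for SQL Server here.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, email TEXT)"
)
conn.executemany(
    "INSERT INTO customers (name, email) VALUES (?, ?)",
    [
        ("Ann", "ann@example.com"),
        ("Ann", "ann@example.com"),
        ("Bob", "bob@example.com"),
        ("Ann", "ann@example.com"),
        ("Cy", "cy@example.com"),
    ],
)

# Step 1: find groups of rows that match exactly on the compared columns.
duplicate_groups = conn.execute(
    """
    SELECT name, email, COUNT(*) AS copies
    FROM customers
    GROUP BY name, email
    HAVING COUNT(*) > 1
    """
).fetchall()

# Step 2: delete every row except the lowest id in each group,
# so exactly one copy of each duplicated row survives.
conn.execute(
    """
    DELETE FROM customers
    WHERE id NOT IN (
        SELECT MIN(id) FROM customers GROUP BY name, email
    )
    """
)
conn.commit()

remaining = conn.execute(
    "SELECT id, name, email FROM customers ORDER BY id"
).fetchall()
```

Which columns go into the `GROUP BY` decides what counts as a duplicate, so list only the columns you actually want compared. This sketch also assumes the table has a unique `id` column to tell the copies apart; a table without one would need a different approach.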
 |