
 How to identify similar records


sxbo672158
Starting Member

4 Posts

Posted - 2011-07-19 : 22:25:42
Hi all,
Is there a way to identify similar records in a huge table without checking the records manually?

For example, suppose the table contains similar email addresses such as:

abcd123@gmail.com
abcd1234@gmail.com
abcd12345@gmail.com

I want to find them and output them into a new table.

Thanks

tkizer
Almighty SQL Goddess

38200 Posts

Posted - 2011-07-19 : 22:49:26
You'll have to define "similar" for us. What are the rules? How many characters have to match?

Tara Kizer
Microsoft MVP for Windows Server System - SQL Server
http://weblogs.sqlteam.com/tarad/


visakh16
Very Important crosS Applying yaK Herder

52326 Posts

Posted - 2011-07-20 : 06:35:21
It seems that what you're looking for is some kind of fuzzy grouping algorithm.
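
To make "fuzzy grouping" a bit more concrete, here is a rough sketch in Python that scores every pair of strings with the standard library's `difflib.SequenceMatcher` and keeps the pairs above a cutoff. The function name `similar_pairs` and the 0.8 default are placeholders of mine, not a rule from this thread, and comparing every pair is quadratic, so on a huge table you would first narrow the candidates (for example by email domain) before scoring.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_pairs(values, threshold=0.8):
    """Return (a, b, score) for each pair whose similarity ratio meets the threshold.

    The threshold is an assumed placeholder; pick it once "similar" is defined.
    """
    pairs = []
    for a, b in combinations(values, 2):
        # ratio() is 2 * matched_characters / total_characters, between 0 and 1
        score = SequenceMatcher(None, a, b).ratio()
        if score >= threshold:
            pairs.append((a, b, round(score, 3)))
    return pairs


if __name__ == "__main__":
    emails = [
        "abcd123@gmail.com",
        "abcd1234@gmail.com",
        "abcd12345@gmail.com",
        "zzz@yahoo.com",  # made-up unrelated address for contrast
    ]
    for a, b, score in similar_pairs(emails):
        print(a, b, score)
```

With the sample addresses from the first post, the three gmail addresses pair up with each other and the unrelated address is left out.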

------------------------------------------------------------------------------------------------------
SQL Server MVP
http://visakhm.blogspot.com/


sxbo672158
Starting Member

4 Posts

Posted - 2011-07-20 : 21:33:14
Probably 80% or above.
There are no specific rules, as long as the characters are in the same order and only one or two letters differ.

quote:
Originally posted by tkizer

You'll have to define "similar" for us. What are the rules? How many characters have to match?

Tara Kizer
Microsoft MVP for Windows Server System - SQL Server
http://weblogs.sqlteam.com/tarad/


visakh16
Very Important crosS Applying yaK Herder

52326 Posts

Posted - 2011-07-21 : 02:12:45
You need to specify clearly up to what level of difference you want to treat two records as a match.
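
One way to pin the level down is to express it as an edit distance: the number of single-character inserts, deletes, or substitutions needed to turn one string into the other. The sketch below is plain Python with no libraries. The names `edit_distance`, `similarity`, and `is_similar`, and the default limit of two edits, are my own assumptions for illustration, so adjust them to whatever rule you settle on. The same logic could be ported to a T-SQL function, but that is not shown here.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between a and b, keeping one row of the table at a time."""
    prev = list(range(len(b) + 1))  # distance from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute, or keep if equal
            ))
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Distance rescaled to 0..1 so it can be compared with a percentage cutoff."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1 - edit_distance(a, b) / longest


def is_similar(a: str, b: str, max_edits: int = 2) -> bool:
    """True when the strings differ by at most max_edits characters (assumed limit)."""
    return edit_distance(a, b) <= max_edits


if __name__ == "__main__":
    print(edit_distance("abcd123@gmail.com", "abcd12345@gmail.com"))
    print(is_similar("abcd123@gmail.com", "abcd12345@gmail.com"))
```

A fixed edit count and a percentage behave differently on short strings: two edits is a small fraction of a long address but a large fraction of a short one, so it helps to decide which of the two you actually mean.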

------------------------------------------------------------------------------------------------------
SQL Server MVP
http://visakhm.blogspot.com/
