Please start any new threads on our new site at https://forums.sqlteam.com. We've got lots of great SQL Server experts to answer whatever question you can come up with.

 All Forums
 SQL Server 2005 Forums
 Other SQL Server Topics (2005)
 Identifying ALL possible renditions of a string

Author  Topic 

LukeD
Starting Member

5 Posts

Posted - 2011-02-10 : 04:16:06
Hi,

I have a table containing a colum 'OrganisationName' which comprises of a list of OrganisationNames that are believed to exist within my database, ofcourse not all will be exact matches, I wish to return all possible simularities of each OrganisationName within that table that exist within my database. This is ofcourse part of a data cleasing process but I the manual labour required to determine ALL existent data is unpractical and at the mercy of human error.

Is there a query? or systematic method of exposing such data?

Thanks

robvolk
Most Valuable Yak

15732 Posts

Posted - 2011-02-10 : 06:49:38
Look up the SOUNDEX() and DIFFERENCE() functions in SQL Server Books Online. They perform rudimentary "sounds alike" matches that may help you. Also look at the Fuzzy Lookup operator in Integration Services.
Go to Top of Page

Lumbago
Norsk Yak Master

3271 Posts

Posted - 2011-02-10 : 07:30:14
You can also check out full-text indexes and if it's an option to upgrade to SQL 2008 R2 the new Master Data Services has some *pretty cool* string comparison functions! Check out the mdq.Similarity function:

http://thefirstsql.com/2011/02/07/regular-expressions-advanced-string-matching-and-new-split-function-sql-server-2008-r2/



- Lumbago
My blog-> http://thefirstsql.com/2011/02/07/regular-expressions-advanced-string-matching-and-new-split-function-sql-server-2008-r2/
Go to Top of Page
   

- Advertisement -