My colleague posted a question about fuzzy matching (quoted message below). I am not familiar with the subject, so I would like to put it to the experts on Tek-Tips.
“I am looking for a method of determining fuzzy matches, and hoping you have a suggestion for me. I am not referring to fuzzy matches in an internet search application. This is for a database application. We have a requirement to evaluate pairs of columns and determine if they are "close" to a match. I am thinking of an algorithm that will return a score on how similar the columns are. We would set a threshold on the score and determine if it was close enough to call it a match.
This is going to run against a table with perhaps 30,000 rows. The columns are VARCHAR, but they contain all numeric digits so a numeric algorithm would work.
Do you have any resources, experience, or leads on this?”
I would greatly appreciate your help.
Numina
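To make the "score plus threshold" idea in the quoted message concrete, here is a minimal sketch of one possible approach: an edit-distance (Levenshtein) similarity normalized to the range 0 to 1. This is only an illustration, not something named in the question. The choice of Python, the function names, and the 0.8 threshold are all assumptions, and a real database application might do the same thing in SQL or a stored procedure instead.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions, or
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Score from 0.0 (nothing in common) to 1.0 (identical)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty values are treated as identical
    return 1.0 - levenshtein(a, b) / longest


def is_close_match(a: str, b: str, threshold: float = 0.8) -> bool:
    """Hypothetical cut-off: 0.8 is an arbitrary starting value to tune."""
    return similarity(a, b) >= threshold
```

Because the columns are VARCHAR digit strings, comparing them as strings catches mistyped or dropped digits, which a plain numeric difference would not. If the requirement is one comparison per row, 30,000 rows means 30,000 calls, which should be a small workload for short values; comparing every row against every other row would be far more expensive.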