[AccessD] Fuzzy Matching (Like Soundex) or other ideas?
Stuart McLachlan
stuart at lexacorp.com.pg
Sat Apr 22 00:25:00 CDT 2023
Oops, it's a long time since I did it.
If the strings are different lengths, it's the levenshtein distance :)
On 22 Apr 2023 at 15:21, Stuart McLachlan wrote:
> Hamming distance? (Although that is usually for equal length strings)
> I've got some code archived somewhere that I will try to track down.
>
>
> On 21 Apr 2023 at 19:26, Ryan W wrote:
>
> > Does anyone have any fuzzy matching routines that would have fuzzy
> > matching logic based on a matching percentage or something else?
> >
> > Example:
> >
> > Client sends us a hand written chain of custody, they list something
> > we look at as: "MW-14" so we enter it as MW-14
> >
> > On a set of forms they provide later for us to use reporting
> > purposes, they called it MW14.
> >
> > So now we have data that doesn't precisely match. The usual fix is
> > for us to ask the client which one is right, and fix the incorrect
> > one. Sometimes it's the hand written form (and then our database,
> > because we relied on the hand written form to start the work),
> > sometimes it's the data entry forms they provided (You figure they'd
> > get this right....).
> >
> > I'm trying to make it so when they send us the data to digest toward
> > the end of the job, I can pull that data in and it'll match MW-14
> > with MW14, or vice versa.
> >
> > While SoundEx works for MW-14 vs MW14, it also thinks.. .. as an
> > example that MW-104 matches as well. (or something similar as an
> > example).
> >
> > I'm not even sure if a percentage match would be enough since MW-14,
> > MW14 and MW-104 are all a really tight grouping of "like"
> > characters. MW-104 would be unrelated to MW-14 or MW14... so
> > erroneously matching it would cause more grief than us just hand
> > patching the IDs before we button up the job. -- AccessD mailing
> > list AccessD at databaseadvisors.com
> > https://databaseadvisors.com/mailman/listinfo/accessd Website:
> > http://www.databaseadvisors.com
> >
>
>
> --
> AccessD mailing list
> AccessD at databaseadvisors.com
> https://databaseadvisors.com/mailman/listinfo/accessd
> Website: http://www.databaseadvisors.com
>
More information about the AccessD
mailing list