Gustav Brock
gustav at cactus.dk
Tue Jun 8 14:03:20 CDT 2004
Hi Marty et al

There is some discussion here on a national HIV register. The information kept in it will, of course, be quite sensitive, so it has been suggested that entries be kept under an anonymous Soundex code.

But I don't quite get what the purpose would be. Why not just assign a random key to each entry? Why would you want the simple level of grouping that is all a Soundex code can offer?

And indeed, if only the surname is used for the code, that will create bizarre results like your example. Why are first and middle names excluded?

/gustav

> Here is an article I came across on name transliteration.
> http://www.itworld.ca/Pages/Docbase/ViewArticle.aspx?ID=idgml-a8d2a7b7-7a56-43c8-87c5-2742527f8f14&Portal=E-Government
> I like the bit
> For example, internationally sought-after terrorist mastermind Osama Bin
> Laden has the same Soundex code (L350) as Johnny "Rotten" Lydon, former
> lead singer of British punk rock group The Sex Pistols
> There is a company named Language Analysis Systems at
> http://www.las-inc.com/
> Their price may be out of your league since they do FBI and homeland
> security work but they have some white papers that may be of use.
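
For anyone who wants to check the grouping effect discussed in this thread, here is a minimal sketch of the common "American" Soundex rules in Python. The function name soundex is my own choice, and a register may well use a different variant, so treat this as an illustration rather than the scheme actually proposed. It codes a single name only (here the surname), which is exactly why unrelated people end up in the same group.

```python
def soundex(name: str) -> str:
    """Return the 4-character American Soundex code for a single name."""
    # Map each coded consonant to its Soundex digit.
    codes = {}
    for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in group:
            codes[ch] = digit

    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""

    result = letters[0]                 # the first letter is kept as-is
    prev = codes.get(letters[0], "")    # its digit still suppresses a repeat
    for ch in letters[1:]:
        if ch in "HW":
            continue                    # H and W do not separate equal codes
        digit = codes.get(ch, "")       # vowels (and Y) have no digit
        if digit and digit != prev:
            result += digit
        prev = digit                    # a vowel resets the "previous" digit

    return (result + "000")[:4]         # pad with zeros, cut to 4 characters


for surname in ("Laden", "Lydon"):
    print(surname, soundex(surname))
```

Both surnames from the quoted article come out with the same code, so the code works as a coarse grouping key and not as a unique identifier.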