[dba-SQLServer] Find the second occurrence of a character

MartyConnelly martyconnelly at shaw.ca
Fri Dec 29 13:34:40 CST 2006


Just came across this in an IBM software review released in July 2006
used by security global watch lists.

Global Name Recognition

http://www-306.ibm.com/software/data/globalname/

IBM Global Name Recognition Technologies
IBM's patented name recognition technology is exactly
type of problem. Some of the highlights include the ability
. Identify name by culture and relative frequency
. Search for multi-cultural names in a database
. Parse a name into Surname and Given Name
. Generate frequency statistics for name tokens
. Generate all variants of a name
. Generate additional attributes such as gender
. Quickly train field personnel in advanced multicultural
searching techniques
. Utilize rich name data gained from the comprehensive
over 1 billion names from around the world

Platforms Supported
IBM Global Name Recognition technologies are available across
Win32, Unix, and Linux platforms. Interfaces are available in C++,
JNI, SOAP, and XML-over-IP for most products. (Please check specific
availability)


Ahhh here is the cough
IBM Global Name Analytics Server License + SW Maintenance 12 Months 
(D58MBLL)  $645,885.00 CAD

Robert L. Stewart wrote:

>You are just going to have to build a reference table as you go and 
>validate that the name does or does not exist in the surname 
>reference table.  If it does, parse it out to the surname 
>column.  Then parse the balance of the name.  You will still have to 
>deal with oddities like my oldest son.  Robert Christopher Lawrence 
>Stewart.  But, at least only the single surname.
>
>There are probably genealogy databases that have been built that can 
>get you most of the way on the surname database.  Most likely the 
>Mormons have the largest.
>
>John, I know it was not very helpful, hence the smile.  But, I doubt 
>very much that the source database that supplies your data is a 
>single field.  So, getting the data in the proper format is not 
>impossible.  Difficult, maybe.
>
>
>Robert
>
>At 12:00 PM 12/22/2006, you wrote:
>  
>
>>Date: Fri, 22 Dec 2006 09:32:17 -0500
>>From: "JWColby" <jwcolby at colbyconsulting.com>
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character
>>        in      astring
>>To: <dba-sqlserver at databaseadvisors.com>
>>Message-ID: <003b01c725d5$fb9a3e80$1501a8c0 at m6805>
>>Content-Type: text/plain;       charset="us-ascii"
>>
>>I think the "best way" to handle this if you are going to truly try to
>>handle this problem is to:
>>
>>Develop a list of those "prefixes" to last names - Van, La, De etc.
>>Take the first word as the first name
>>Get a count of remaining words.
>>If count > 0 then
>>        ProcessRest
>>Else
>>        Rest is last name
>>Endif
>>
>>ProcessRest
>>        Look up the second word in the prefix list.
>>        If InList then
>>                Treat everything left as the last name
>>        else
>>                Treat next word as middle name
>>                remove middle name from string
>>                Process rest as last name
>>        endif
>>End ProcessRest
>>
>>Let's just say this is not s simple sql statement
>>
>>John W. Colby
>>Colby Consulting
>>www.ColbyConsulting.com
>>
>>-----Original Message-----
>>From: dba-sqlserver-bounces at databaseadvisors.com
>>[mailto:dba-sqlserver-bounces at databaseadvisors.com] On Behalf Of
>>artful at rogers.com
>>Sent: Thursday, December 21, 2006 3:00 PM
>>To: dba-sqlserver at databaseadvisors.com
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character in
>>astring
>>
>>I appreciate your point, but I'm still not certain of the best way to go
>>with my question, which concerns the way to handle some unusual surnames.
>>
>>van den Berq
>>la Flame
>>de la Vega
>>Ben Gurion
>>
>>and any number of names that begin with "al". Or "da" as in Leonardo. My
>>very limited Italian suggests that Leonardo was born in a town called Vinci.
>>
>>So how does one sort such a list? On the capitalized word? On the first
>>letter of the two or three words considered the surname?
>>
>>Advice from Europeans, Asians, Africans, or even North Americans familiar
>>with this problem, would be appreciated. I have no immediate problem that
>>requires this solution. This is purely theoretical at the moment, but who
>>knows, someday I may need the answer.
>>
>>TIA,
>>Arthur
>>
>>----- Original Message ----
>>From: Robert L. Stewart <rl_stewart at highstream.net>
>>To: dba-sqlserver at databaseadvisors.com
>>Sent: Thursday, December 21, 2006 1:41:04 PM
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character in a
>>string
>>
>>You put it in the right columns to begin with and don't try to parse it out
>>of a single one.  :-)
>>
>>    
>>
>
>  
>

-- 
Marty Connelly
Victoria, B.C.
Canada




More information about the dba-SQLServer mailing list