[dba-SQLServer] Find the second occurrence of a character

MartyConnelly martyconnelly at shaw.ca
Fri Dec 29 13:34:40 CST 2006

Just came across this in an IBM software review released in July 2006
used by security global watch lists.

Global Name Recognition


IBM Global Name Recognition Technologies
IBM's patented name recognition technology is exactly
type of problem. Some of the highlights include the ability
. Identify name by culture and relative frequency
. Search for multi-cultural names in a database
. Parse a name into Surname and Given Name
. Generate frequency statistics for name tokens
. Generate all variants of a name
. Generate additional attributes such as gender
. Quickly train field personnel in advanced multicultural
searching techniques
. Utilize rich name data gained from the comprehensive
over 1 billion names from around the world

Platforms Supported
IBM Global Name Recognition technologies are available across
Win32, Unix, and Linux platforms. Interfaces are available in C++,
JNI, SOAP, and XML-over-IP for most products. (Please check specific

Ahhh here is the cough
IBM Global Name Analytics Server License + SW Maintenance 12 Months 
(D58MBLL)  $645,885.00 CAD

Robert L. Stewart wrote:

>You are just going to have to build a reference table as you go and 
>validate that the name does or does not exist in the surname 
>reference table.  If it does, parse it out to the surname 
>column.  Then parse the balance of the name.  You will still have to 
>deal with oddities like my oldest son.  Robert Christopher Lawrence 
>Stewart.  But, at least only the single surname.
>There are probably genealogy databases that have been built that can 
>get you most of the way on the surname database.  Most likely the 
>Mormons have the largest.
>John, I know it was not very helpful, hence the smile.  But, I doubt 
>very much that the source database that supplies your data is a 
>single field.  So, getting the data in the proper format is not 
>impossible.  Difficult, maybe.
>At 12:00 PM 12/22/2006, you wrote:
>>Date: Fri, 22 Dec 2006 09:32:17 -0500
>>From: "JWColby" <jwcolby at colbyconsulting.com>
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character
>>        in      astring
>>To: <dba-sqlserver at databaseadvisors.com>
>>Message-ID: <003b01c725d5$fb9a3e80$1501a8c0 at m6805>
>>Content-Type: text/plain;       charset="us-ascii"
>>I think the "best way" to handle this if you are going to truly try to
>>handle this problem is to:
>>Develop a list of those "prefixes" to last names - Van, La, De etc.
>>Take the first word as the first name
>>Get a count of remaining words.
>>If count > 0 then
>>        ProcessRest
>>        Rest is last name
>>        Look up the second word in the prefix list.
>>        If InList then
>>                Treat everything left as the last name
>>        else
>>                Treat next word as middle name
>>                remove middle name from string
>>                Process rest as last name
>>        endif
>>End ProcessRest
>>Let's just say this is not s simple sql statement
>>John W. Colby
>>Colby Consulting
>>-----Original Message-----
>>From: dba-sqlserver-bounces at databaseadvisors.com
>>[mailto:dba-sqlserver-bounces at databaseadvisors.com] On Behalf Of
>>artful at rogers.com
>>Sent: Thursday, December 21, 2006 3:00 PM
>>To: dba-sqlserver at databaseadvisors.com
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character in
>>I appreciate your point, but I'm still not certain of the best way to go
>>with my question, which concerns the way to handle some unusual surnames.
>>van den Berq
>>la Flame
>>de la Vega
>>Ben Gurion
>>and any number of names that begin with "al". Or "da" as in Leonardo. My
>>very limited Italian suggests that Leonardo was born in a town called Vinci.
>>So how does one sort such a list? On the capitalized word? On the first
>>letter of the two or three words considered the surname?
>>Advice from Europeans, Asians, Africans, or even North Americans familiar
>>with this problem, would be appreciated. I have no immediate problem that
>>requires this solution. This is purely theoretical at the moment, but who
>>knows, someday I may need the answer.
>>----- Original Message ----
>>From: Robert L. Stewart <rl_stewart at highstream.net>
>>To: dba-sqlserver at databaseadvisors.com
>>Sent: Thursday, December 21, 2006 1:41:04 PM
>>Subject: Re: [dba-SQLServer] Find the second occurrence of a character in a
>>You put it in the right columns to begin with and don't try to parse it out
>>of a single one.  :-)

Marty Connelly
Victoria, B.C.

More information about the dba-SQLServer mailing list