[dba-VB] "Smart" fulltext search on large documents(data)base...

Shamil Salakhetdinov shamil at smsconsulting.spb.ru
Fri Feb 4 04:34:43 CST 2011


Hi Gustav,

Search itself is as fast as Google is :)
Then it takes some time to get info on docs.
If your search filter results in a few records returned then the search is
almost instant (I have 1000+ test docs in my GoogleDocs test site):
I have slightly corrected code to return timestamps - here are the stats:

2 docs returned, batch size = 10
========================
04/02/2011 13:18:23: Logging in...
04/02/2011 13:18:23: Successfully logged in
04/02/2011 13:18:23: Gettings docs...
04/02/2011 13:18:23: Batch size = 10, getting first docs' batch...
04/02/2011 13:18:24: All 2 docs collected


another filtering criteria 882 docs returned, batch size = 100
============================================
04/02/2011 13:08:41: Logging in...
04/02/2011 13:08:41: Successfully logged in
04/02/2011 13:08:41: Gettings docs...
04/02/2011 13:08:41: Batch size = 100, getting first docs' batch...
04/02/2011 13:08:44: 100 docs processed
04/02/2011 13:08:46: 200 docs processed
04/02/2011 13:08:47: 300 docs processed
04/02/2011 13:08:49: 400 docs processed
04/02/2011 13:08:50: 500 docs processed
04/02/2011 13:08:51: 600 docs processed
04/02/2011 13:08:53: 700 docs processed
04/02/2011 13:08:54: 800 docs processed
04/02/2011 13:08:55: All 882 docs collected

same as above filtering criteria 882 docs returned, batch size = 500
=================================================
04/02/2011 13:10:05: Logging in...
04/02/2011 13:10:05: Successfully logged in
04/02/2011 13:10:05: Gettings docs...
04/02/2011 13:10:05: Batch size = 500, getting first docs' batch...
04/02/2011 13:10:14: 500 docs processed
04/02/2011 13:10:19: All 882 docs collected

BTW, I have used just a subset of Google.GData classlibs.

Classlibs used: 

Google.GData.AccessControl.dll
Google.GData.Client.dll
Google.GData.Extensions.dll
Google.GData.Documents.dll

The full list of samples Google.GData provides is:

analytics
Analytics_AccountFeed_Sample
Analytics_DataFeed_Sample
appsforyourdomain
blogger
calendar
codesearch
contentforshopping_sample
+DocListExporter
+DocListUploader
execrequest
gapps_calendar_resource_sample
gapps_google_mail_settings_sample
gapps_multidomain_sample
gapps_orgmanagement_sample
gbase
health
OAuth
PhotoBrowser
spreadsheets
YouTubeNotifier
YouTubeSample
YouTubeSample.sln
YouTubeUploader

I used just two of provided samples for R&D and making my solution.
As one can see based on samples' titles almost everything can be
queryed/processed on google similar way....

Thank you.

--
Shamil
 
-----Original Message-----
From: dba-vb-bounces at databaseadvisors.com
[mailto:dba-vb-bounces at databaseadvisors.com] On Behalf Of Gustav Brock
Sent: 4 ??????? 2011 ?. 12:33
To: dba-vb at databaseadvisors.com
Subject: Re: [dba-VB] "Smart" fulltext search on large
documents(data)base...

Hi Shamil

How fast is this? Sounds very clever.

/gustav


>>> shamil at smsconsulting.spb.ru 04-02-2011 03:38 >>>
Hi Doug,

Yes, search google docs docs can be automated  - I have found and I used
http://code.google.com/p/google-gdata/
That wasn't very quick to make it working but I have got it working after
all.

My sample quick & dirty code, which worked for me was as the following:

    string userName = "myTest at gmail.com";
    string password = "mypassword";

    List<Document> all = new List<Document>();
    GDataCredentials credentials = new GDataCredentials(userName, password);

<<<skipped>>>




More information about the dba-VB mailing list