[dba-VB] "Smart" fulltext search on large documents(data)base...

Shamil Salakhetdinov shamil at smsconsulting.spb.ru
Fri Feb 4 04:34:43 CST 2011


Hi Mark --

I didn't know about that DNN's "heavy lifting" - what it's? I'm joining
Gustav's question on that subject...

Yes, I did plan to use DNN search for local manual search on the custom site
but I do plan to keep that site as small as possible - hence I'm tryng to
find how to "outsource" docs' keeping and searching tasks to Google Docs and
Google API...

BTW, for C# code solutions for full text searaching and many other tasks for
local large docs base there exists a powerful free code solution I have
occasionally get at yesterday:

http://www.searcharoo.net/

Although I didn't use it, no I plan to use it in the near future...

Thank you.

--
Shamil
 
-----Original Message-----
From: dba-vb-bounces at databaseadvisors.com
[mailto:dba-vb-bounces at databaseadvisors.com] On Behalf Of Mark Breen
Sent: 4 ??????? 2011 ?. 12:10
To: Discussion concerning Visual Basic and related programming issues.
Subject: Re: [dba-VB] "Smart" fulltext search on large
documents(data)base...

Hello Shamil

Just FYI, I happened to look yesterday at the DNN Keyword search tables,

you may find them interesting also.  I was surprised to see such "heavy
lifting" way of providing search.

Mark


On 4 February 2011 02:38, Shamil Salakhetdinov
<shamil at smsconsulting.spb.ru>wrote:

> Hi Doug,
>
> Yes, search google docs docs can be automated  - I have found and I 
> used http://code.google.com/p/google-gdata/
> That wasn't very quick to make it working but I have got it working 
> after all.
>
> My sample quick & dirty code, which worked for me was as the following:
>
>    string userName = "myTest at gmail.com";
>    string password = "mypassword";
>
>    List<Document> all = new List<Document>();
>    GDataCredentials credentials = new GDataCredentials(userName,
password);
>    DocumentsService service = new
> DocumentsService("GoogleDocumentsSample");
>
>    System.Console.WriteLine("Logging in...");
>
>    RequestSettings settings = new 
> RequestSettings("GoogleDocumentsSample",
> credentials);
>    settings.AutoPaging = true;
>    settings.PageSize = 100;
>    if (settings != null)
>    {
>        DocumentsRequest request = new DocumentsRequest(settings);
>        System.Console.WriteLine("Successfully logged in");
>
>        System.Console.WriteLine("Gettings docs...");
>
>        FeedQuery query = new FeedQuery();
>        query.Uri = new Uri( request.BaseUri);
>        query.Query = "quick brown fox";
>
>        Feed<Document> feed = request.Get<Document>(query);
>
>
>        // this takes care of paging the results in
>        System.Console.WriteLine("Collecting docs info...");
>
>        int index = 1;
>        foreach (Document entry in feed.Entries)
>        {
>            System.Console.WriteLine("{0}. {1}", index, entry.Title);
>            all.Add(entry);
>            index++;
>        }
>
>        System.Console.WriteLine("\n *** Docs collected - processing 
> them***\n");
>
>        index = 1;
>        foreach (Document doc in all)
>        {
>        // just listing collected docs...
>            System.Console.WriteLine("{0}. {1}", index, doc.Title);
>            index++;
>        }
>    }
>    else
>    {
>        System.Console.WriteLine("Login failed.");
>    }
> }
>
>
> Uploading docs to GoogleDocs can be also automated usinhg the same C# lib.
>
> It's funny one can also use GoogleDocs engine as a document formats 
> convertor e.g. txt -> pdf, or txt -> doc etc. - just upload one 
> format, and download another one...
>
>
> Thank you.
>
> --
> Shamil
>
> -----Original Message-----
> From: dba-vb-bounces at databaseadvisors.com
> [mailto:dba-vb-bounces at databaseadvisors.com] On Behalf Of Doug Murphy
> Sent: 22 ?????? 2011 ?. 20:56
> To: 'Discussion concerning Visual Basic and related programming issues.'
> Subject: Re: [dba-VB] "Smart" fulltext search on large 
> documents(data)base...
>
> Shamil,
>
> Very innovative approach. Good use of the low cost and high power 
> offered by the "Cloud" services. I'll be interested in how this comes 
> out.
>
> Doug
>
> -----Original Message-----
> From: dba-vb-bounces at databaseadvisors.com
> [mailto:dba-vb-bounces at databaseadvisors.com] On Behalf Of Shamil 
> Salakhetdinov
> Sent: Friday, January 21, 2011 8:34 AM
> To: 'Discussion concerning Visual Basic and related programming issues.'
> Subject: [dba-VB] "Smart" fulltext search on large documents (data)base...
>
> Hi All --
>
> I have a task to implement a system providing "smart" fulltext search 
> over a large base of text documents.
> My current plan is to use Google Docs.
>
> I plan to get in the future 80 GB ($20.00 USD per year) hosted space 
> on GoogleDocs, put all the subject docs there, and then use Google API 
> to search via my documents base.
>
> That seems to be it?
>
> It should be even possible to create a simple (free?) Google Web Site 
> as front-end to that GoogleDocs documents base?
>
> That GoogleDocs base/site is planned to be used by non-profit
organization.
>
> Am I missing something?
> Additional overhead costs to keep that solution's stuff on Google site?
>
> And why I'm writing about that solution here in dba-VBA? - because I 
> plan to implement a front-end to that application system as an 
> WinForms application coomunicating with Windows API...
>
> Thank you.
>
> --
> Shamil
>
>
> _______________________________________________
> dba-VB mailing list
> dba-VB at databaseadvisors.com
> http://databaseadvisors.com/mailman/listinfo/dba-vb
> http://www.databaseadvisors.com
>
>
_______________________________________________
dba-VB mailing list
dba-VB at databaseadvisors.com
http://databaseadvisors.com/mailman/listinfo/dba-vb
http://www.databaseadvisors.com




More information about the dba-VB mailing list