jwcolby
jwcolby at colbyconsulting.com
Mon Jun 21 21:03:17 CDT 2010
Drew,

> First, you said you split your RAID volume into 2 partitions due to a Windows limitation... what limitation is that? FAT 32 is limited to 2 terrabytes, but NTFS can have WAY more than that. We have several servers here running Windows 2003, (standard and enterprise mixed) which have 5+ terrbytes.

When I was studying the issue, my take was that volumes larger than 2 terabytes required something funky - as I recall you are past what a plain MBR partition table can address at that point, so it has to be GPT or dynamic disks - and that caused extreme performance issues. Yeah, it can be done, and yeah, you don't want to do that. So I didn't do that.

> I am specifically telling you to stay away from RAID 6 (and RAID 5). Either go with RAID 0 (with no redundancy) or a RAID 10.

When I was studying the issue, RAID 6 allowed multiple drive failures. RAID 6 is also one of the fastest at streaming READ, simply because it is pulling data off of so many drives simultaneously. Making the assumption (yeah, I know) that once it found the data, the data would be contiguous, high streaming read is good. RAID 6 and RAID 5 are approximately the same speed reading.

I had a very limited budget. Back when I was doing this, each terabyte drive (consumer grade) was around $140 or $150. I managed to scrape together the money for 10 of them, and basically what RAID 6 allowed was to keep the "overhead" of the parity information down to two drives out of the ten while still surviving two drive failures. I did what I did strictly because of budget.

Both RAID 6 and RAID 5 are significantly faster than RAID 1 and can approach RAID 0 FOR READS. Not for writes of course, because of the parity calculations. Basically RAID 1 is no faster than the individual drive; it is essentially just whatever your individual drive specs are. RAID 0, 5 or 6 read stripes of data off of each drive in the set, and my understanding was that in the case of RAID 5 and RAID 6 you even got the advantage of the data coming off of the parity drives. IOW, a two-drive RAID 0 array reads data from two drives simultaneously, therefore 2X the individual drive specs in STREAMING READ. As I understood it, a three-drive RAID 5 actually read data off of all three drives, so you got 3X the individual drive speed. Etc. (There is a little back-of-the-envelope sketch of this math a bit further down.) Writes are another matter of course, but remember that I do not write - AT ALL! OK, I write to log files, but not back to the data disks.

> Where I am confused about SQL Server, is that while I know it uses whatever memory it can take, I didn't think it runs strictly off of memory, whether you are simply reading or not. IE, if you access xx number of records, sure, it would cache what it pulls, but I don't think it pulls the entire DB into memory from the get go.

I think you are correct. However I also think (and it is a big black box really, so there is a lot of "if I were doing this") that if you are reading data from two tables, joined on a single field, with where clauses on 4 or 5 other fields from one or both of those tables, SQL Server is going to be pulling as much as it can into memory to work the joins, filter the records, etc. So compare having 13 gigs of RAM available with 40 gigs of index data that needs processing... vs... having 60 gigs of RAM available with the same 40 gigs of index data needing processing. I have to believe that, all other things being equal, 60 gigs of available memory will trump 13 gigs. Maybe not, but I think so. No matter how you slice it, having 4 times as much RAM for SQL Server to cache and process the stuff it is working with is going to go a long way.
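Going back to the RAID read speeds for a second, here is the back-of-the-envelope math I was doing, in script form. This is purely my own illustration of the best-case theory above - the 100 MB/s per-drive figure is just an assumed number for a consumer terabyte drive, and a real controller, the stripe size and how contiguous the data actually is will all change the picture:

# Best-case streaming READ scaling for the RAID levels discussed above.
# PER_DRIVE_MBS is an assumption - plug in your own drive's sequential
# read speed.

PER_DRIVE_MBS = 100

def streaming_read_mbs(level, drives):
    """Theoretical streaming-read speed of an array of `drives` disks."""
    if level == "RAID 0":
        return drives * PER_DRIVE_MBS         # every drive hands back data
    if level == "RAID 1":
        return PER_DRIVE_MBS                  # essentially one drive's speed
    if level == "RAID 10":
        return (drives // 2) * PER_DRIVE_MBS  # one drive per mirrored pair
                                              # (a smart controller may do better)
    if level in ("RAID 5", "RAID 6"):
        return drives * PER_DRIVE_MBS         # reads come off every member,
                                              # parity rotation included
    raise ValueError(level)

for level, drives in [("RAID 0", 2), ("RAID 1", 2), ("RAID 5", 3),
                      ("RAID 6", 10), ("RAID 10", 10)]:
    print(f"{level} with {drives} drives: roughly "
          f"{streaming_read_mbs(level, drives)} MB/s streaming read")

Writes are a different story for RAID 5/6 because of the parity penalty, which is exactly why I only care about the read side of that math.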
> I would assume (and you know what happens when we assume things! LOL), that if you are truly treating a db as read only, then the disks are only going to be accessed for new information.

Well, what does that mean? Let's use an analogy. Assume I have two hundred feet of board. On this board a random number (limited to 10,000) is painted every inch. I have another 50 foot section of board, again painted with a random number (limited to 10,000) every inch. You are given a window within which you can lay the boards and slide them back and forth, trying to match the numbers on one board with the numbers on the other board. The window is 10 feet wide. How much sliding of the boards will have to occur? Now get a window 40 feet wide. How much sliding has to occur? (There is a toy simulation of this at the end of this message.)

There is no such thing as "only accessing new information". The server does not have enough memory to process both entire indexes, so it pulls a tiny piece of one index and a tiny piece of the other, then another tiny piece, etc. It is sliding the pieces in and out of memory trying to find the matches.

> So here's a question for you, have you ever thought about splitting the load between different machines on a gigabit network?

Yes, I have, but not seriously. And now, when I can get a motherboard and 16 cores for $1000 (plus memory), it makes even less sense. If I had fiber between the machines, maybe, but gigabit is really pretty slow, especially when a good chunk of the bandwidth is protocol overhead.

> I have noted many of the performance issues you've mentioned in the past, and many of them are functionally identical to when a RAID goes beserk, or can't handle the load.

LOL. Yeah, I/O limitations in the disk subsystem. It is a known issue, and I have it, and I know I have it. However, just attacking the disk system will only get me so far. Let's assume I can triple the disk speed. If my queries are taking a half hour, I am down to 10 minutes. That's good, but not good enough.
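Before I sign off, here is a toy version of the board analogy in script form, just to show why the size of the window matters so much. This is strictly my mental model - a crude chunk-at-a-time match - not a claim about how SQL Server actually implements its joins, and all of the sizes simply mirror the boards above:

from collections import Counter
import random

random.seed(1)  # reproducible boards

INCHES_PER_FOOT = 12
LONG_BOARD  = [random.randint(1, 10000) for _ in range(200 * INCHES_PER_FOOT)]  # 200 ft, a number per inch
SHORT_BOARD = [random.randint(1, 10000) for _ in range(50 * INCHES_PER_FOOT)]   # 50 ft, a number per inch

def slide_and_match(window_feet):
    """Match every inch of the short board against every inch of the long board
    when only window_feet of board fits in the window at once (half for each board).
    Returns (number of chunk loads, number of matching inch pairs)."""
    half = (window_feet * INCHES_PER_FOOT) // 2
    loads = 0
    matches = 0
    for s in range(0, len(SHORT_BOARD), half):
        short_counts = Counter(SHORT_BOARD[s:s + half])   # load a piece of the short board
        loads += 1
        for p in range(0, len(LONG_BOARD), half):         # slide the long board past it
            loads += 1
            matches += sum(short_counts[v] for v in LONG_BOARD[p:p + half])
    return loads, matches

for feet in (10, 40):
    loads, matches = slide_and_match(feet)
    print(f"{feet} ft window: {loads} chunk loads to find {matches} matching pairs")

With the 10 foot window it takes a few hundred chunk loads to find exactly the same matches the 40 foot window finds in a few dozen. Swap "chunk load" for "read from the disk subsystem" and that is my whole memory argument.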
John W. Colby
www.ColbyConsulting.com

Drew Wutka wrote:
> Hmmm, couple of questions. But before I begin, please understand that I
> work with SQL Server, but nowhere near as in depth as you apparently
> are. So PLEASE correct me if I am wrong, and I would appreciate any
> links/info you have to share on it. I do understand quite a bit about
> hardware though, so that's why I am still tossing my two cents in here.
>
> First, you said you split your RAID volume into 2 partitions due to a
> Windows limitation... what limitation is that? FAT 32 is limited to 2
> terrabytes, but NTFS can have WAY more than that. We have several
> servers here running Windows 2003, (standard and enterprise mixed) which
> have 5+ terrbytes.
>
> Next, I am not saying 'get faster disks'. LOL, I really am trying to
> just be helpful here. In fact, right now, I'm in process of getting two
> new servers for our systems, so I know how expensive things can be! LOL.
> I am specifically telling you to stay away from RAID 6 (and RAID 5).
> Either go with RAID 0 (with no redundancy) or a RAID 10. I certainly
> understand the cost involved, and I'm not trying to say 'tough, do this
> or die!' LOL. I am simply pointing out that while you are looking at
> memory and processor specs, factor in a drive set that will compliment
> them.
>
> Where I am confused about SQL Server, is that while I know it uses
> whatever memory it can take, I didn't think it runs strictly off of
> memory, whether you are simply reading or not. IE, if you access xx
> number of records, sure, it would cache what it pulls, but I don't think
> it pulls the entire DB into memory from the get go. That sounds more
> like the 'new' dbs that have been talked about, which do run completely
> in memory.
>
> I would assume (and you know what happens when we assume things! LOL),
> that if you are truly treating a db as read only, then the disks are
> only going to be accessed for new information. But ANY write backs,
> would cause multiple memory <--> disk operations. Memory is always
> going to be faster than disk, so the quicker the disk operations are,
> the less bottlenecking you'll face.
>
> So here's a question for you, have you ever thought about splitting the
> load between different machines on a gigabit network?
>
> Again, sorry if I seem clunky in my thoughts here, I've perused your
> threads about this stuff, but have too much going on, on my own, to keep
> too much detail of it in my head.
>
> I have noted many of the performance issues you've mentioned in the
> past, and many of them are functionally identical to when a RAID goes
> beserk, or can't handle the load.
>
> Drew