Drew Wutka
DWUTKA at Marlow.com
Tue Jun 22 03:24:22 CDT 2010
Thanks for the detail. Clears some things up for me. You definitely have your work cut out for you. If you happen to run across that 2 terabyte issue, send it my way; this is the first I've heard about it on a Windows server.

As far as the RAID stuff goes, you do have some misinformation there. First, the lure of RAID 6, with its two-drive-failure redundancy, is a common misconception as to its usefulness. RAID 5 is a VERY common setup, but it's a setup SPECIFICALLY for maximizing disk space. For example, we have that for our main file server: a 5 terabyte volume, of which we have around 2 terabytes actively used. It's a file server, so the performance hit of RAID 5 is minimized by the network speed. If you have a 3 drive RAID 5, you don't necessarily pull data from all three drives at once. First, it depends on whether your RAID stripes by chunk size or by bits. If it's bits, then yes, you actually read an entire byte from two drives three times, and one drive two times. If it's based on chunk size, then it's possible that for a small read you only hit one drive. RAID 6 was touted as being an improvement, but from all the pros I have talked to, RAID 6 is the Pinto of RAID systems. On paper it sounds great; in practice it's horrible. Keep in mind, for the past few years this is what I have been involved with (not necessarily RAID drives, but servers and network administration).

A RAID 1 actually can perform better than a single drive, but only for a read. With a RAID 1, the RAID controller can read data from both drives simultaneously because it's the same data.

For your final reasoning, about the timing, I can't really sit here and say, JWC, your problems would be over with a RAID 10 replacing your RAID 6. What I CAN say is that it is QUITE possible that you will see a much bigger performance boost than just 3x. Think of it this way: if you were doing a straight read, like pulling 40 gigs from a drive to memory, in a proper RAID 10 with 3 to 4 striped mirrors, that 40 gigs would be read very quickly. Let's, for simple math, say that it takes 20 seconds in your RAID 6, and going to a RAID 10 gives you a 2x boost. Great, now you save 10 seconds. But that's in a straight data read, so defragmentation aside, how the data is being pulled from the drives in a straight data pull would be a small factor of improvement. Where the REAL boost comes in, the one you should really be concerned about, is that SQL Server isn't reading multiple files. It is reading parts of a file: a part here, a part there, an index pointing to another part. All of those are small individual seeks (no matter how much data gets pulled in the end). That is where a RAID 6 slows things down by a MUCH bigger factor. (There's a rough back-of-the-envelope sketch of this straight-read-versus-seeks arithmetic tacked onto the end of this post.)

Ok, it's 3 in the morning, was up finishing up a system (with a RAID 0+1... LOL), and decided to holler back. Good luck with your project, and let me know when you plan to scrap one of these systems, I may buy it from ya! ;)

Drew

-----Original Message-----
From: accessd-bounces at databaseadvisors.com [mailto:accessd-bounces at databaseadvisors.com] On Behalf Of jwcolby
Sent: Monday, June 21, 2010 9:03 PM
To: Access Developers discussion and problem solving
Subject: Re: [AccessD] Needs analysis

Drew,

> First, you said you split your RAID volume into 2 partitions due to a Windows limitation... what limitation is that? FAT32 is limited to 2 terabytes, but NTFS can have WAY more than that. We have several servers here running Windows 2003 (standard and enterprise mixed) which have 5+ terabytes.
When I was studying the issue, my take was that drives larger than 2 terabytes used something funky that caused extreme performance issues. Yea, it can be done, and yea, you don't want to do that. So I didn't do that.

> I am specifically telling you to stay away from RAID 6 (and RAID 5). Either go with RAID 0 (with no redundancy) or a RAID 10.

When I was studying the issue, RAID 6 allowed multiple drive failures. RAID 6 is also one of the fastest at streaming READ, simply because it is pulling data off of so many drives simultaneously. Making the assumption (yea, I know) that once it found data, the data would be contiguous, high streaming read is good. RAID 6 and RAID 5 are approximately the same speed reading. I had a very limited budget. Back when I was doing this, each terabyte drive (consumer grade) was around $140 or $150. I managed to scrape together the money for 10 of them, but basically what RAID 6 allowed was to minimize the "overhead" of the RAID parity information. I did what I did strictly because of budget.

Both RAID 6 and RAID 5 are significantly faster than RAID 1 and can approach RAID 0 FOR READS. Not for writes of course, because of the parity calculations. Basically RAID 1 is no faster than the individual drive; it is essentially just whatever your individual drive specs are. RAID 0, 5 or 6 read stripes of data off of each drive in the set. My understanding was that in the case of RAID 5 and RAID 6 you even got the advantage of the data coming off of the parity drives. IOW, a two-drive RAID 0 array reads data from two drives simultaneously, therefore 2X the individual drive specs in STREAMING READ. As I understood it, RAID 5 actually read data off of all three drives, so you got 3X the individual drive speed. Etc. Writes are another matter of course, but remember that I do not write - AT ALL!!! OK, I write to log files, but not back to the data disks.

> Where I am confused about SQL Server, is that while I know it uses whatever memory it can take, I didn't think it runs strictly off of memory, whether you are simply reading or not. IE, if you access xx number of records, sure, it would cache what it pulls, but I don't think it pulls the entire DB into memory from the get go.

I think you are correct. However I also think (and it is a big black box really, so there is a lot of "if I were doing this") that if you are reading data from two tables, joined on a single field, with where clauses on 4 or 5 other fields from one or both of those two tables... SQL Server is going to be pulling as much as it can to work with the joins, to filter the records etc. So if you have 13 gigs of RAM available and it has 40 gigs of index data that needs processing... vs... you have 60 gigs of RAM available and 40 gigs of index data needing processing. I have to believe that, all other things being equal, 60 gigs of available memory will trump 13 gigs. Maybe not, but I think so. No matter how you slice it, having 4 times as much RAM for SQL Server to cache and process the stuff it is working with is going to go a long way.

> I would assume (and you know what happens when we assume things! LOL), that if you are truly treating a db as read only, then the disks are only going to be accessed for new information.

Well, what does that mean? Let's use an analogy. Assume I have two hundred feet of board. On this board is painted a random number (limited to 10,000) every inch. I have another 50 foot section of board, again painted with a random number every inch (limited to 10,000).
You are given a window within which you can lay the boards and slide them back and forth, trying to match the numbers on one board with the numbers on the other board. The window is 10 feet wide. How much sliding of the boards will have to occur? Now get a window 40 feet wide. How much sliding has to occur? There is no such thing as "only accessing new information." The server does not have enough memory to process both entire indexes, so it pulls a tiny piece of one index and a tiny piece of another, then another tiny piece, etc. It is sliding the pieces in and out of memory trying to find the matches. (A toy sketch of this window effect is also tacked onto the end of this post.)

> So here's a question for you, have you ever thought about splitting the load between different machines on a gigabit network?

Yes, I have, but not seriously. And now, when I can get a motherboard and 16 cores for $1000 (plus memory), it makes even less sense. If I had fiber between the machines, maybe, but gigabit is really pretty slow, especially when a good chunk is protocol.

> I have noted many of the performance issues you've mentioned in the past, and many of them are functionally identical to when a RAID goes berserk, or can't handle the load.

LOL. Yea, I/O limitations in the disk subsystem. It is a known issue, and I have it, and I know I have it. However, just attacking the disk system will only get me so far. Let's assume I can triple the disk speed. If my queries are taking a half hour, I am down to 10 minutes. That's good but not good enough.

John W. Colby
www.ColbyConsulting.com

Drew Wutka wrote:
> Hmmm, couple of questions. But before I begin, please understand that I work with SQL Server, but nowhere near as in depth as you apparently are. So PLEASE correct me if I am wrong, and I would appreciate any links/info you have to share on it. I do understand quite a bit about hardware though, so that's why I am still tossing my two cents in here.
>
> First, you said you split your RAID volume into 2 partitions due to a Windows limitation... what limitation is that? FAT32 is limited to 2 terabytes, but NTFS can have WAY more than that. We have several servers here running Windows 2003 (standard and enterprise mixed) which have 5+ terabytes.
>
> Next, I am not saying 'get faster disks'. LOL, I really am trying to just be helpful here. In fact, right now, I'm in process of getting two new servers for our systems, so I know how expensive things can be! LOL. I am specifically telling you to stay away from RAID 6 (and RAID 5). Either go with RAID 0 (with no redundancy) or a RAID 10. I certainly understand the cost involved, and I'm not trying to say 'tough, do this or die!' LOL. I am simply pointing out that while you are looking at memory and processor specs, factor in a drive set that will complement them.
>
> Where I am confused about SQL Server, is that while I know it uses whatever memory it can take, I didn't think it runs strictly off of memory, whether you are simply reading or not. IE, if you access xx number of records, sure, it would cache what it pulls, but I don't think it pulls the entire DB into memory from the get go. That sounds more like the 'new' dbs that have been talked about, which do run completely in memory.
>
> I would assume (and you know what happens when we assume things! LOL), that if you are truly treating a db as read only, then the disks are only going to be accessed for new information. But ANY write-backs would cause multiple memory <--> disk operations.
> Memory is always going to be faster than disk, so the quicker the disk operations are, the less bottlenecking you'll face.
>
> So here's a question for you, have you ever thought about splitting the load between different machines on a gigabit network?
>
> Again, sorry if I seem clunky in my thoughts here, I've perused your threads about this stuff, but have too much going on, on my own, to keep too much detail of it in my head.
>
> I have noted many of the performance issues you've mentioned in the past, and many of them are functionally identical to when a RAID goes berserk, or can't handle the load.
>
> Drew

--
AccessD mailing list
AccessD at databaseadvisors.com
http://databaseadvisors.com/mailman/listinfo/accessd
Website: http://www.databaseadvisors.com
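A rough back-of-the-envelope sketch of the straight-read-versus-seeks arithmetic from Drew's reply above. Every number in it (the 2 GB/s aggregate throughput, the 64 KB piece size, the 8 ms average seek, the spindle counts) is an assumption picked purely for illustration, not a measurement of either array; the only point is that when the same 40 gigs comes off the disks as many small scattered reads, seek handling dominates and streaming bandwidth barely matters.

# Toy model of Drew's point: the access pattern matters more than streaming speed.
# All numbers here are illustrative assumptions, not measurements of either array.

GB = 1024 ** 3

def straight_read_secs(size_bytes, mb_per_sec):
    """Time to stream size_bytes sequentially at an assumed aggregate throughput."""
    return size_bytes / (mb_per_sec * 1024 ** 2)

def seeky_read_secs(size_bytes, piece_bytes, mb_per_sec, seek_ms, spindles):
    """Time to fetch the same data as small scattered pieces, paying one average
    seek per piece, with the seeks spread across independent spindles."""
    pieces = size_bytes / piece_bytes
    seek_secs = pieces * (seek_ms / 1000) / spindles
    return seek_secs + straight_read_secs(size_bytes, mb_per_sec)

forty_gb = 40 * GB

# Drew's simple-math example: a 40 GB straight read, then the same read 2x faster.
print(f"straight read:     {straight_read_secs(forty_gb, 2000):6.0f} s")
print(f"straight read, 2x: {straight_read_secs(forty_gb, 4000):6.0f} s")

# The same 40 GB pulled as scattered 64 KB index/page pieces, ~8 ms per seek.
print(f"seeky, 4 spindles: {seeky_read_secs(forty_gb, 64 * 1024, 2000, 8, 4):6.0f} s")
print(f"seeky, 8 spindles: {seeky_read_secs(forty_gb, 64 * 1024, 2000, 8, 8):6.0f} s")

With these made-up numbers the straight read takes about 20 seconds and a 2x boost saves about 10 of them, while the same 40 GB fetched as scattered pieces lands in the 10-to-20-minute range, and doubling the spindles that can serve independent seeks roughly halves that dominant term.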
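And a toy version of John's board-and-window analogy: treat the window as the memory available for the join and count how much board has to slide past it. This is a block-nested-loop style simplification with assumed sizes (one number per inch, the long board buffered in the window, the short board sliding past once per buffered chunk); real SQL Server join processing is far more involved, so take it only as a picture of the effect.

import math

# Toy model of the board-and-window analogy: the window (memory) holds a chunk
# of the 200 ft board; the 50 ft board has to slide past once for every chunk.
def entries_slid(long_board, short_board, window):
    chunks = math.ceil(long_board / window)      # how many times the window is refilled
    return long_board + chunks * short_board     # total entries pulled through the window

LONG, SHORT = 200 * 12, 50 * 12                  # one painted number per inch

for window_ft in (10, 40, 200):
    reads = entries_slid(LONG, SHORT, window_ft * 12)
    print(f"{window_ft:>3} ft window -> {reads:,} entries slid through")

With the assumed sizes, a 10 ft window drags roughly 14,000 entries through, a 40 ft window about 5,400, and a window big enough for the whole long board only about 3,000 - the same shape as the 13 gigs versus 60 gigs of RAM comparison: the less of the indexes that fits in memory at once, the more times the same data has to be pulled back in off disk.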