[AccessD] Needs analysis

Drew Wutka DWUTKA at Marlow.com
Tue Jun 22 10:45:11 CDT 2010


Hmmm, our main file server and a backup server we have both have Windows
2003, with more then 2 terrabytes.  Both are 32 bit, and operate Windows
2003.

Drew

-----Original Message-----
From: accessd-bounces at databaseadvisors.com
[mailto:accessd-bounces at databaseadvisors.com] On Behalf Of Lembit Soobik
Sent: Tuesday, June 22, 2010 5:42 AM
To: Access Developers discussion and problem solving
Subject: Re: [AccessD] Needs analysis

> If you happen to run across that 2 Terrabyte issue, send it my way,
this
> is the first I've heard about it on a Windows server.

Drew, John,
I just touched this issue when I configured my new workstation for video

editing:
I use two 1.5 TB disks for a 3 TB RAID 0.
the problem is the Master Boot Record MBR can only handle 2 TB because
it 
uses the LBA addressing (4 Byte)
to address sectors. with 512 Byte sectors as in XP you get 2 TB.
The newer operating systems use GPT (GUID Partition Table) instead.
this is for WinXP 64 bit SP1, Win 2003 64 bit SP1, Win 2008, Vista, Win
7.
Since my new WS has Win 7, 64 bit, I dont have a problem.

So you might - depending on your OS - not have any problem at all.

Lembit

----- Original Message ----- 
From: "Drew Wutka" <DWUTKA at marlow.com>
To: "Access Developers discussion and problem solving" 
<accessd at databaseadvisors.com>
Sent: Tuesday, June 22, 2010 10:24 AM
Subject: Re: [AccessD] Needs analysis


> Thanks for the detail.  Clears some things up for me. You've
definitely
> have your work cut out for you.
>
> If you happen to run across that 2 Terrabyte issue, send it my way,
this
> is the first I've heard about it on a Windows server.
>
> As far as the RAID stuff goes, you do have some misinformation there.
> First, the lure of RAID 6, with having 2 drive failure redundancy is a
> common misnomer as to it's usefulness. RAID 5 is a VERY common setup,
> but it's a setup SPECIFICALLY for maximizing disk space.  For example,
> we have that for our main file server.  A 5 terrabyte volume, of which
> we have around 2 Terrabytes actively used.  It's a file server, so the
> performance hit of RAID 5 is minimized by the network speed.
>
> If you have a 3 drive RAID 5, you don't necessarily pull data from all
> three drives at once.  First, it depends if your RAID is based on
chunk
> size or bits.  If it's bits, then yes, you actually read an entire
byte
> from 2 drives three times, and one drive two times.  If it's based on
> chunk size, then it's possible that for a small read, you only hit one
> drive.  RAID 6 was touted as being an improvement, but from all the
pros
> I have talked to, RAID 6 is the Pinto of RAID systems.  On paper it
> sounds great, in practice, it's horrible.  Keep in mind, for the past
> few years, this is what I have been involved with. (Not necessarily
raid
> drives, but servers and network administration)
>
> A RAID 1 actually can perform better then a single drive, but only for
a
> read.  With a RAID 1, the raid controller can read data from both
drives
> simultaneously because it's the same data.
>
> For your final reasoning, about the timing, I can't really sit here
and
> say, JWC, you're problems would be over with a RAID 10 replacing your
> RAID 6.  What I CAN say, is that it is QUITE possible that you will
see
> a much bigger performance boost then just a 3x.
>
> Think of it this way, if  you were doing a straight read, like pulling
> 40 gigs from a drive to memory, first, in a proper RAID 10, with 3 to
4
> striped mirrors, that 40 gigs would be read very quickly.  Let's, for
> simple math, say that it takes 20 seconds in your RAID 6.  And going
to
> a RAID 10 gives you a 2x boost.  Great, now you save 10 seconds.  But
> that's in a straight data read, so defragmentation aside,  the actual
> performance of how the data is being pulled from the drives in a
> straight data pull would be a small factor of improvement.  Where the
> REAL boost that you should really be concerned about is that SQL
Server
> isn't reading multiple files.  It is reading parts of a file, a part
> here, a part there, an index is pointing to another part.  All of
those
> are small individual seeks (no matter how much data gets pulled in the
> end).  That is where a RAID 6 slows things down in a MUCH higher
factor.
>
> Ok, it's 3 in the morning, was up finishing up a system (with a RAID 0
> +1...LOL), and decided to holler back.
>
> Good luck with your project, let me know when you plan to scrap one of
> these systems, I may buy it from ya! ;)
>
> Drew
>
> -----Original Message-----
> From: accessd-bounces at databaseadvisors.com
> [mailto:accessd-bounces at databaseadvisors.com] On Behalf Of jwcolby
> Sent: Monday, June 21, 2010 9:03 PM
> To: Access Developers discussion and problem solving
> Subject: Re: [AccessD] Needs analysis
>
> Drew,
>
> > First, you said you split your RAID volume into 2 partitions due to
a
> Windows limitation... what
> limitation is that?  FAT 32 is limited to 2 terrabytes, but NTFS can
> have WAY more than that.  We
> have several servers here running Windows 2003, (standard and
enterprise
> mixed) which have 5+ terrbytes.
>
> When I was studying the issue, my take was that drives larger than 2
> terabytes used something funky
> that caused extreme performance issues.  Yea, it can be done, and yea,
> you don't want to do that.
> So I didn't do that.
>
> > I am specifically telling you to stay away from RAID 6 (and RAID 5).
> Either go with RAID 0 (with
> no redundancy) or a RAID 10.
>
> When I was studying the issue, RAID 6 allowed multiple drive failures.
> Raid 6 is also one of the
> fastest at streaming READ, simply because it is pulling data off of so
> many drives simultaneously.
> Making the assumption (yea, I know) that once it found data, the data
> would be contiguous, high
> streaming read is good.  Raid 6 and Raid 5 are approximately the same
> speed reading.
>
> I had a very limited budget.  Back when I was doing this, each
terabyte
> drive (consumer grade) was
> around $140 or $150.  I managed to scrape together the money for 10 of
> them but basically what Raid
> 6 allowed was to minimize the "overhead" of the raid parity
information.
> I did what I did strictly
> because of budget.  Both raid 6 and raid 5 are significantly faster
than
> raid 1 and can approach
> raid 0 FOR READS.  Not for writes of course because of the parity
> calculations.
>
> Basically Raid1 is no faster than the individual drive.  It is
> essentially just whatever your
> individual drive specs are.  Raid 0, 5 or 6 read stripes of data off
of
> each drive in the set.  My
> understanding wast that in the case of Raid5 and Raid 6 you even got
the
> advantage of the data
> coming off of the parity drives.  IOW, raid 0 2 drive array reads data
> from two drives
> simultaneously, therefore 2X the individual drive specs in STREAMING
> READ.  As I understood it, Raid
> 5 actually read data off of all three drives so you got 3X the
> individual drive speed.  Etc.
>
> Writes are another matter of course, but remember that I do not write
-
> AT ALL!!!.  OK, I write to
> log files, but not back to the data disks.
>
> > Where I am confused about SQL Server, is that while I know it uses
> whatever memory it can take, I
> didn't think it runs strictly off of memory, whether you are simply
> reading or not.  IE, if you
> access xx number of records, sure, it would cache what it pulls, but I
> don't think it pulls the
> entire DB into memory from the get go.
>
> I think you are correct.  However I also think (and it is a big black
> box really, so there is a lof
> of "if I were doing this") that if you are reading data about two
> tables, joined on a single field,
> with where clauses on 4 or 5 other fields from one or both of those
two
> tables...
>
> SQL Server is going to be pulling as much as it can to work with the
> joins, to filter the records etc.
>
> So if you have 13 gigs of RAM available and it has 40 gigs of index
data
> that needs processing...
>
> vs... you have 60 gigs of RAM available and 40 gigs of index data
> needing processing.
>
> I have to believe that all other things being equal, 60 gigs available
> memory will trump 13 gigs
> available.  Maybe not, but I think so.  No matter how you slice it,
> having 4 times as much ram for
> SQL Server to cache and process the stuff it is working with is going
to
> go a long way.
>
> > I would assume (and you know what happens when we assume things!
> LOL), that if you are truly
> treating a db as read only, then the disks are only going to be
accessed
> for new information.
>
> Well, what does that mean?  Let's use an analogy.
>
> Assume I have two hundred feet of board.  On this board is painted a
> random number (limited to
> 10000) every inch.  I have another 50 foot section of board.  Again,
> painted with a random number
> every inch (limited to 10000).
>
> You are given a window within which you can lay the boards and slide
> them back and forth trying to
> match the numbers on one board with the numbers on the other board.
The
> window is 10 feet wide.
> How much sliding of the boards will have to occur?
>
> Now get a window 40 feet wide.  How much sliding has to occur?
>
> There is no such thing as "only accessing new information".  The
server
> does not have enough memory
> to process both entire indexes so it pulls a tiny piece of one index
and
> a tiny piece of another,
> then another tiny piece etc.  It is sliding the pieces in and out of
> memory trying to find the matches.
>
> > So here's a question for you, have you ever thought about splitting
> the load between different
> machines on a gigabit network?
>
> Yes, I have but not seriously.  And now, when I can get a motherboard
>and 16 cores for $1000 (plus
> memory) it makes even less sense.  If I had fiber between the machines
> maybe but gigabit is really
> pretty slow, especially when a good chunk is protocol.
>
> > I have noted many of the performance issues you've mentioned in the
> past, and many of them are
> functionally identical to when a RAID goes beserk, or can't handle the
> load.
>
> LOL.  Yea, I/O limitations in the disk subsystem.  It is a known
issue,
> and I have it, and I know I
> have it.  However just attacking the disk system will only get me so
> far.  Let's assume I can triple
> the disk speed.  If my queries are taking a half hour I am down to 10
> minutes.  That's good but not
> good enough.
>
> John W. Colby
> www.ColbyConsulting.com
>
>
> Drew Wutka wrote:
>> Hmmm, couple of questions.  But before I begin, please understand
that
> I
>> work with SQL Server, but nowhere near as in depth as you apparently
>> are.  So PLEASE correct me if I am wrong, and I would appreciate any
>> links/info you have to share on it.  I do understand quite a bit
about
>> hardware though, so that's why I am still tossing my two cents in
> here.
>>
>> First, you said you split your RAID volume into 2 partitions due to a
>> Windows limitation... what limitation is that?  FAT 32 is limited to
2
>> terrabytes, but NTFS can have WAY more than that.  We have several
>> servers here running Windows 2003, (standard and enterprise mixed)
> which
>> have 5+ terrbytes.
>>
>> Next, I am not saying 'get faster disks'.  LOL, I really am trying to
>> just be helpful here.  In fact, right now, I'm in process of getting
> two
>> new servers for our systems, so I know how expensive things can be!
> LOL.
>> I am specifically telling you to stay away from RAID 6 (and RAID 5).
>> Either go with RAID 0 (with no redundancy) or a RAID 10.  I certainly
>> understand the cost involved, and I'm not trying to say 'tough, do
> this
>> or die!' LOL.  I am simply pointing out that while you are looking at
>> memory and processor specs, factor in a drive set that will
compliment
>> them.
>>
>> Where I am confused about SQL Server, is that while I know it uses
>> whatever memory it can take, I didn't think it runs strictly off of
>> memory, whether you are simply reading or not.  IE, if you access xx
>> number of records, sure, it would cache what it pulls, but I don't
> think
>> it pulls the entire DB into memory from the get go.  That sounds more
>> like the 'new' dbs that have been talked about, which do run
> completely
>> in memory.
>>
>> I would assume (and you know what happens when we assume things!
LOL),
>> that if you are truly treating a db as read only, then the disks are
>> only going to be accessed for new information.  But ANY write backs,
>> would cause multiple memory <--> disk operations.  Memory is always
>> going to be faster than disk, so the quicker the disk operations are,
>> the less bottlenecking you'll face.
>>
>> So here's a question for you, have you ever thought about splitting
> the
>> load between different machines on a gigabit network?
>>
>> Again, sorry if I seem clunky in my thoughts here, I've perused your
>> threads about this stuff, but have too much going on, on my own, to
> keep
>> too much detail of it in my head.
>>
>> I have noted many of the performance issues you've mentioned in the
>> past, and many of them are functionally identical to when a RAID goes
>> beserk, or can't handle the load.
>>
>> Drew
>
> -- 
> AccessD mailing list
> AccessD at databaseadvisors.com
> http://databaseadvisors.com/mailman/listinfo/accessd
> Website: http://www.databaseadvisors.com
> The information contained in this transmission is intended only for
the 
> person or entity
> to which it is addressed and may contain II-VI Proprietary and/or
II-VI 
> Business
> Sensitive material. If you are not the intended recipient, please
contact 
> the sender
> immediately and destroy the material in its entirety, whether
electronic 
> or hard copy.
> You are notified that any review, retransmission, copying, disclosure,

> dissemination,
> or other use of, or taking of any action in reliance upon this
information 
> by persons
> or entities other than the intended recipient is prohibited.
>
>
> -- 
> AccessD mailing list
> AccessD at databaseadvisors.com
> http://databaseadvisors.com/mailman/listinfo/accessd
> Website: http://www.databaseadvisors.com 

-- 
AccessD mailing list
AccessD at databaseadvisors.com
http://databaseadvisors.com/mailman/listinfo/accessd
Website: http://www.databaseadvisors.com
The information contained in this transmission is intended only for the person or entity 
to which it is addressed and may contain II-VI Proprietary and/or II-VI Business 
Sensitive material. If you are not the intended recipient, please contact the sender 
immediately and destroy the material in its entirety, whether electronic or hard copy. 
You are notified that any review, retransmission, copying, disclosure, dissemination, 
or other use of, or taking of any action in reliance upon this information by persons 
or entities other than the intended recipient is prohibited.





More information about the AccessD mailing list