Gentoo Forums
RAID1 - why is read speed not improved?
boospy
Guru


Joined: 07 Feb 2010
Posts: 310
Location: Austria

Posted: Sun Dec 18, 2011 10:48 pm    Post subject: RAID1 - why is read speed not improved?

I have a software RAID1 with two drives. Theoretically the read speed should be twice as fast, but it is not. Are there any special kernel options?

Greetings
boospy
jormartr
Apprentice


Joined: 02 Jan 2008
Posts: 174

Posted: Mon Dec 19, 2011 10:41 am    Post subject:

That is how RAID1 works at the moment; there is no option to change it, as striped reads are not implemented.
frostschutz
Advocate


Joined: 22 Feb 2005
Posts: 2977
Location: Germany

Posted: Mon Dec 19, 2011 1:05 pm    Post subject:

There is a read speed improvement, but only for concurrent disk access. Try running two reading processes at the same time.
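A quick way to see the difference is to time one reader against two concurrent readers. This is only a sketch: the file paths are placeholders for large files stored on the array, and dropping caches requires root.

```shell
# Drop the page cache first so the reads actually hit the disks (needs root).
# /mnt/raid/file1 and /mnt/raid/file2 are hypothetical large files on the array.
sync; echo 3 > /proc/sys/vm/drop_caches

# Single reader: md serves this from one mirror member.
time dd if=/mnt/raid/file1 of=/dev/null bs=1M

# Two concurrent readers: md can serve each from a different member.
sync; echo 3 > /proc/sys/vm/drop_caches
time ( dd if=/mnt/raid/file1 of=/dev/null bs=1M & \
       dd if=/mnt/raid/file2 of=/dev/null bs=1M & \
       wait )
```

If the mirror is doing its job, the two-reader run should finish in roughly the time of the single-reader run, i.e. about twice the aggregate throughput.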
depontius
Advocate


Joined: 05 May 2004
Posts: 3526

Posted: Mon Dec 19, 2011 1:32 pm    Post subject:

To expand on that last statement just a little bit...

RAID1 doesn't have any code for striping, so in a simple situation it has no mechanism for reading from both disks and combining the data. Under normal circumstances, RAID1 only needs to read from one disk, consulting the second only if the first reports a failure; in other words, there's no need for any sort of "dynamic compare" between the two disks. That means that in single-threaded situations the second disk sits idle, but in multi-threaded situations it can be scheduled to serve reads for another thread.

Both disks must be scheduled together for any write.
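You can watch this behaviour directly. A sketch, assuming a Linux system where the array members appear in /proc/diskstats: snapshot each device's completed-read counter before and after a workload and compare.

```shell
# In /proc/diskstats, the first field after the device name is
# "reads completed". Print name and counter for every device.
# Take one snapshot before and one after your read workload and
# diff them: under a single-threaded read, typically only one
# mirror member's counter grows; with concurrent readers, both do.
awk '{ print $3, $4 }' /proc/diskstats
```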
_________________
.sigs waste space and bandwidth
krinn
Watchman


Joined: 02 May 2003
Posts: 7470

Posted: Mon Dec 19, 2011 3:56 pm    Post subject:

Technically you can get read improvements: as both disks are perfect mirrors, the same data can be found on both.

So instead of reading it from only one disk, reading from both disks would get you twice the throughput.

If it's not done, it's for software programming reasons, or a hardware limitation; using two controllers that each handle one disk would be easier to manage from the software side.

OP: you should dig through the mdadm help for multi-controller support; it might just be (lol) that you need two controllers to do that.
depontius
Advocate


Joined: 05 May 2004
Posts: 3526

Posted: Mon Dec 19, 2011 4:16 pm    Post subject:

I'm running RAID-1 mirrors on my servers, and use separate controllers. I simply knew I wanted the separate write paths, at the very least.
_________________
.sigs waste space and bandwidth
NeddySeagoon
Administrator


Joined: 05 Jul 2003
Posts: 54838
Location: 56N 3W

Posted: Mon Dec 19, 2011 9:41 pm    Post subject:

boospy,

How do you measure read speed - random reads or sequential reads?

The read speed is determined by two elements.
1. The time to position the read head over the right track, plus the time for the right sector to come under the head, so your data can be read. This is called latency.
2. The time it takes to move your data (once it's been located) onto or off of the platter. It's limited by the head/platter data rate. For small amounts of data, cache and read-ahead don't help much; for large amounts of data that don't fit in the drive cache, they don't help either.

In many real world situations, the latency is much larger than the time spent actually moving data. The kernel tries to minimise latency by positioning the heads over different parts of the drives in a mirror set and using the drive that can get your data with lowest latency.
Native Command Queuing is another trick aimed at reducing latency by reordering commands for minimum head movements. Drives do that with no help from the kernel, other than turning the feature on.

The answer you get depends on how you define 'data rate' in your question.
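A sketch of both measurements (device names are placeholders; `iflag=direct` needs a device or filesystem that supports O_DIRECT, and reading a raw device needs root):

```shell
# Sequential read of 1 GiB from the array, bypassing the page cache,
# so the number reflects the drives rather than RAM:
dd if=/dev/md0 of=/dev/null bs=1M count=1024 iflag=direct

# NCQ check on a member drive: a queue depth greater than 1 means
# the drive is allowed to reorder queued commands.
cat /sys/block/sda/device/queue_depth
```

Sequential dd numbers mostly exercise the head/platter data rate; a random-read benchmark on the same array will mostly exercise latency, which is why the two answers differ so much.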
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
mbar
Veteran


Joined: 19 Jan 2005
Posts: 1991
Location: Poland

Posted: Tue Dec 20, 2011 7:20 am    Post subject:

I'm quite sure there *is* a RAID1 layout that gives you double raw-data read speed. I can't find a source for it right now...

EDIT: found it on Wikipedia:

Quote:
Linux MD RAID 10

The Linux kernel software RAID driver (called md, for "multiple device") can be used to build a classic RAID 1+0 array, but also as a single level[8] with some interesting extensions.[9]

The standard "near" layout, where each chunk is repeated n times in a k-way stripe array, is equivalent to the standard RAID 10 arrangement, but it does not require that n evenly divide k. For example an n2 layout on 2, 3 and 4 drives would look like:

2 drives    3 drives      4 drives
--------    ----------    --------------
A1  A1      A1  A1  A2    A1  A1  A2  A2
A2  A2      A2  A3  A3    A3  A3  A4  A4
A3  A3      A4  A4  A5    A5  A5  A6  A6
A4  A4      A5  A6  A6    A7  A7  A8  A8
..  ..      ..  ..  ..    ..  ..  ..  ..

The 4-drive example is identical to a standard RAID-1+0 array, while the 3-drive example is a software implementation of RAID-1E. The 2-drive example is equivalent to RAID 1.

The driver also supports a "far" layout where all the drives are divided into f sections. All the chunks are repeated in each section but offset by one device. For example, f2 layouts on 2-, 3-, and 4-drive arrays would look like:

2 drives    3 drives      4 drives
--------    ----------    ------------------
A1  A2      A1  A2  A3    A1   A2   A3   A4
A3  A4      A4  A5  A6    A5   A6   A7   A8
A5  A6      A7  A8  A9    A9   A10  A11  A12
..  ..      ..  ..  ..    ..   ..   ..   ..
A2  A1      A3  A1  A2    A4   A1   A2   A3
A4  A3      A6  A4  A5    A8   A5   A6   A7
A6  A5      A9  A7  A8    A12  A9   A10  A11
..  ..      ..  ..  ..    ..   ..   ..   ..

This is designed for striping performance of a mirrored array; sequential reads can be striped, as in RAID-0, random reads are somewhat faster (maybe 10-20 % due to using the faster outer disk sectors, and smaller average seek times), and sequential and random writes offer about equal performance to other mirrored raids. The layout performs well for systems where reads are more frequent than writes, which is common. The first 1/f of each drive is a standard RAID-0 array. This offers striping performance on a mirrored set of only 2 drives.

The near and far options can be used together. The chunks in each section are offset by n device(s). For example n2 f2 layout stores 2×2 = 4 copies of each sector, so requires at least 4 drives:

4 drives          5 drives
--------------    ----------------------
A1  A1  A2  A2    A1   A1   A2  A2   A3
A3  A3  A4  A4    A3   A4   A4  A5   A5
A5  A5  A6  A6    A6   A6   A7  A7   A8
A7  A7  A8  A8    A8   A9   A9  A10  A10
..  ..  ..  ..    ..   ..   ..  ..   ..
A2  A2  A1  A1    A2   A3   A1  A1   A2
A4  A4  A3  A3    A5   A5   A3  A4   A4
A6  A6  A5  A5    A7   A8   A6  A6   A7
A8  A8  A7  A7    A10  A10  A8  A9   A9
..  ..  ..  ..    ..   ..   ..  ..   ..

The driver also supports an offset layout where each stripe is repeated o times. For example, o2 layouts on 2-, 3-, and 4-drive arrays are laid out as:

2 drives    3 drives      4 drives
--------    ----------    ------------------
A1  A2      A1  A2  A3    A1   A2   A3   A4
A2  A1      A3  A1  A2    A4   A1   A2   A3
A3  A4      A4  A5  A6    A5   A6   A7   A8
A4  A3      A6  A4  A5    A8   A5   A6   A7
A5  A6      A7  A8  A9    A9   A10  A11  A12
A6  A5      A9  A7  A8    A12  A9   A10  A11
..  ..      ..  ..  ..    ..   ..   ..   ..

Note: k is the number of drives, n#, f# and o# are parameters in the mdadm --layout option.


https://en.wikipedia.org/wiki/Non-standard_RAID_levels
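With mdadm, these layouts are requested at array creation time. A sketch for the two-drive "far 2" case described above (the device names /dev/sda and /dev/sdb are placeholders, and creating an array destroys any existing data on them):

```shell
# Two-drive md RAID10 with the "far 2" layout: still mirrored like
# RAID1, but sequential reads stripe across both members.
mdadm --create /dev/md0 --level=10 --layout=f2 \
      --raid-devices=2 /dev/sda /dev/sdb

# Confirm the layout took effect:
mdadm --detail /dev/md0 | grep -i layout
```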
frostschutz
Advocate


Joined: 22 Feb 2005
Posts: 2977
Location: Germany

Posted: Tue Dec 20, 2011 2:03 pm    Post subject:

In a server scenario there are usually multiple processes at work anyway, so in overall performance it makes no difference whether you do the striping thing or not. For me the speed improvement is noticeable even on a desktop. It's only when you benchmark with a single process, for example dd, that there does not seem to be any difference.

You could do a RAID 5 with 2 disks if you want striping. The two disk raid 5 mode is a bit odd though, and usually only used as an intermediate step when converting RAID 1 to RAID 5.
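A sketch of both routes (device and array names are placeholders; creating an array destroys any existing data on the member drives, and a level change should only be attempted with a good backup):

```shell
# Create a two-disk RAID5 directly (odd but valid: the "parity" on
# each stripe is just a copy of the other disk's chunk):
mdadm --create /dev/md1 --level=5 --raid-devices=2 /dev/sda /dev/sdb

# Or convert an existing two-disk RAID1 in place; a third disk can
# then be added and the array reshaped to a real three-disk RAID5:
mdadm --grow /dev/md0 --level=5
```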
All times are GMT