View previous topic :: View next topic |
Author |
Message |
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Tue Dec 06, 2005 3:06 am Post subject: Errors on Proliant ML370 Server |
|
|
Ok, im trying to install Gentoo on my Proliant ML370 Server. The server is useing the Smart Aray that normally comes with Compaq servers. Their are 6 18.?GB HD's in the server. It has 2 1.1Ghz Processors and is running with 512MB SD Ram.
When I boot off the cd with gentoo-nofb (So I can see what happens before the errors appear, same thing happens with just the gentoo kernel) gets to init, it sees both CPU's, the autoconfig runs fine, and then I see lots of erros with Coldplug. One I saw was segment fault in coldplug, it kept on going, then their were alot more coldplug errors, a command prompt appeard briefly, then the screen filled with errors looking like this :
[<c0245d0d>]
[<c010f482>]
and so on down the page.
I think it might have loaded to a command line once and I proceded to start the install, but that only happend the very first time, and it didnt see my network card.
Anyone have any ideas? I was also having troubles running Debian install, so possibly something im not passing into the boot options? |
|
Back to top |
|
|
Mitusin n00b
Joined: 29 Nov 2004 Posts: 6
|
Posted: Tue Dec 06, 2005 3:51 am Post subject: Possible solution |
|
|
I had some similar problems with an older HP server (Lh3 i think) 2005.1 doesn't boot, 2005.0 with a lot of errors, so I take my old gentoo 1.4 and works flawlessly, after install I did an "emerge world" and my system is up to date with no problems.
I think that was a problem with the scsi, try to boot with "noscsi" an then load the correct module (but this was in my case). |
|
Back to top |
|
|
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Tue Dec 06, 2005 4:42 am Post subject: |
|
|
I did a basic look for older versions on the website and I cant find it. Do you know of a source, or should I do a more detailed search? |
|
Back to top |
|
|
Bad Penguin Guru
Joined: 18 Aug 2004 Posts: 507
|
Posted: Tue Dec 06, 2005 3:21 pm Post subject: Re: Errors on Proliant ML370 Server |
|
|
Onorhc wrote: | Ok, im trying to install Gentoo on my Proliant ML370 Server. The server is useing the Smart Aray that normally comes with Compaq servers. Their are 6 18.?GB HD's in the server. It has 2 1.1Ghz Processors and is running with 512MB SD Ram. |
The trick is something along the lines of:
gentoo-nofb nohotplug
or
gentoo-nofb nodetect |
|
Back to top |
|
|
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Wed Dec 07, 2005 12:37 am Post subject: |
|
|
That worked for alittle longer. The prompt was their for 10 seconds, then it went into more hex errors like I described.
Then Code: ef 0f 9d 00 00 00 8b 44 24 18 8b 15 10 6b 42 c0 8b 0c ect....
Then it said <6>note: runscript.sh{6620} exited with preempt_count 1
SQUASHFS error: zlib_fs returned unexpected reult 0xfffffffd
SQUASHFS error: Unable to read fragment cache block [16235cf]
and lost more errors allong that line.
Ctrl alt del doesnt restart at this point.
That was with nohotplug. Same with nodetect. |
|
Back to top |
|
|
Bad Penguin Guru
Joined: 18 Aug 2004 Posts: 507
|
Posted: Wed Dec 07, 2005 1:37 am Post subject: |
|
|
Onorhc wrote: | That worked for alittle longer. The prompt was their for 10 seconds, then it went into more hex errors like I described.
Then Code: ef 0f 9d 00 00 00 8b 44 24 18 8b 15 10 6b 42 c0 8b 0c ect....
Then it said <6>note: runscript.sh{6620} exited with preempt_count 1
SQUASHFS error: zlib_fs returned unexpected reult 0xfffffffd
SQUASHFS error: Unable to read fragment cache block [16235cf]
and lost more errors allong that line.
Ctrl alt del doesnt restart at this point.
That was with nohotplug. Same with nodetect. |
Um, that sounds more like a media or memory error. I have installed several ML370s, I know it is possible, just can't recall the exact boot args right at the moment.
Perhaps:
gentoo-nofb noscsi nohotplug nodetect noapci |
|
Back to top |
|
|
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Wed Dec 07, 2005 1:45 am Post subject: |
|
|
Nope, still errors out.
The new ones with those args is:
Unable to handle kernel NULL pointer dereference at virtual address 0000004 printing eip:
c0103b9e
*pde = 00000000
Recursive die() failure, output supressed
<1>Unable to handle kernel paging request at virtual address eda2d3ec printing eip:
c01108f9
*pde = 00000000
Recursive die() failrue, output supressed.
Someone earlier suggested trying 1.4, anyone know where I can get that? Doesnt appear to be on the mirrors or on the main site. |
|
Back to top |
|
|
Bad Penguin Guru
Joined: 18 Aug 2004 Posts: 507
|
Posted: Wed Dec 07, 2005 2:04 am Post subject: |
|
|
Onorhc wrote: | The new ones with those args is:
Unable to handle kernel NULL pointer dereference at virtual address 0000004 printing eip:
c0103b9e
*pde = 00000000
Recursive die() failure, output supressed
<1>Unable to handle kernel paging request at virtual address eda2d3ec printing eip:
c01108f9
*pde = 00000000
Recursive die() failrue, output supressed.
|
Does it appear to be happening as an attempt is made to load a certain module? How far does it get before this happens? |
|
Back to top |
|
|
augury l33t
Joined: 22 May 2004 Posts: 722 Location: philadelphia
|
Posted: Wed Dec 07, 2005 2:37 am Post subject: |
|
|
I'd pull the drives and ram and migrate. |
|
Back to top |
|
|
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Wed Dec 07, 2005 2:58 am Post subject: |
|
|
augury wrote: | I'd pull the drives and ram and migrate. |
Pull the ram and drives and migrate? As in ditch the server. That I cant do, as its my only server
Any more ideas? |
|
Back to top |
|
|
jmbsvicetto Moderator
Joined: 27 Apr 2005 Posts: 4734 Location: Angra do Heroísmo (PT)
|
Posted: Wed Dec 07, 2005 3:07 am Post subject: |
|
|
Hi.
I've read some posts here in the forums about old Compaq servers, R3000?, that used a lot of kernel options including some memory address mappings. In case you don't need so extreme options, have you tried the following?
Code: | gentoo-nofb nodetect nohotplug noscsi noapic acpi=off |
_________________ Jorge.
Your twisted, but hopefully friendly daemon.
AMD64 / x86 / Sparc Gentoo
Help answer || emwrap.sh
|
|
Back to top |
|
|
Bad Penguin Guru
Joined: 18 Aug 2004 Posts: 507
|
Posted: Wed Dec 07, 2005 3:11 am Post subject: |
|
|
Onorhc wrote: | augury wrote: | I'd pull the drives and ram and migrate. |
Pull the ram and drives and migrate? As in ditch the server. That I cant do, as its my only server
Any more ideas? |
I recall having problems getting ml370's to boot, but they were more to do with the e100 module not working when the eepro100 would. I had to turn off autodetection to prevent it from loading. I don't have one of the install cds in from of me, but there is a readme on it with all of the boot flags. Try booting with turning everything that you possibly can off. Any kind of hot/coldplugging, no smp, no dhcp, try to bypass everything the initrd and kudzu is trying to probe.
Your squashfs errors tell me that you are having a memory or media error though. It is failing as it attempts to uncompress the livecd.squashfs. You might try running the memtest on the gentoo install cd just to rule out memory problems. If you have 2 dimms you can remove one of them at a time to see if they are bad... |
|
Back to top |
|
|
Onorhc n00b
Joined: 06 Dec 2005 Posts: 6
|
Posted: Sat Dec 10, 2005 8:05 pm Post subject: |
|
|
Ok, checking my memory with memtest.
Lets assume it passes just fine. The one thing suggested that I have not tried is useing Gentoo 1.4. Does anyone know where I can get that, or would one kind soul be nice enough to supply me with a copy? |
|
Back to top |
|
|
keyson l33t
Joined: 10 Jun 2003 Posts: 830 Location: Sweden
|
Posted: Fri Dec 16, 2005 8:11 am Post subject: |
|
|
Hi.
When you start up the installation media and get the boot:
Then enter
And it should boot up. When you mount the file system don't
forgett to do
Code: |
mount -o bind /dev /mnt/gentoo/dev
|
so the grub can find the scsi raid array.
This worked on the ml370 I installed. |
|
Back to top |
|
|
alterself.com Apprentice
Joined: 13 Nov 2003 Posts: 175 Location: Iowa
|
Posted: Sun Sep 03, 2006 6:23 pm Post subject: |
|
|
what crap these ML370's are...
i finally ot mine booted off of 2006.0. had to do
Code: | gentoo-nofb ide=nodma docache |
its booted now! woot, now the process begins.... _________________ Gentoo ROCKS! |
|
Back to top |
|
|
|