OpenSolaris 2008.05, and other places the sun don't shine
Way back in the dark ages of 1993, we were introduced to this thing called email. Email in the olden days was not like email now. All the packets traveled uphill no matter where they were going, and usually there was a good three or four feet of snow on the internet backbone. We used these big hulking things called VAX/VMS nodes that were attached to some pretty sweet fourteen inch monochrome VT 100 terminals. There were also these machines that ran something called UNIX, which sounded to us like something that should have been found in the college health center, not the computer lab.Eons passed, and things changed. Though there were many more email packets flying around, plate tectonics had changed the course of things so that now they traveled downhill, really really fast. The internet backbone became a series of tubes. All the VT 100 terminals banded together and created an archipelago in the South Pacific. And UNIX...
UNIX evolved. Mutated. It trickled down into various UNIX brands and distributions. There were things like BSD UNIX, HP-UX, and AT&T Bell Labs UNIX. There were other branches, too, rogue sprouts on the evolutionary tree: FreeBSD, and OpenBSD. Slightly alien but vaguely reminiscent life forms injected their DNA into the gene pool: Linux, and this weird little UNIX-esque animal called Solaris.
Sun recently let Solaris go open source. OpenSolaris is more a traditional UNIX environment than a Linux type environment, but the appeal of taking a peek at the 2008.05 OpenSolaris release was too great for us to resist. The folks at OpenSolaris knew this, and baked some goodies into the OS that no Linux user could refuse.
We were given a no-strings attached liveCD, so our Linux install would never know we cheated. We had a bash shell, and the GNOME desktop environment, so our eye candy and commands would feel familiar and easy.
Sometimes, though, evolution goes horribly, horribly wrong.
We eagerly booted into our OpenSolaris environment on our Athlon64 X2 system. We selected our keyboard layout and language, and OpenSolaris graciously presented us with the X server and GNOME. It was pretty, and we could even activate Compiz, of all things. Yes, it was very pretty. And for a reason we'll discuss a little later, we thought it would be really beneficial to stop what we were doing with the liveCD, and reboot.

OpenSolaris had other ideas. It just didn't want to tell us what those other ideas were.
GRUB would launch, and we could select the standard OpenSolaris boot option. The splash screen showed its mug, and we'd be dumped on to a screen with the standard trademark and legal disclaimers. And the cursor would blink, and blink, and blink. We waited. Patiently.
But it was clear that the optical drive was no longer reading, and the hard drive was inactive. Keyboard presses did nothing. Trying the whole process again with a text console GRUB selection played out the same way.
Certain things disturb us much more than they probably should. Humanoid puppets, hyper-intelligent birds, and when our computers behave irrationally for no apparent reason are the big three. OpenSolaris had thrown down the first gauntlet, and by honor, we could not walk away. We had to know not only why it booted once, but why it didn't seem to want to again.
The condensed version of the next 21,600 seconds (six hours, for those not wanting to do the math) went something like this:
It's a 64-bit versus 32-bit problem. We'll edit our GRUB boot lines so we boot into 64-bit no matter what.
It's a 64-bit versus 32-bit problem. We'll edit our GRUB boot lines so we boot into 32-bit no matter what, then.
Does this piece of crap have an issue with SATA drives? How can it? That makes no sense!
Take out a stick of RAM? Are they kidding? This is based on Solaris. How can it freak at 2 gigs of RAM?
Let's disable PnP in the BIOS. Oh. It already is.
Let's mess with our memory timing. Reduce by half, they say. We laugh in the face of unstable systems.
We have got to be crazy. We've got to. Why is it so damn important if this thing ever boots again?
Why is our IDE controller on in the BIOS anyway? We don't have any IDE drives. Not that it affects anything anyway.
Let's try every ACPI setting there is. Twice. Let's disable ACPI APIC. Let's enable it again.
Who would realistically ever go through this to boot an operating system? What is wrong with us?
Oh, hey, there is a switch for kernel debugging and a verbose switch in GRUB! Edit kernel line with -k -v and boot.
At the six hour mark, we approached the solution:
ehci is having an issue. That's... USB stuff? Why does it say it is ignoring the error? Duh. Obviously it isn't.
Hey, BIOS! It's us again. Bye, auto legacy support.
That didn't work. Do we have any USB devices even attached? The printer. The printer isn't on. Or plugged in.
Let's boot with -k -v again. There, it's talking about the keyboard, and then ehci freezes--
They have got to be kidding. Seriously. Our mouse? It is a USB mouse. Fine. We'll put a PS/2 adapter on it. If this works, we will renounce the idea that technology operates on any sort of logic. Let's reboot 'er.
Dear readers, all that technology you see on the desk in front of you today? Apparently, it is a collection of happy accidents. Putting a PS/2 adapter on our mouse allowed us to reliably boot into OpenSolaris.
This confounds us on a few levels. Solaris is a UNIX derivative. UNIX is (at heart) a command line that has the ability to run a GUI. It doesn't have to. We could have booted without a mouse. But booting with a mouse that wasn't necessarily fully recognized as such is apparently disastrous. Is it an input device issue? A USB issue?
Linux, which is also at heart a command line that is able to run a GUI, does on occasion have issues with various mouse types. USB support on Linux has also been historically buggy, though it is much better than it was even two years ago. However, we've never had a Linux machine hang indefinitely because the pointer device was somewhere the OS didn't want it. We've had plenty of experiences with non-working mice.
Non-working mice are a lot easier to troubleshoot than non-working systems.
We tried out a Zen outlook (we've learned so much through this agonizing enlightening process!) and began playing with OpenSolaris in earnest.
Remember the history of email we touched on at the beginning of this post? Slow, uphill both ways? Snow on the backbone? Migraine inducing VT100 terminals? They are all still more effective than trying to connect an OpenSolaris machine to the internet.
We admit that Linux has a long way to go with wireless support. However, we can not remember the last time we had a modern, full-bodied distribution have any real issues with a wired connection. The most we might have to do in Linux is activate the appropriate device, or point it at the right gateway.

OpenSolaris does have this nice Device Driver Utility. Utility is a bit of a misnomer. Unless you write device drivers, it doesn't really do much, except tell where on the system the device driver is located and whether the driver is supported and working.
Our ethernet connection is located at nge0 (as opposed to the Linux eth0). Good to know. Our driver is working for the ethernet card. Awesome. Er, so why can't we get online?

We can't reconfigure our card. OpenSolaris knows best. It uses this little Network Auto-Magic daemon. It thinks it is connected. It clearly isn't. Disabling the Magic is just different enough for a seasoned Linux user to feel a bit like they're in a bizarro universe, but at least the command line concepts between UNIX and Linux aren't horribly different.

On a liveCD such as OpenSolaris, manually configuring the ethernet device isn't a walk in the park either. There's that whole root password thing. Where it seemed we should be able to manually enter nameserver information, and our gateway, and all sorts of things, it didn't let us. We are not root, or remotely worthy.
We finally dug up the "pfexec su" command, and tried to get online that way. We could add our nameservers, and gateway, and we could activate it all, now that we were disguised as root. Apparently, though, OpenSolaris has a problem with our router, and the way it assigns various addresses. We may be activated. We are not connected. Where is VAX/VMS when you need it?
We'd just spent six hours getting OpenSolaris to boot. To hell with the internet connection. We are determined to the point of amazing stupidity, but even stupidity has limits. At least ours does.
Don't get us entirely wrong, there are some nice things about OpenSolaris. It is very light on pre-installed applications. There are the applications that come with GNOME, like gedit, some games, and system administration tools. There are a few multimedia apps, GIMP, and gtkam. That's about it. It's a nice way to build a system that does exactly what you'd like. (We are puzzled by the inclusion of Compiz, frankly, because it's a feature that doesn't contribute a whole lot of function to the desktop. It seems illogical to be there by default given what else is and isn't included.)
Despite our USB mouse woes, flash drives mount automatically and work as expected. Our audio drivers exist and supposedly function, but they've gone to ground with our network connection. So system sounds won't play through GNOME's esd. Because there are some gstreamer plugins that require installation via the packaging system, we couldn't try out any media files.
If we were hardcore software developers with a keen interest in UNIX-type environments, we'd go ahead and install OpenSolaris on our computers. We'd just rely heavily on the ol' VAX/VMS set up and the cutting edge Mosaic browser for our networking needs.
For the desktop Linux user who is simply curious about UNIX derivatives, we'd have to advise taking a pass on this release, anyway. We'd probably tell you to check out OpenBSD, or FreeBSD, for a shot at a more successful change of pace.












Comments
39
Subscribe to commentsGavMay 16th 2008 8:29AM
I couldn't get OpenSolaris working, which is weird, considering Solaris 10 (the closed source, but still free version) works just fine on there.
I was looking forward to a nice new OS too :(
bnitzMay 16th 2008 9:44AM
This is most likely a memory problem. Solaris 10 could get by with less memory. The OpenSolaris 2008.05 LiveCD and installer seems to occasionally need a bit more than the 512MB prescribed.
GavMay 16th 2008 11:35AM
Certainly isn't a memory issue, I've got 4GB, and it's all pretty modern hardware (Q6600, Asus P5K-E), so why it's not working is a total mystery, won't even boot into the LiveCD.
bnitzMay 16th 2008 12:18PM
Gav, that is strange. Driver support is generally more complete (though probably more buggy) in OpenSolaris 2008.05 than it is in Supported/proprietary Solaris 10. Did you verify the md5 of the CD ISO? How far along does the boot get? There is at least one known bug which hangs the system due a driver conflict:
http://defect.opensolaris.org/bz/show_bug.cgi?id=1334
GavMay 16th 2008 12:40PM
Hey blitz, could you e-mail me on gavdownloadsquad@temporaryforwarding.com please, this probably isn't the best place to be doing this. :)
bnitzMay 16th 2008 9:44AM
Interesting article. It reminds me of problems I had booting a Dell on an early OpenSolaris build until I found out that the Dell power supply wasn't pushing enough watts to spin up the 2nd hard drive reliably. Your troubleshooting sequence indicates that you're familiar with the chaos behind any OS's interaction with Wintel hardware.
Linux may be ahead on the ugly hacks necessary to make (n-1)! hardware devices play nicely together and MS Windows is ahead of that. But to solve this, Sun either needs to go the Apple route of saying "You MUST run our OS on our hardware!" Or Sun can hope OpenSolaris users such as yourself can submit bugs and info which helps yourself and others get past these unusual HW issues. (A reminder, early Linux 2.6 kernels destroyed hard drives from a well-known manufacturer because that manufacturer implemented a standard bus reset command as 'upload firmware.')
Eroom's law - every 18 months as the corporate budget devoted to proprietary OS $oftware rises, the laws of economics dictates that the hardware trends towards @^%#*.
bnitzMay 16th 2008 10:25AM
If you have plenty of memory and already have a GNU/Linux distribution with room on a partition, its probably worth trying to install and run OpenSolaris 2008.05 on VirtualBox. Yes, a few things are broken (generally specific to hardware). Your network issue is probably that a free/legal opensolaris driver doesn't exist for your NIC. Try removing the NIC drivers from your favorite GNU/Linux dist and see how helpful the error messages are.
But some things are just different and therefore take some getting use to for those accustomed to Windows or a particular Linux distro. My first experience in going from VMS to Unix was "no CLI standardization, it looks like it was written for hackers, by hackers." Did I give up, or did I become a hacker? Change is difficult. I'll never forget the Byte magazine review of the 1985 Amiga which faulted it for not having an AUTOEXEC.BAT file. (Instead it executed startup scripts for the machine and GUI and each shell,as do all modern preemptive multitasking OSs... something single threaded DOS CLI users wouldn't have ever dreamt of.)
GeirMay 16th 2008 10:57AM
If you do have problems with your NIC with [Open]Solaris, then consider a visit to the "Free NIC drivers for Solaris" page at http://homepage2.nifty.com/mrym3/taiyodo/eng/. It may solve your problem, and you may have to build the driver (particularly if you use OpenSolaris). It can be a huge relief if you get the NIC working though, so it is worth a shot...
Kristin ShoemakerMay 16th 2008 11:18AM
Good advice on the NICs... Thanks! Unfortunately in my case it wasn't that the NIC wasn't supported (Driver Utility reported it working fine. It showed up on an ifconfig -a and OpenSolaris really believed it was connecting.)
I am not totally well versed in how it all works, but I sense my issue was the router I am using, and how it identifies/issues IP addresses (it also acts as a nameserver, which I am fairly sure was complicating the process.)
I have a few devices connecting to this router both physically and wirelessly, running a variety of operating systems. They don't have an issue using the router as a nameserver, or communicating with the outside world.
I am totally confident that this issue could be overcome. I am also guessing it would require, at least if I took the path of least resistance, changing how my router handles the network traffic. I think it probably could work. I'd also have to change a few settings on a few pre-existing machines.
I also imagine there is more than one way to solve that particular problem... Which is the cool things about computers.
Truth be told, however, I was working off a LiveCD. And unfortunately, it once again falls into the category where I wonder how many people would want to venture installing a LiveCD without a working network connection on a partition in hopes they can make it work from there?
Hardware and its interaction with software has got to be endlessly frustrating for programmers. I can't possibly even imagine... In a world where revisions in the same model component can present wildly different levels of support, it's got to drive them nuts.
I have a funny feeling in my case, though, there were some weird protocol and identification issues.
bnitzMay 16th 2008 11:47AM
Regarding networking, have you tried:
Open a terminal
su -
{enter root password}
svcadm restart nwam {restarts the network automatic daemon and its dependencies... sometimes clears up wifi connect or dhcp allocation issues.)
If that doesn't work:
svcadm disable nwam
network-admin &
The use that gui to enter the parameters for your network.
If you ever tried to connect GNU/Linux to wifi on weird networks, you are probably familiar with wificonfig. That exists in opensolaris also.
Have fun. I hope you don't give up too easily ;-)
(O.K. 6 hours seems like alot, but I've been compiling kernels, patching drivers and googling everywhere for any Linux dist which can view the composite video input of my Intel pro webcam for more than 6 months!)
Kristin ShoemakerMay 16th 2008 12:00PM
Alas, I tried that sequence of events so many times, I actually felt like I knew the soul that wrote the nwam man page personally. :)
All the pertinent information was given to Network-Admin and nge0 activated nicely by DHCP. I actually took a peek at resolv.conf and related files after the activated nge0 still wasn't letting me ping... Anywhere. Everything seemed placed appropriately. Still no love though.
Good luck with the webcam! :) All I can say is... my husband gets so cheesed at me when I go on "Gentoo kicks" and spend long weekends (or weeks) compiling. He says it's because the electric bill will suck. I prefer to think it's because he misses me. ;)
bnitzMay 16th 2008 11:21AM
P.S. Kristin, I forgot to mention, you don't need to use pfexec. You can login as a privileged user (e.g. "jack" for the livecd or the user you create during the install), and then just:
su - {the root password you entered during install}
RBAC, ZFS, pkg... are different and more capable than the access control, filesystem+volume management and packaging tools found in GNU/Linux distributions, but wherever possible, OpenSolaris developers try to make things easy to newcomers from other *nixes.
Kristin ShoemakerMay 16th 2008 11:23AM
Oddly, the jack password wasn't working. Tried it... still was denying permission. Hence the sledgehammer approach of pfexec su
Read somewhere, a bit later, that su - password opensolaris also works on the LiveCD... But I'd already shoved the round pfexec su peg in the square hole.
A. LarcherMay 17th 2008 10:59AM
I had no problem to install OpenSolaris on my Thinkpad, audio works fine with OSS (just a package to install), installing Blastwave's mplayer package dos the job as far as multimedia is concerned and the issue you mention was addressed months ago and is documented (search on OpenSolaris 5 minutes and there you are).
I shall mention that OpenSolaris people answer fast and kindly on mailing lists (if you don't say "Solaris sucks and must be dead, Linux rules" as it can be seen sometimes on the ML :D).
For USB devices I never had problem with Solaris but my usb key don't work with Linux... so all is relative when you think about it... *grunt*
Great article ;)
bnitzMay 19th 2008 5:04AM
I should apologize to Kristin for doubting the cause was related to a USB mouse. I was about to log this bug and found this:
http://bugs.opensolaris.org/view_bug.do?bug_id=6699624
I'll add details and try to see how to put it in the external OpenSolaris 2008.05 distribution specific database. Thanks Kristin for your help and patience in diagnosing this one!
Rohan DhruvaMay 25th 2008 5:51AM
Hi,
I've written my review of OpenSolaris 2008.05 here - http://rohandhruva.blogspot.com/
FWIW, I had similar problems like you, but I still feel they've put together a nice livecd for a first release :)
Rohan.
RolandixorJul 3rd 2008 2:22PM
this is why there is Virtual Box...
OpenSolaris looks sweet, but tastes bitter.
SteveHigginsJul 13th 2008 9:46AM
My system specs are:
AMD Athlon 64 3200+
1.5 GB RAM
Asus A8S-X Motherboard with Intel 82540EM LAN
Nvidia GeForce 6200TC PCI-Express
SBLive
40GB Maxtor PATA HDD (I wan't to put Opensolaris here)
250GB Maxtor SATA HDD (For Windows XP and data)
USB Mouse
USB Keyboard
Samsung Syncmaster 930BF 19" 1280x1024 connected with DVI
Samsung R87 26" HDTV connected with VGA.
When I boot the Opensolaris LiveCD I can get past the language selection, but I do not get to the GNOME desktop. Instead I am left with a login prompt at the console. After about a minute of further CD activity, the system hangs (the keyboard cannot be used and I have to hit the reset button to reboot).
I have tried using PS/2 adaptors for my mouse and keyboard, disabling APIC, USB2 in the BIOS settings, but nothing works.
Can anyone help me, please?
germsteelJul 27th 2008 6:43AM
Just installed OpenSolaris 2008.05 LiveCD on VirtualBox. Gnome fails to start, keyboard mapping on login panel off by one letter; shifted to the right. bonobo activation server error 3.
Well I have spent more hours than I care to on this already back to SLES 10! I'll try the next OpenSolaris release when it comes out.