OK, I spent some time on the software side of the fence for demo/training purposes. After looking at ROCKS for a while, I decided I'd rather "roll my own" environment than use their methods (which are fine, but I'm pretty set in my ways).
I've got the four nodes now configured with:
- Sun Grid Engine
- Lustre (very basic single OST config for now)
- Cfengine (still version 2, working on 3)
- lm_sensors with some tweaks to read from the Intel boards
- Ganglia, Cacti, Nagios
One highlight has been getting lm_sensors updates from the fine folks at ELRepo and seeing the w83627dhg module start working:
mcr1u1 [~]> sensors
w83627dhg-isa-0a00
Adapter: ISA adapter
VCore: +1.03 V (min = +0.00 V, max = +1.74 V)
12VCC: +10.56 V (min = +13.04 V, max = +0.74 V) ALARM
AVCC: +3.42 V (min = +0.85 V, max = +2.66 V) ALARM
3VCC: +3.42 V (min = +2.43 V, max = +3.97 V)
in4: +1.86 V (min = +0.94 V, max = +0.15 V) ALARM
in5: +1.22 V (min = +0.11 V, max = +1.08 V) ALARM
5VCC: +5.12 V (min = +6.43 V, max = +4.38 V) ALARM
VSB: +3.44 V (min = +1.54 V, max = +1.76 V) ALARM
VBAT: +3.31 V (min = +1.81 V, max = +0.42 V) ALARM
Case_Fan: 0 RPM (min = 3515 RPM, div = 32) ALARM
CPU_Fan: 0 RPM (min = 2109 RPM, div = 128) ALARM
Aux_Fan: 0 RPM (min = 2812 RPM, div = 32) ALARM
fan4: 0 RPM (min = 5625 RPM, div = 16) ALARM
fan5: 0 RPM (min = 84375 RPM, div = 16) ALARM
Sys_Temp: +35 C (high = +18 C, hyst = -96 C) [CPU diode ] ALARM
CPU_Temp: +47.5 C (high = +80.0 C, hyst = +75.0 C) [CPU diode ]
AUX_Temp: +127.0 C (high = +80.0 C, hyst = +75.0 C) [thermistor] ALARM
vid: +2.050 V
My initial reaction was "Ahh! The fans are not spinning!" but then I calmed down, remembering the board is fanless. I also need to look at the config for the module and compare its voltages to the BIOS values. They seem a little off.
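Several of the ALARM flags in the w83627dhg output come from limits where min is higher than max, which no reading can ever satisfy. A rough Python sketch for picking those lines out before touching the config might look like this. The helper name `inverted_limits` and the regex are my own, and the sample lines are copied from the output above; it only handles voltage lines in this exact format.

```python
import re

# Matches voltage lines such as:
#   12VCC: +10.56 V (min = +13.04 V, max = +0.74 V) ALARM
LIMIT_RE = re.compile(
    r"^(?P<label>[^:]+):\s+(?P<value>[+-]?\d+(?:\.\d+)?)\s*V\s+"
    r"\(min\s*=\s*(?P<min>[+-]?\d+(?:\.\d+)?)\s*V,\s*"
    r"max\s*=\s*(?P<max>[+-]?\d+(?:\.\d+)?)\s*V\)"
)


def inverted_limits(sensors_text):
    """Return (label, min, max) for voltage lines whose min exceeds max."""
    bad = []
    for line in sensors_text.splitlines():
        m = LIMIT_RE.match(line.strip())
        if m and float(m["min"]) > float(m["max"]):
            bad.append((m["label"].strip(), float(m["min"]), float(m["max"])))
    return bad


# A few lines taken from the sensors output in this post.
SAMPLE = """\
VCore: +1.03 V (min = +0.00 V, max = +1.74 V)
12VCC: +10.56 V (min = +13.04 V, max = +0.74 V) ALARM
3VCC: +3.42 V (min = +2.43 V, max = +3.97 V)
5VCC: +5.12 V (min = +6.43 V, max = +4.38 V) ALARM
"""

if __name__ == "__main__":
    for label, lo, hi in inverted_limits(SAMPLE):
        print(f"{label}: min {lo} V > max {hi} V")
```

Anything this flags is a limit to fix in the sensors config rather than a reading to worry about.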
The head node uses the f71882fg hwmon driver, and after reading some MSI docs I re-labelled the outputs to what I think are the correct labels. I've attached the power supply fan to one of the board fan pins:
[root@mini etc]# sensors
f71882fg-isa-0a10
Adapter: ISA adapter
3VCC: +3.36 V
Vcore: +1.18 V (max = +2.04 V)
Vdimm: +4.08 V
Vchip: +4.08 V
5VCC: +5.04 V
12VCC: +14.27 V
5VSB: +10.71 V
3VSB: +3.36 V
Battery: +3.36 V
PSU_Fan: 1043 RPM
CPU_Fan: 5660 RPM
CPU_Temp: +41 C (high = +85 C, hyst = +81 C)
(crit = +100 C, hyst = +96 C) sensor = transistor
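Since the labels are my best guess, a quick sanity check is to compare each reading against the rail its label implies. Here's a minimal Python sketch of that idea. The nominal voltages and the 10% tolerance are assumptions on my part, not anything from the MSI docs, and the readings are copied by hand from the output above.

```python
# Nominal rail voltages implied by the labels (assumed, not from vendor docs).
NOMINAL = {"3VCC": 3.3, "5VCC": 5.0, "12VCC": 12.0, "5VSB": 5.0, "3VSB": 3.3}

# Allowed relative deviation from nominal (an arbitrary choice for this sketch).
TOLERANCE = 0.10

# Readings copied from the f71882fg output in this post.
READINGS = {"3VCC": 3.36, "5VCC": 5.04, "12VCC": 14.27, "5VSB": 10.71, "3VSB": 3.36}


def out_of_tolerance(readings, nominal=NOMINAL, tolerance=TOLERANCE):
    """Return (label, volts, percent_off) for readings far from their nominal rail."""
    suspects = []
    for label, volts in readings.items():
        if label not in nominal:
            continue  # no expectation for this label, so skip it
        deviation = abs(volts - nominal[label]) / nominal[label]
        if deviation > tolerance:
            suspects.append((label, volts, round(deviation * 100, 1)))
    return suspects


if __name__ == "__main__":
    for label, volts, pct in out_of_tolerance(READINGS):
        print(f"{label}: {volts} V is {pct}% off nominal")
```

A reading that lands far from its rail here more likely means a wrong label or a missing scaling factor in the sensors config than a real voltage problem, so it's a hint about where to look, not a diagnosis.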
On the cosmetic side, I've gotten some proper-length SATA cables with 90-degree bends in them.
Hey, and UPS delivered another 2GB stick of memory for the head node, bringing the "cluster" to 10GB of RAM in total. (2GB already in the three nodes, but I forgot I'd put a 1GB stick in the head node.)