At approximately 22:00 PST, our primary MySQL / secondary DNS server went
offline. At this time we don't know the root cause. There is strong evidence
the power supply has failed, however.
Users may be seeing timeouts or connection failures when their web applications
attempt to connect to the MySQL server. DNS resolution for things under the
parodius.com domain, as well as domains we host, may take slightly longer than
usual (but should be working).
We will be going to the datacenter later tonight to investigate the problem,
and/or move services off to another machine. We'll provide further updates as
we have them.
Update (23:43 PST) — we're preparing to leave for the datacenter, and
will be replacing the box with a brand new 4-core, 4-disk system. It will take
us some time to migrate the previous servers' data to the new system. There is
no ETR at this time.
Update (04:20 PST) — we've returned from the datacenter. The new server
has been installed, but has not been configured at this time.
Regarding the bad hardware: analysis shows the power supply is in excellent
condition (all voltages are perfect); only when hooked up to the motherboard
does it fail to power on. The same happens with an alternate PSU. We believe
the issue to be a bad motherboard or a bad CPU, as either of these will cause
a system not to power up. Further investigation will be done in days to come.
All data which was on the system safe/intact. We're now beginning a combination
of data restoration + system reconfiguration (for the new server). This will
take some time, so please be patient.
Update (06:47 PST) — we have finished the restoration. MySQL services
are again up and running. We apologise for the unexpected interruption
in service.
Post-outage follow-up — hardware analysis initially determined the failed
component to be the motherboard. This conclusion was based on a testing the
PSU with a voltage tester (it passed), and replacing the CPU (the motherboard
would still not power on). We then replaced the motherboard, but experienced
the same problem. We then hooked up a spare PSU and the system powered up
normally, concluding the faulty part was the PSU. Upon disassembling the PSU,
we found a single bulging capacitor which also showed signs of having leaked.
We will be replacing the PSU and eventually repurposing the system.