Site Downtime on 24 February

Started by 青山 素子, February 24, 2009, 10:55:09 PM

Previous topic - Next topic

青山 素子

Today our site experienced an outage that lasted several hours. Once our staff became aware of the outage, our emergency backup site was activated for community support.

First, I'd like to let you all know that the outage wasn't the result of a hacking attempt or anything so glamorous. Rather, it was the very boring condition of failed hardware.

Specifically, the main power supply in our master database server decided to die. While some attempts were made to resurrect the machine, the power supply refused to perform its primary duty of supplying power. As a result, our database server was deprived of electrons and would not boot.

We have moved our database drives to our replica database server and got it booted back up. All the databases crucial for running our services have checked out okay. If you notice any problems that didn't exist previously, especially database errors, let us know.
Motoko-chan
Director, Simple Machines

Note: Unless otherwise stated, my posts are not representative of any official position or opinion of Simple Machines.


lovearat

I know ya'll often go unappreciated for all the great work and time put into running this site. So I want to give ya'll my heartfelt Thank you for all that ya'll do. And for working so hard to get the site back online.
<span style="font-size: 12px; color: red;">Do Not Pm Me For Support. Please use the appropriate board</span>

Apllicmz

thank you
i try to seach she not work
can check please

Nice work



fords8

Glad everything is back up and running!

_Anthony_


青山 素子

I believe our search server was running on the slave SQL server, which is now disabled. I'm not sure if we'll be able to get it to return until we get the new power supply and boot it back up.
Motoko-chan
Director, Simple Machines

Note: Unless otherwise stated, my posts are not representative of any official position or opinion of Simple Machines.


fords8

#6
No worries Motoko-chan! Do what ya got to do to get back to 100% . Thanks for the updates also!  8) :D :)

EDIT: I hope this didn't hurt CodeFest at all too. They didn't have to take time away from that to deal with this?

青山 素子

No, those that could attend (I was unable to attend due to work) were still able to meet. I did keep them informed of developments by phone.
Motoko-chan
Director, Simple Machines

Note: Unless otherwise stated, my posts are not representative of any official position or opinion of Simple Machines.


fords8

Quote from: Motoko-chan on February 25, 2009, 01:55:19 AM
No, those that could attend (I was unable to attend due to work) were still able to meet. I did keep them informed of developments by phone.

Great! Looking forward to hearing what they get done.

metallica48423

We all got here tonight -- the last will be here in the early afternoon tomorrow :)  Thanks for asking.

We'll try to get the search issue resolved as fast as we can :)
Justin O'Leary
Ex-Project Manager
Ex-Lead Support Specialist

QuoteMicrosoft wants us to "Imagine life without walls"...
I say, "If there are no walls, who needs Windows?"


Useful Links:
Online Manual!
How to Help us Help you
Search
Settings Repair Tool

Aleksi "Lex" Kilpinen

Considering the fact that you actually lost hardware, you guys sure got back on line fast I think... :)
Slava
Ukraini!
"Before you allow people access to your forum, especially in an administrative position, you must be aware that that person can seriously damage your forum. Therefore, you should only allow people that you trust, implicitly, to have such access." -Douglas

How you can help SMF

metallica48423

Well, we have two database servers - a master and slave used for replication.  We pretty much just swapped the hard drives from one to the other.  Problem is, the sphinx search index was on the second server, which is now the one without a power supply
Justin O'Leary
Ex-Project Manager
Ex-Lead Support Specialist

QuoteMicrosoft wants us to "Imagine life without walls"...
I say, "If there are no walls, who needs Windows?"


Useful Links:
Online Manual!
How to Help us Help you
Search
Settings Repair Tool

Amacythe

Quote from: fords8 on February 25, 2009, 02:05:30 AM
Quote from: Motoko-chan on February 25, 2009, 01:55:19 AM
No, those that could attend (I was unable to attend due to work) were still able to meet. I did keep them informed of developments by phone.

Great! Looking forward to hearing what they get done.

Well, thus far we managed to get everyone here who was on an airline flight, and those who got here early have managed not to kill each other.  We've had a few discussions about how much we love this project, and how we all want to double our pay (It's a joke since we are all unpaid volunteers!) but alas, we haven't gotten drunk, nor have we had any of the famous orgies that most 'business' meetings tend to manage.

Ok, seriously... metallica will be posting some of the details of our conferences to the Blog as time permits.

PacificWx

Great work getting the site back - as had been mentioned in this thread, you guys should be congratulated about how quick you got the site back up considering the power supply failure.

Great work!

Dzonny


TW1ST3D

Wow !!!   I was actually having symptoms of SMF Withdrawl Disorder...............
Running 2.0 Gold.......SMF Rocks!!

SleePy

Quote from: TW1ST3D on February 25, 2009, 07:30:36 AM
Wow !!!   I was actually having symptoms of SMF Withdrawl Disorder...............

Your not the one who spent over 12 hours in travel time to get somewhere :P
* SleePy injects SMF into himself.
Jeremy D ~ Site Team / SMF Developer ~ GitHub Profile ~ Join us on IRC @ Libera.chat/#smf ~ Support the SMF Support team!

Brettflan

Glad to see things back up and running. :)

fords8

Traveling does stink. But at least you are with people that like the samething you do. Now that is some coding power in one room! I wish I was there just to learn somethings!

LiroyvH

Quote
As a result, our database server was deprived of electrons and would not boot

Lmfao, that's the most nice phrased explanation i've ever seen for a failing power supply :P

Good job getting it back up :)
((U + C + I)x(10 − S)) / 20xAx1 / (1 − sin(F / 10))
President/CEO of Simple Machines - Server Manager
Please do not PM for support - anything else is usually OK.

Advertisement: