mastohost,
@mastohost@mastodon.social avatar

One of the RAID disks in the database server 5 failed and it needs to be replaced as soon as possible.

I requested the replacement and expect it to be done in the next couple of hours. This requires the server to go offline but it should take less than an hour.

I will update here once the intervention starts/finishes.

Only instances using the Database 5 will be impacted (see the database server in your Masto.host account).

Sorry for the inconvenience. This week servers decided to act up :|

duncanhart,
@duncanhart@mastodon.duncanhart.com avatar

@mastohost you're worth every penny 👏 🙇‍♂️ 🙂 😀

mastohost,
@mastohost@mastodon.social avatar

@duncanhart ♥️

dc,
@dc@social.advancedsoftware.engineering avatar

@mastohost, just curious: Have they given a reason why they can't hot-swap the disk?

mastohost,
@mastohost@mastodon.social avatar

@dc the drives on these servers are not hot-swappable

chris,

deleted_by_author

  • Loading...
  • mastohost,
    @mastohost@mastodon.social avatar

    @chris 😨

    mastohost,
    @mastohost@mastodon.social avatar

    The intervention in Database server 5 is about to start.

    zip,
    @zip@wandering.shop avatar

    @mastohost that's my favourite sci fi movie

    mastohost,
    @mastohost@mastodon.social avatar

    @zip I am watching it now 😂

    mastohost,
    @mastohost@mastodon.social avatar

    Database server 5 went offline about 5 minutes ago and the disk is being replaced now.

    mastohost,
    @mastohost@mastodon.social avatar

    There is a problem with the disk replacement that is causing the server not to boot. They are trying to understand what could be causing the problem.

    I am really sorry but it's something that I have no access to and will have to rely on the information I am provided, causing me to provide timings and have expectations that are inaccurate.

    I'm contacting everyone I can to obtain more informations and ways to speed up the process.

    joe,

    @mastohost It is what it is. We trust you.

    fuzzface,

    @mastohost Thank you for your efforts!!

    mastohost,
    @mastohost@mastodon.social avatar

    OK, Database Server 5 is back online. I am still looking into it and will update here later.

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 💗

    breadandwater,

    @mastohost sounds stressful!

    teapot_ben,
    @teapot_ben@glammr.us avatar

    @mastohost thank you for all your hard work.

    Please remember to take a break from this and look after yourself though!

    mastohost,
    @mastohost@mastodon.social avatar

    Everything is looking good with Database server 5 and everyone is back online.

    Still, the faulty drive is on the server and I still need to deal with that because running with a single disk RAID is dangerous. I do have remote backups that run daily but only expect to use them as a last case scenario.

    Will spin a test server to see if I can replicate the issue and see if I can work around it.

    Tomorrow will provide more information.

    Enormous apologies to anyone affected by this downtime.

    Strange_Bundle,
    @Strange_Bundle@mastodon.online avatar

    @mastohost Thanks for your hard work!!

    leo,
    @leo@twit.social avatar

    @mastohost Thank you!! You do a great job Hugo and I appreciate it!

    mastohost,
    @mastohost@mastodon.social avatar

    @leo Thank you Leo. Love the hair BTW :)

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 🙃

    coffe,
    @coffe@social.piewpiew.se avatar

    @mastohost keep on truckin! 👍☕

    mastohost,
    @mastohost@mastodon.social avatar

    @coffe 🤗

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 🍬 🍬 🍬

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 🍫 🍪 🍫 🍪

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost We call it "soulfood" in Germany! Always needed if you are trying to solve big problems.

    mastohost,
    @mastohost@mastodon.social avatar

    @dnkrupinski Thank you :)

    timo,
    @timo@hetzel.net avatar

    @mastohost have you thought about offering managed VM-hosted Mastodon?

    mastohost,
    @mastohost@mastodon.social avatar

    @timo Thought about it when starting Masto.host, tried it with some instances and decided it wasn't the best path for me.

    Dragon,
    @Dragon@toast.dragon2611.net avatar

    @mastohost If they can't sort that they should provision you a spare server to migrate to, it's not like someone the size of OVH doesn't have entire spare servers laying about.

    mastohost,
    @mastohost@mastodon.social avatar

    @Dragon It's one of the options on the table, I just don't really want to be again in the same situation, with a server down for hours. I have done disk replacements with them in the past without any major issues, around 30 minutes and it's back online. Hope to have a plan in place in the next hours.

    mastohost,
    @mastohost@mastodon.social avatar

    After much back and forth, I have decided to migrate the databases on database server 5 one by one to a new server. It's the safest option to be sure there is no endless downtime, where I am left waiting without any information and ways to act.

    After the migration of all data is done, I will play with the server to make sure future disk replacements don't cause the same situation.

    mastohost,
    @mastohost@mastodon.social avatar

    This means that there will be some downtime during the database migrations but for small databases it will be minimal, a couple of minutes, but for larger ones it can mean up to two hours of downtime. I will email the larger instance admins to inform of a specific downtime for their instance.

    I will keep updating this thread. Thank you.

    masukomi,
    @masukomi@connectified.com avatar

    @mastohost while i don't think i'm affected, i appreciate your open communications on this matter.

    mastohost,
    @mastohost@mastodon.social avatar

    @masukomi Yep, your instance is not affected. Thanks :)

    jarno,
    @jarno@mastonederland.nl avatar

    @mastohost thanks for your effort and good luck 👍🏻

    mastohost,
    @mastohost@mastodon.social avatar

    @jarno thanks :)

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 🤞🤞🤞

    mastohost,
    @mastohost@mastodon.social avatar

    All databases have been migrated from Database Server 5 🎉

    This issue will not cause any more problems.

    Next week, me and the data center team will be doing some investigation into Database Server 5 to determine the best way to avoid a problem like this from happening again.

    Terribly sorry for the inconvenience this caused and any issues or questions, please let me know.

    Thanks 🐘

    galooph,
    @galooph@masto.galooph.com avatar

    @mastohost Hope you get to relax for the rest of the weekend, Hugo!

    mastohost,
    @mastohost@mastodon.social avatar

    @galooph Me too 😂 Thanks

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost Thanks for all the work you've done!

    mastohost,
    @mastohost@mastodon.social avatar

    @dnkrupinski You are very kind. Thank you so much :)

    selea,

    @mastohost

    You have, and are doing a great job of informing and delivering a great service!

    Things break, everywhere - we can't get past that. Enjoy some good food and a drink this evening :)

    mastohost,
    @mastohost@mastodon.social avatar

    @selea Thank you so much ♥️

    I will be going our for dinner tonight for sure 😅

    Psy_Fer_,

    @mastohost legend!

    david,
    @david@theblower.au avatar

    @mastohost Rough week :(

    mastohost,
    @mastohost@mastodon.social avatar

    Massive SNAFU! I can't apologise enough but it was not on my end.

    The old drive stored the EFI and that is required to boot the server. The person that did the disk replacement didn't know that (EFI cannot be on the RAID), so they were hours attempting to reboot the server without EFI, until they gave up and handed me the broken server.

    Now, an attempt will be made to place the old drive back and see if the server boots like that and then I can fix the situation.

    foosel,
    @foosel@chaos.social avatar

    @mastohost hugops 🫂

    dnkrupinski,
    @dnkrupinski@hannover.town avatar

    @mastohost 🤞

    kyle,
    @kyle@keyboards.social avatar

    @mastohost you’ve had a day but thank you for keeping everyone informed and I really appreciate you!

    mastohost,
    @mastohost@mastodon.social avatar

    @kyle thank you :)

    gunchleoc,
    @gunchleoc@mastodon.scot avatar

    @mastohost 😱 May the fix go smoothly!

    kf,
    @kf@666.glitchwit.ch avatar

    @mastohost :elmo_fire: :iluvu:

    markigra,

    @mastohost Thanks for working to save the situation. I know you are neck deep in this, but it would be great to have an update or an ETA.

    hchorey,

    @mastohost 🤞

    jeremy,
    @jeremy@mapstodon.space avatar

    @mastohost Thanks for the info! FYI my.masto.host and status.masto.host don't seem to be responding right now. Maybe it's related.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • Durango
  • magazineikmin
  • mdbf
  • thenastyranch
  • khanakhh
  • rosin
  • Youngstown
  • ethstaker
  • slotface
  • modclub
  • kavyap
  • DreamBathrooms
  • everett
  • ngwrru68w68
  • JUstTest
  • InstantRegret
  • tacticalgear
  • GTA5RPClips
  • cubers
  • normalnudes
  • osvaldo12
  • tester
  • anitta
  • cisconetworking
  • megavids
  • Leos
  • provamag3
  • lostlight
  • All magazines