Alt account of [email protected] here.

Our instance is currently down and I can’t get remote access to the servers. It appears that there might have been a hardware failure of the main firewall, which is the one thing I can’t work around remotely.

I am still trying a few things, but I am not very optimistic that I can get access.

The really unfortunate part is that just now I am on one of my rare work deployments abroad, so I also can’t access it physically during the next few weeks and my usual back up that could restart it is not available either.

As something like that never happened in 3 years operating the servers, I thought I can risk it, but murphy’s law seems inescapable 😓

I will try to keep you posted here on any updates, but probably there will not be much I can do for a while. Really bad timing 😥

Edit: we might use this “opportunity” to migrate the instance to Piefed, which has been an idea for quite some time now. I will keep you posted on that.

  • originalucifer@moist.catsweat.com
    link
    fedilink
    arrow-up
    7
    arrow-down
    3
    ·
    3 days ago

    people hate the big cloud solutions, but this is the kind of thing their HA infrastructure prevents against… hardware failures

    i dont enjoy using (or paying for) aws, but i will never have a firewall or disk failure.

    • drspod@lemmy.ml
      link
      fedilink
      English
      arrow-up
      7
      ·
      3 days ago

      Their HA infrastructure is all built on open source projects. The thing they have that we don’t is teams of SREs on-call 24/7.