This section of the manual documents our approach to operating and maintaining the hosted service, sr.ht. You may find it useful for running your own hosted sr.ht service, or for evaluating whether our practices and policies meet your requirements for availability and robustness. You might also just find this stuff interesting, as SourceHut is one of the few largeish services which is not hosted in The Cloud™.

Additional resources:

Operational Resources

Status page

status.sr.ht is hosted on third-party infrastructure and is used to communicate about upcoming planned outages, and to provide updates during incident resolution. Planned outages are also posted to sr.ht-announce in advance.

The status page is updated by a human being, who is probably busy fixing the problem.

Monitoring & alarms

Our Prometheus instance at metrics.sr.ht is available to the public for querying our monitoring systems and viewing the state of various alarms. Some alarms are also fed to the IRC channel and mailing list.
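Since the instance is open to public queries, a small script can check alarm state from outside. The following is a minimal sketch, not a description of our own tooling. It assumes the instance is reachable over HTTPS and exposes the standard Prometheus HTTP API under /api/v1. The helper names (build_query_url, parse_vector, firing_alarms) are hypothetical and exist only for this example.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the instance is served over HTTPS at this host and exposes the
# standard Prometheus HTTP API under /api/v1.
BASE_URL = "https://metrics.sr.ht"


def build_query_url(expr, base_url=BASE_URL):
    """Return the instant-query URL for a PromQL expression."""
    return base_url + "/api/v1/query?" + urllib.parse.urlencode({"query": expr})


def parse_vector(payload):
    """Turn an instant-vector response into (labels, value) pairs."""
    if payload.get("status") != "success":
        raise ValueError("query failed: %s" % payload.get("error", "unknown error"))
    data = payload["data"]
    if data["resultType"] != "vector":
        raise ValueError("expected a vector, got %s" % data["resultType"])
    # Each sample is {"metric": {...labels...}, "value": [unix_time, "string"]}.
    return [(s["metric"], float(s["value"][1])) for s in data["result"]]


def firing_alarms(base_url=BASE_URL, timeout=10):
    """Fetch the label sets of alarms that are currently firing (needs network)."""
    # ALERTS is the synthetic series Prometheus exposes for active alerts.
    url = build_query_url('ALERTS{alertstate="firing"}', base_url)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        payload = json.load(resp)
    return [labels for labels, _ in parse_vector(payload)]
```

Calling firing_alarms() and printing labels.get("alertname") for each entry would list the alarms that are firing at that moment. The URL construction and response parsing are kept separate from the network call so they can be exercised without touching the live instance.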

Mailing list

The sr.ht-ops mailing list receives automated reports from our services, including notifications for alarms of important or urgent severity, and status reports for backups and other systems.

IRC channel

The #sr.ht.ops IRC channel on irc.freenode.net is used for triage and coordination during outages, and has a real-time feed of alarms raised by our monitoring system.

About this wiki

commit 316b198186ddf05b8a6bb6cc4e6f9fb79748a333
Author: Drew DeVault <sir@cmpwn.com>
Date:   2020-03-26T10:26:26-04:00

installation.md: minor tweaks

Clone this wiki

https://git.sr.ht/~sircmpwn/sr.ht-docs (read-only)
git@git.sr.ht:~sircmpwn/sr.ht-docs (read/write)