Commit 892bea1

Added additional SRE resources from major tech companies
1 parent 4fd5a3a commit 892bea1

1 file changed, +17 -4 lines changed

README.md

Lines changed: 17 additions & 4 deletions
@@ -1657,14 +1657,27 @@ Numerous organizations frequently share their insights and expertise, encompassi
 
 #### SRE Resources from various organizations
 
-* [Google SRE Page](https://sre.google/)
-* [Google SRE Classroom](https://sre.google/classroom/)
+* [Airbnb Engineering - Lessons Learned in Incident Management](https://dropbox.tech/infrastructure/lessons-learned-in-incident-management)
+* [Atlassian - Blameless Postmortems](https://www.atlassian.com/incident-management/postmortem/blameless)
+* [Atlassian - Creating Postmortem Reports](https://www.atlassian.com/incident-management/postmortem/reports)
+* [AWS Observability Recipes](https://aws-observability.github.io/aws-o11y-recipes/)
+* [Awesome Sysadmin](https://github.com/awesome-foss/awesome-sysadmin)
+* [Cloudflare - Incident Analysis and Response](https://blog.cloudflare.com/cloudflare-incident-on-august-21-2025/)
+* [Dropbox - Monitoring Server Applications with Vortex](https://dropbox.tech/infrastructure/monitoring-server-applications-with-vortex)
 * [Google Cloud SRE Page](https://cloud.google.com/sre)
+* [Google SRE Classroom](https://sre.google/classroom/)
+* [Google SRE Page](https://sre.google/)
+* [Google SRE - Blameless Postmortem Culture](https://sre.google/sre-book/postmortem-culture/)
+* [Google SRE - Incident Response and Analysis](https://sre.google/workbook/incident-response/)
 * [Microsoft SRE Page](https://docs.microsoft.com/en-us/azure/site-reliability-engineering/)
+* [Netflix - Centralized Site Reliability Practice](https://netflixtechblog.com/keeping-customers-streaming-the-centralized-site-reliability-practice-at-netflix-205cc37aa9fb)
+* [PagerDuty - Incident Response Automation](https://www.pagerduty.com/blog/automation/from-alert-to-resolution-how-incident-response-automation-cuts-mttr-and-closes-gaps/)
 * [School of SRE from LinkedIn](https://linkedin.github.io/school-of-sre/)
+* [Spotify - Incident Management Practices](https://engineering.atspotify.com/2013/06/04/incident-management-at-spotify)
 * [Stripe Increment Magazine Issue 16 on Reliability](https://increment.com/reliability/)
-* [AWS Observability Recipes](https://aws-observability.github.io/aws-o11y-recipes/)
-* [Awesome Sysadmin](https://github.com/awesome-foss/awesome-sysadmin)
+* [Uber - Observability at Scale](https://www.uber.com/en-IN/blog/observability-at-scale/)
+
+
 
 #### Incidents & postmortems
 