SLOs That Product Managers Actually UnderstandDEV Community [Unofficial]·23h ago·3 min readsresloproductreliability
Google SRE Review - Cheat SheetDEV Community [Unofficial]·2d ago·6 min readgooglesredevopsGoogle Site Reliability Engineering Book
Log Management at Scale: How We Cut Costs 70% Without Losing SignalDEV Community [Unofficial]·3d ago·3 min readloggingobservabilitydevopssre
Kubernetes 1.36: 8 Features Worth Your AttentionDEV Community [Unofficial]·6d ago·4 min readkubernetesawsdevopssre
The Golden Signals: A Practical Implementation GuideDEV Community [Unofficial]·6d ago·3 min readsremonitoringobservabilitydevops
Kubernetes Observability: What to Monitor and WhyDEV Community [Unofficial]·6d ago·3 min readkubernetesobservabilitymonitoringsre
On-Call Wellness: Protecting Your Engineers from BurnoutDEV Community [Unofficial]·Jun 27·3 min readsreoncallburnoutculture
Async LLM inference in CI: stop build workers blocking on slow jobsDEV Community [Unofficial]·Jun 25·5 min readdevopsinfrastructurellmsre
Code isn’t the only thing causing your production failures…The Stack Overflow Blog - Stack Overflow [Unofficial]·Jun 25·3 min readpodcastse-techse-stackoverflowagentic-ai
Capacity Planning Without ML: The 80/20 ApproachDEV Community [Unofficial]·Jun 23·3 min readsredevopscapacityscaling
Automate creation of Amazon CloudWatch alarmsDEV Community [Unofficial]·Jun 22·5 min readawssregithubactionsgithub
Semantic caching our flaky-test summariser: 58% fewer LLM callsDEV Community [Unofficial]·Jun 22·5 min readsredevopsllmmlops
What 60+ Claude Code memory entries taught me about solo opsDEV Community [Unofficial]·Jun 22·6 min readclaudesredevopsai
Chaos Engineering for Node.js Without the InfrastructureDEV Community [Unofficial]·Jun 20·6 min readapinodesretesting
Humanizing Artificial Intelligence in DevOps Documentation: Making Runbooks Easier to Create and UseDEV Community [Unofficial]·Jun 19·11 min readdevopsaidocumentationsre
Fault-injecting our LLM provider to trust Bifrost fallbacksDEV Community [Unofficial]·Jun 19·5 min readsredevopsllminfrastructure
Agent Handoff Contracts: The Missing Piece in Production Agent SystemsDEV Community [Unofficial]·Jun 18·4 min readsredevopsaiarchitecture
The SRE Mindset in API ArchitectureHow my SRE experience with sizing, observability and SLOs shapes the way I approach API architecture - planning for reality, not hopes.Ricky Moorhouse·Mar 7·7 min readFollowarchitecturesre