
Lexiang Huang | Metastable Failures in the Wild | #17
Disseminate: The Computer Science Research Podcast
Episode · 0 Play
Episode · 53:18 · Jan 9, 2023
About
Summary: In this episode Lexiang Huang talks about a framework for understanding a class of failures in distributed systems called metastable failures. Lexiang tells us about his study on the prevalence of such failures in the wild and how he and his colleagues scoured over publicly available incident reports from many organizations, ranging from hyperscalers to small companies. Listen to the episode to find out about his main findings and gain a deeper understanding of metastable failures and how you can identity, prevent, and mitigate against them!Links: OSDI paper and talkPersonal websiteTwitterLinkedIn Hosted on Acast. See acast.com/privacy for more information.
53m 18s · Jan 9, 2023
© 2023 Acast AB (OG)