PodcastsTecnologiaGoogle SRE Prodcast

Google SRE Prodcast

Salim Virji
Google SRE Prodcast
Último episódio

55 episódios

  • Google SRE Prodcast

    Matt Zelesko and the Future of SRE

    26/05/2026 | 23min
    We sit down with Matt Zelesko, VP of SRE at Google, for a candid talk about how AI is changing SRE — and how it's not.
  • Google SRE Prodcast

    Handling Burnout with Sam Anderson

    21/05/2026 | 10min
    Sam Anderson shares his experiences with burnout, and how to support yourself as a reliable system.  Sam provides guidance on how to deal with burnout, and some suggestions on how to avoid burnout through understanding yourself and finding the help and support you need.
  • Google SRE Prodcast

    The One with Crisis Engineering and Mikey Dickerson

    15/05/2026 | 43min
    Crisis Engineer Mikey Dickerson joins us to talk about what constitutes a crisis. Mikey draws on his broad experience across industry and the public sector, as well as on work with his team of systems fixers.
  • Google SRE Prodcast

    This is Fine! With Colette Alexander and Clint Byrum

    12/05/2026 | 9min
    What's happening in the world of SRE and resilience engineering? Join us as we catch up with fellow podcast hosts Colette Alexander and Clint Byrum of the This Is Fine! podcast at SREcon in Seattle.
  • Google SRE Prodcast

    The One With Damion Yates and Building AI systems

    26/02/2026 | 31min
    How do you introduce Site Reliability Engineering to an AI research lab, bringing concepts of scale to engineers who are at the leading edge of AI systems?
    In the latest episode of The Prodcast, hosts Steve McGhee and Florian Rathgeber chat with Damion Yates, who helped establish the reliability engineering culture at Google DeepMind. Damion shares his journey of bringing scalable infrastructure to DeepMind, supporting massive machine learning experiments.
    Discover the unique challenges of supporting AI research, such as managing highly expensive "lockstep" training models where a single machine failure halts the entire process. Damion also explains why he believes "luck is our enemy" in systems engineering, and why protecting a research scientist's time is the ultimate metric for success.
Mais podcasts de Tecnologia
Sobre Google SRE Prodcast
SRE Prodcast brings Google's experience with Site Reliability Engineering together with special guests and exciting topics to discuss the present and future of reliable production engineering!
Site de podcast

Ouça Google SRE Prodcast, Hard Fork e muitos outros podcasts de todo o mundo com o aplicativo o radio.net

Obtenha o aplicativo gratuito radio.net

  • Guardar rádios e podcasts favoritos
  • Transmissão via Wi-Fi ou Bluetooth
  • Carplay & Android Audo compatìvel
  • E ainda mais funções