Episodes

  • Mixture of Experts
    2024/10/08

    In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, and Jeff Dean.

    55 min
  • LoRA
    2023/09/02

    We talk about LoRA (Low-Rank Adaptation) for fine-tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y

    1 hr 3 min
  • 15: InstructGPT
    2023/03/28

    In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al. (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.

    57 min
  • 14: Whisper
    2023/03/17

    This week we talk about Whisper, a weakly supervised speech recognition model.

    49 min
  • 13: AlphaTensor
    2023/03/11

    We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.

    49 min
  • 12: SIRENs
    2022/10/25

    In this episode we talked about "Implicit Neural Representations with Periodic Activation Functions" and the strength of periodic non-linearities.

    54 min
  • 11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer
    2022/09/30

    In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw

    We look at how Tesla approaches collision detection with novel methods.

    49 min
  • 10: Outracing champion Gran Turismo drivers with deep reinforcement learning
    2022/08/23

    We discuss Sony AI's accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:
    - The crafting of rewards to make the agent behave nicely
    - What is QR-SAC?
    - How to deal with "rare" experiences in the replay buffer

    Link to paper: https://www.nature.com/articles/s41586-021-04357-7

    55 min