Technical discussions with deep learning researchers who study how to build intelligence. Made for researchers, by researchers.
July 11, 2024
01:34:19
May 9, 2024
01:01:55
March 12, 2024
01:55:45
Episode 33: Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference
Aug. 9, 2023
01:20:29
June 22, 2023
01:01:54
March 29, 2023
01:15:24
March 23, 2023
01:45:56
March 9, 2023
01:26:45
March 1, 2023
01:34:49
Feb. 9, 2023
01:44:54