Timestamps: 0:00 Introduction 0:54 Historical overview 2:49 What are State Space Models? 7:04 The design of the new alternative architecture 13:43 Why input gating is important 17:18 Scaling State Space Models 19:03 Model efficiency on device 20:39 A note on efficiency improvements 22:04 How do alternative models perform during inference 24:34 The caveat of the alternative models 26:11 Summary 27:13 Q&A
Timestamps:
0:00 Introduction
0:54 Historical overview
2:49 What are State Space Models?
7:04 The design of the new alternative architecture
13:43 Why input gating is important
17:18 Scaling State Space Models
19:03 Model efficiency on device
20:39 A note on efficiency improvements
22:04 How do alternative models perform during inference
24:34 The caveat of the alternative models
26:11 Summary
27:13 Q&A
Join our community on Slack join.slack.com/t/buzzrobot/shared_invite/zt-1zsh7k8pd-iMu_M8bUxIK3pOJgqJgCRQ