Evaluating Models Is Hard

Release Date:

Mike, Deaton, and Sploosh talk about how we attempt to evaluate models and do some model comparison exercises.
Join our Discord, where we chat every day: https://discord.gg/kVtYy7Z
If you enjoy this content and are in a position to support us, please consider becoming a patron: https://www.patreon.com/TheDangerRoomPodcast

---

Support this podcast: https://podcasters.spotify.com/pod/show/the-danger-room/support
