SLATE is a new way to teach AI to think step by step while using a search engine, giving feedback at each step instead of only at the end.
LiveMedBench is a new, always-updating test for medical AIs that keeps test questions safely separated from training data to avoid cheating by memorization.