๐ŸŽ“How I Study AIHISA
๐Ÿ“–Read
๐Ÿ“„Papers๐Ÿ“ฐBlogs๐ŸŽฌCourses
๐Ÿ’กLearn
๐Ÿ›ค๏ธPaths๐Ÿ“šTopics๐Ÿ’กConcepts๐ŸŽดShorts
๐ŸŽฏPractice
๐Ÿ“Daily Log๐ŸŽฏPrompts๐Ÿง Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers2

AllBeginnerIntermediateAdvanced
All SourcesarXiv
#dynamic evaluation

Interactive Benchmarks

Beginner
Baoqing Yue, Zihan Zhu et al.Mar 5arXiv

This paper says we should test AI the way real life works: by letting it ask questions, gather clues, and make smart moves step by step under a limited budget.

#interactive benchmarks#information acquisition#budgeted reasoning

Not triaged yet

GISA: A Benchmark for General Information-Seeking Assistant

Intermediate
Yutao Zhu, Xingshuo Zhang et al.Feb 9arXiv

GISA is a new test (benchmark) that checks how well AI assistants can search the web like real people do.

#GISA#information-seeking agents#web search benchmark

Not triaged yet