Papers3

#MIMIC-IV

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

The paper asks AI to hunt for insights in big databases without being told exact questions, like a curious scientist instead of a test-taker.

#Deep Data Research#Agentic LLMs#Investigatory Intelligence

Not triaged yet

AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization

Intermediate

Yusheng Liao, Chuan Xuan et al.Jan 20arXiv

AgentEHR is a new, realistic test that asks AI agents to read messy hospital records and make full clinical decisions, not just look up facts.

#AgentEHR#RETROSUM#retrospective summarization

Not triaged yet

Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL

Intermediate

Yifei Shen, Yilun Zhao et al.Jan 14arXiv

This paper introduces CLINSQL, a 633-task benchmark that turns real clinician-style questions into SQL challenges over the MIMIC-IV v3.1 hospital database.

#clinical text-to-SQL#EHR#MIMIC-IV

Not triaged yet