🎓How I Study AIHISA
📖Read
📄Papers📰Blogs🎬Courses
💡Learn
🛤️Paths📚Topics💡Concepts🎴Shorts
🎯Practice
📝Daily Log🎯Prompts🧠Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers1055

AllBeginnerIntermediateAdvanced
All SourcesarXiv

GLM-5: from Vibe Coding to Agentic Engineering

Intermediate
GLM-5 Team, Aohan Zeng et al.Feb 17arXiv

GLM-5 is a new open-weight AI model that moves from 'vibe coding' (prompting the model to write code) to 'agentic engineering' (letting the model plan, build, test, and fix software on its own).

#GLM-5#Agentic Engineering#DeepSeek Sparse Attention

Spanning the Visual Analogy Space with a Weight Basis of LoRAs

Intermediate
Hila Manor, Rinon Gal et al.Feb 17arXiv

This paper teaches image models to copy a change shown in one image pair and apply it to a new image, like saying 'hat added here, add a similar hat there.'

#visual analogy learning#LoRA#LoRA basis

World Action Models are Zero-shot Policies

Intermediate
Seonghyeon Ye, Yunhao Ge et al.Feb 17arXiv

DreamZero is a robot brain that learns actions by predicting short videos of the future and the matching moves at the same time.

#World Action Models#DreamZero#video diffusion

jina-embeddings-v5-text: Task-Targeted Embedding Distillation

Intermediate
Mohammad Kalim Akram, Saba Sturua et al.Feb 17arXiv

The paper teaches small AI models to make high‑quality text embeddings by first copying a big expert model (distillation) and then practicing four jobs with special mini‑modules (LoRA adapters): retrieval, similarity, clustering, and classification.

#text embeddings#knowledge distillation#contrastive learning

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Intermediate
Chansung Park, Juyong Jiang et al.Feb 17arXiv

TAROT teaches code-writing AI the way good teachers teach kids: start at the right level and raise the bar at the right time.

#TAROT#curriculum learning#reinforcement fine-tuning

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Intermediate
Taejong Joo, Wenhan Xia et al.Feb 17arXiv

The paper finds a simple trick—randomly skipping some parameter updates—can train large language models better than fancy optimizers.

#Magma#random masking#adaptive optimizers

COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression

Intermediate
Denis Makhov, Dmitriy Shopkhoev et al.Feb 16arXiv

COMPOT is a training-free way to shrink Transformer models while keeping their smarts.

#Transformer compression#orthogonal dictionary learning#orthogonal Procrustes

Panini: Continual Learning in Token Space via Structured Memory

Intermediate
Shreyas Rajesh, Pavan Holur et al.Feb 16arXiv

Panini is a way for AI to keep learning new facts without changing its brain by storing them as tiny linked Q&A facts in an external memory.

#non-parametric continual learning#structured memory#Generative Semantic Workspace

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

Intermediate
Aniketh Garikaparthi, Manasi Patwardhan et al.Feb 16arXiv

ResearchGym is a new "gym" where AI agents are tested on real research projects end to end, not just on toy problems.

#ResearchGym#closed-loop research#objective evaluation

World Models for Policy Refinement in StarCraft II

Intermediate
Yixin Zhang, Ziyi Wang et al.Feb 16arXiv

The paper builds StarWM, a ‘world model’ that lets a StarCraft II agent imagine what will happen a few seconds after it takes an action.

#world model#action-conditioned dynamics#StarCraft II

Efficient Text-Guided Convolutional Adapter for the Diffusion Model

Intermediate
Aryan Das, Koushik Biswas et al.Feb 16arXiv

This paper introduces Nexus Adapters, tiny helper networks that let a diffusion model follow both a text prompt and a structure map (like edges or depth) at the same time.

#Nexus Adapter#text-guided adapter#cross-attention

Uncertainty-Aware Vision-Language Segmentation for Medical Imaging

Intermediate
Aryan Das, Tanishq Rachamalla et al.Feb 16arXiv

This paper builds a medical image segmentation system that uses both pictures (like X-rays) and words (short clinical text) at the same time.

#medical image segmentation#vision-language segmentation#uncertainty estimation
1213141516