Papers1262

jina-embeddings-v5-text: Task-Targeted Embedding Distillation

Mohammad Kalim Akram, Saba Sturua et al.Feb 17arXiv

The paper teaches small AI models to make high‑quality text embeddings by first copying a big expert model (distillation) and then practicing four jobs with special mini‑modules (LoRA adapters): retrieval, similarity, clustering, and classification.

#text embeddings#knowledge distillation#contrastive learning

Not triaged yet

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Intermediate

Chansung Park, Juyong Jiang et al.Feb 17arXiv

TAROT teaches code-writing AI the way good teachers teach kids: start at the right level and raise the bar at the right time.

#TAROT#curriculum learning#reinforcement fine-tuning

Not triaged yet

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Intermediate

Taejong Joo, Wenhan Xia et al.Feb 17arXiv

The paper finds a simple trick—randomly skipping some parameter updates—can train large language models better than fancy optimizers.

#Magma#random masking#adaptive optimizers

Not triaged yet

COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression

Intermediate

Denis Makhov, Dmitriy Shopkhoev et al.Feb 16arXiv

COMPOT is a training-free way to shrink Transformer models while keeping their smarts.

#Transformer compression#orthogonal dictionary learning#orthogonal Procrustes

Not triaged yet

Panini: Continual Learning in Token Space via Structured Memory

Intermediate

Shreyas Rajesh, Pavan Holur et al.Feb 16arXiv

Panini is a way for AI to keep learning new facts without changing its brain by storing them as tiny linked Q&A facts in an external memory.

#non-parametric continual learning#structured memory#Generative Semantic Workspace

Not triaged yet

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

Intermediate

Aniketh Garikaparthi, Manasi Patwardhan et al.Feb 16arXiv

ResearchGym is a new "gym" where AI agents are tested on real research projects end to end, not just on toy problems.

#ResearchGym#closed-loop research#objective evaluation

Not triaged yet

Image Generation with a Sphere Encoder

Beginner

Kaiyu Yue, Menglin Jia et al.Feb 16arXiv

The Sphere Encoder is a new way to make images fast by teaching an autoencoder to place all images evenly on a big imaginary sphere and then decode random spots on that sphere back into pictures.

#Sphere Encoder#Spherical Latent Space#RMS Normalization

Not triaged yet

World Models for Policy Refinement in StarCraft II

Intermediate

Yixin Zhang, Ziyi Wang et al.Feb 16arXiv

The paper builds StarWM, a ‘world model’ that lets a StarCraft II agent imagine what will happen a few seconds after it takes an action.

#world model#action-conditioned dynamics#StarCraft II

Not triaged yet

Efficient Text-Guided Convolutional Adapter for the Diffusion Model

Intermediate

Aryan Das, Koushik Biswas et al.Feb 16arXiv

This paper introduces Nexus Adapters, tiny helper networks that let a diffusion model follow both a text prompt and a structure map (like edges or depth) at the same time.

#Nexus Adapter#text-guided adapter#cross-attention

Not triaged yet

Uncertainty-Aware Vision-Language Segmentation for Medical Imaging

Intermediate

Aryan Das, Tanishq Rachamalla et al.Feb 16arXiv

This paper builds a medical image segmentation system that uses both pictures (like X-rays) and words (short clinical text) at the same time.

#medical image segmentation#vision-language segmentation#uncertainty estimation

Not triaged yet

Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Intermediate

Fabian Gröger, Shuo Wen et al.Feb 16arXiv

People thought big AI models were all learning the same overall picture of the world, but those measurements were secretly biased by model size and depth.

#representational similarity#Centered Kernel Alignment (CKA)#mutual k-Nearest Neighbors (mKNN)

Not triaged yet

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Intermediate

Dongrui Liu, Yi Yu et al.Feb 16arXiv

This report studies the biggest new dangers from super-capable AI and tests them in realistic, well-controlled labs so we can fix problems before they cause real harm.

#frontier AI#agentic AI#cyber offense

Not triaged yet

16 17 18 19 20