DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
Intermediate | Yicheng Chen, Zerun Ma et al. | Feb 11 | arXiv
DataChef uses reinforcement learning to train a large language model as a data chef: it plans and codes complete data processing pipelines that turn messy datasets into training data for adapting other models.
#data recipe #data processing pipeline #reinforcement learning