zhouyang.xie 97fe68c387 更换unsloth grpo的训练数据集并验证 3 mesi fa
..
train.jsonl 97fe68c387 更换unsloth grpo的训练数据集并验证 3 mesi fa
train_windturbine_old.jsonl 97fe68c387 更换unsloth grpo的训练数据集并验证 3 mesi fa
unified_chip2.jsonl 7270ddb56d Initialize commit 3 mesi fa