zhouyang.xie ca5fe63b52 Improve README.MD documentation 10 months ago
__pycache__ ca5fe63b52 Improve README.MD documentation 10 months ago
UnslothAlignPropTrainer.py ca5fe63b52 Improve README.MD documentation 10 months ago
UnslothBCOTrainer.py c5fc011955 Improve code and documentation 10 months ago
UnslothCPOTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothDDPOTrainer.py ca5fe63b52 Improve README.MD documentation 10 months ago
UnslothDPOTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothGKDTrainer.py 7270ddb56d Initialize commit 10 months ago
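UnslothGRPOTrainer.py e2a3b8ab23 2025-3-5 README.MD: LLM selection evaluation; assessment of compute resources required for training and fine-tuning; improve source programs for training-dataset generation, training, and inference 10 months ago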
UnslothKTOTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothNashMDTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothORPOTrainer.py 7270ddb56d Initialize commit 10 months ago
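UnslothOnlineDPOTrainer.py e2a3b8ab23 2025-3-5 README.MD: LLM selection evaluation; assessment of compute resources required for training and fine-tuning; improve source programs for training-dataset generation, training, and inference 10 months ago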
UnslothPPOTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothPRMTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothRLOOTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothRewardTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothSFTTrainer.py 7270ddb56d Initialize commit 10 months ago
UnslothXPOTrainer.py 7270ddb56d Initialize commit 10 months ago