| __pycache__ | 2275fcf164 | Refactor train_model_grpo.py following object-oriented principles | 3 months ago |
| UnslothAlignPropTrainer.py | 2275fcf164 | Refactor train_model_grpo.py following object-oriented principles | 3 months ago |
| UnslothBCOTrainer.py | c5fc011955 | Improve code and documentation | 3 months ago |
| UnslothCPOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothDDPOTrainer.py | 2275fcf164 | Refactor train_model_grpo.py following object-oriented principles | 3 months ago |
| UnslothDPOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothGKDTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothGRPOTrainer.py | e2a3b8ab23 | 2025-3-5 README.MD: evaluate large-model selection and the compute resources required for training and fine-tuning; improve the source programs for training-dataset generation, training, and inference | 3 months ago |
| UnslothKTOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothNashMDTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothORPOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothOnlineDPOTrainer.py | e2a3b8ab23 | 2025-3-5 README.MD: evaluate large-model selection and the compute resources required for training and fine-tuning; improve the source programs for training-dataset generation, training, and inference | 3 months ago |
| UnslothPPOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothPRMTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothRLOOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothRewardTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothSFTTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |
| UnslothXPOTrainer.py | 7270ddb56d | Initialize commit | 3 months ago |