.. |
__pycache__
|
2275fcf164
Refactor train_model_grpo.py following object-oriented principles
|
3 months ago |
UnslothAlignPropTrainer.py
|
2275fcf164
Refactor train_model_grpo.py following object-oriented principles
|
3 months ago |
UnslothBCOTrainer.py
|
c5fc011955
Improve code and documentation
|
3 months ago |
UnslothCPOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothDDPOTrainer.py
|
2275fcf164
Refactor train_model_grpo.py following object-oriented principles
|
3 months ago |
UnslothDPOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothGKDTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothGRPOTrainer.py
|
e2a3b8ab23
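2025-3-5 README.MD: evaluate large-model selection and the compute resources required for training and fine-tuning; improve the source code for training-dataset generation, training, and inference
|
3 months ago |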
UnslothKTOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothNashMDTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothORPOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothOnlineDPOTrainer.py
|
e2a3b8ab23
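2025-3-5 README.MD: evaluate large-model selection and the compute resources required for training and fine-tuning; improve the source code for training-dataset generation, training, and inference
|
3 months ago |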
UnslothPPOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothPRMTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothRLOOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothRewardTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothSFTTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |
UnslothXPOTrainer.py
|
7270ddb56d
Initialize commit
|
3 months ago |