Jiahong 的个人博客

凡事预则立,不预则废


  • Home

  • Tags

  • Archives

  • Navigation

  • Search
Excellent! 618 posts in total. Keep on posting.

NLP——LLM对齐微调-RuscaRL

NLP——LLM对齐微调-Self-Rewarding-RubricRL

NLP——LLM对齐微调-SDPO

NLP——LLM对齐微调-Skywork-Reward

NLP——LLM对齐微调-VAPO

NLP——LLM对齐微调-SimPO

NLP——LLM对齐微调-TIS

NLP——LLM-API调用示例

NLP——样本packing与权重讨论

NLP——LoRA-Without-Regret

1…111213…62
Joe Zhou

Joe Zhou

Stay Hungry. Stay Foolish.

618 posts
52 tags
GitHub E-Mail
© 2026 Joe Zhou
Powered by Hexo
|
Theme — NexT.Gemini v5.1.4