Jiahong's Personal Blog

Preparation leads to success; lack of preparation leads to failure.



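NLP: LLM Alignment Fine-Tuning - RLHF

NLP: ChatHome

NLP: Secrets-of-RLHF (PPO)

NLP: LLM Attention Optimization - MLA

NLP: LLM Terminology

Python: Ray - Differences Between Remote and Local Functions

RL: IQL

Python: Ray - A Brief Look at the Distributed Architecture

RL: PPO

RL: A Close Reading of the PPO Paper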

Joe Zhou

Stay Hungry. Stay Foolish.

© 2026 Joe Zhou