Markola
Network
Log In / Sign Up
吴正龙
’s
reinforcement-learning
Bookmarks
29 SEP 2025
[Thinking Machines] LoRA Without Regret
LoRA may offer advantages in the cost and speed of post-training, and there are also a few operational reasons to prefer it to full fine-tuning.
large-language-models
reinforcement-learning