吴正龙’s reinforcement-learning Bookmarks

29 SEP 2025
LoRA may offer advantages in the cost and speed of post-training, and there are also a few operational reasons to prefer it to full fine-tuning.