Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published 19 days ago • 70
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Paper • 2409.05840 • Published 14 days ago • 43
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published 15 days ago • 29
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 4 days ago • 99