- Can large language models explore in-context?
  Paper • 2403.15371 • Published • 31
- Advancing LLM Reasoning Generalists with Preference Trees
  Paper • 2404.02078 • Published • 43
- Long-context LLMs Struggle with Long In-context Learning
  Paper • 2404.02060 • Published • 34
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 59

Collections including paper arxiv:2409.12917
- ORPO: Monolithic Preference Optimization without Reference Model
  Paper • 2403.07691 • Published • 59
- sDPO: Don't Use Your Data All at Once
  Paper • 2403.19270 • Published • 38
- Teaching Large Language Models to Reason with Reinforcement Learning
  Paper • 2403.04642 • Published • 46
- Best Practices and Lessons Learned on Synthetic Data for Language Models
  Paper • 2404.07503 • Published • 29

- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
  Paper • 2403.03507 • Published • 182
- RAFT: Adapting Language Model to Domain Specific RAG
  Paper • 2403.10131 • Published • 66
- LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
  Paper • 2403.13372 • Published • 58
- InternLM2 Technical Report
  Paper • 2403.17297 • Published • 28

- Nemotron-4 15B Technical Report
  Paper • 2402.16819 • Published • 42
- InternLM2 Technical Report
  Paper • 2403.17297 • Published • 28
- Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
  Paper • 2404.04167 • Published • 12
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
  Paper • 2402.14905 • Published • 107

- LoRA+: Efficient Low Rank Adaptation of Large Models
  Paper • 2402.12354 • Published • 6
- The FinBen: An Holistic Financial Benchmark for Large Language Models
  Paper • 2402.12659 • Published • 16
- TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
  Paper • 2402.13249 • Published • 10
- TrustLLM: Trustworthiness in Large Language Models
  Paper • 2401.05561 • Published • 63

- Can Large Language Models Understand Context?
  Paper • 2402.00858 • Published • 21
- OLMo: Accelerating the Science of Language Models
  Paper • 2402.00838 • Published • 79
- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 141
- SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
  Paper • 2401.17072 • Published • 25

- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 141
- Orion-14B: Open-source Multilingual Large Language Models
  Paper • 2401.12246 • Published • 10
- MambaByte: Token-free Selective State Space Model
  Paper • 2401.13660 • Published • 49
- MM-LLMs: Recent Advances in MultiModal Large Language Models
  Paper • 2401.13601 • Published • 44

- Attention Is All You Need
  Paper • 1706.03762 • Published • 41
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 8
- Mixtral of Experts
  Paper • 2401.04088 • Published • 157
- Mistral 7B
  Paper • 2310.06825 • Published • 47