qwerrwe / FAQS.md
winglian's picture
shuffle and split dataset after save/load
4f2584f
|
raw
history blame
327 Bytes

FAQs

  • Can you train StableLM with this? Yes, but only with a single GPU atm. Multi GPU support is coming soon! Just waiting on this PR
  • Will this work with Deepspeed? That's still a WIP, but setting export ACCELERATE_USE_DEEPSPEED=true should work in some cases