- UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
  Paper • 2311.09257 • Published • 45
- VideoPoet: A Large Language Model for Zero-Shot Video Generation
  Paper • 2312.14125 • Published • 44
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
  Paper • 2312.16862 • Published • 30
- VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM
  Paper • 2401.01256 • Published • 19
Alex PRO
aslessor
AI & ML interests
None yet
Organizations
None yet
Collections
8
models
8
- aslessor/layoutlm-invoices
  Document Question Answering • Updated • 1
- aslessor/layoutlm-funsd-tf
  Token Classification • Updated • 2
- aslessor/bert-large-uncased-whole-word-masking-finetuned-squad
  Other • Updated • 3
- aslessor/donut-base-finetuned-cord-v2
  Image-to-Text • Updated • 10
- aslessor/layoutlmv2-base-uncased
  Other • Updated • 3
- aslessor/layoutlm-funsd
  Other • Updated • 1
- aslessor/layoutlm-funsd-2
  Token Classification • Updated • 1
- aslessor/custom-inf-example
  Updated