Image-to-Video
sv3d / README.md
MasterDee's picture
Update README.md
a757d8c verified
|
raw
history blame
3.17 kB
---
license: other
license_name: sv3d-nc-community
license_link: LICENSE
pipeline_tag: text-to-3d
extra_gated_prompt: >-
By clicking "Agree", you agree to the [License
Agreement](https://huggingface.co/stabilityai/sv3d/blob/main/LICENSE) and
acknowledge Stability AI's [Privacy
Policy](https://stability.ai/privacy-policy).
extra_gated_fields:
Name: text
Email: text
Country: country
Organization or Affiliation: text
Receive email updates and promotions on Stability AI products, services, and research?:
type: select
options:
- 'Yes'
- 'No'
---
# Stable Video 3D
![](sv3doutputs.gif)
**Stable Video 3D (SV3D)** is a generative model based on [Stable Video Diffusion](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt) that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.
Please note: For commercial use, please refer to https://stability.ai/membership.
## Model Details
This model was trained to generate 21 frames at resolution 576x576 given a context frame of the same size, finetuned from SVD Image-to-Video. Please check our [tech report](https://stability.ai/s/SV3D_report.pdf) and [video summary](https://youtu.be/Zqw4-1LcfWg) for details.
We release two variants of the model:
1. **SV3D_u**: This variant generates orbital videos based on single image inputs without camera conditioning.
2. **SV3D_p**: Extending the capability of SVD3_u, this variant accommodates both single images and orbital views allowing for the creation of 3D video along specified camera paths.
### Model Description
* **Developed by**: [Stability AI](https://stability.ai/)
* **Model type**: Generative image-to-video model
* **License**: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/sv3d/raw/main/LICENSE).
* **Commercial License**: to use this model commercially, please refer to https://stability.ai/membership
### Model Sources
* **Repository**: https://github.com/Stability-AI/generative-models
* **Tech report**: https://stability.ai/s/SV3D_report.pdf
* **Video summary**: https://youtu.be/Zqw4-1LcfWg
* **Project page**: https://sv3d.github.io
* **arXiv page**: https://arxiv.org/abs/2403.12008
### Training Dataset
We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, utilizing our enhanced rendering method that more closely replicate the distribution of images found in the real world, significantly improving our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license.
## Usage
For usage instructions, please refer to our [generative models GitHub repository](https://github.com/Stability-AI/generative-models)
### Out-of-Scope Use
The model was not trained to be factual or true representations of people or events,
and therefore using the model to generate such content is out-of-scope for the abilities of this model.
The model should not be used in any way that violates Stability AI's [Acceptable Use Policy](https://stability.ai/use-policy).