metadata

license: other
license_name: sv3d-nc-community
license_link: LICENSE
pipeline_tag: text-to-3d
extra_gated_prompt: >-
  By clicking "Agree", you agree to the [License
  Agreement](https://huggingface.co/stabilityai/sv3d/blob/main/LICENSE) and
  acknowledge Stability AI's [Privacy
  Policy](https://stability.ai/privacy-policy).
extra_gated_fields:
  Name: text
  Email: text
  Country: country
  Organization or Affiliation: text
  Receive email updates and promotions on Stability AI products, services, and research?:
    type: select
    options:
      - 'Yes'
      - 'No'

Stable Video 3D

Stable Video 3D (SV3D) is a generative model based on Stable Video Diffusion that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.

Please note: For commercial use, please refer to https://stability.ai/membership.

Model Details

This model was trained to generate 21 frames at resolution 576x576 given a context frame of the same size, finetuned from SVD Image-to-Video. Please check our tech report and video summary for details.

We release two variants of the model:

SV3D_u: This variant generates orbital videos based on single image inputs without camera conditioning.
SV3D_p: Extending the capability of SVD3_u, this variant accommodates both single images and orbital views allowing for the creation of 3D video along specified camera paths.

Model Description

Developed by: Stability AI
Model type: Generative image-to-video model
License: StabilityAI Non-Commercial Research Community License.
Commercial License: to use this model commercially, please refer to https://stability.ai/membership

Model Sources

Repository: https://github.com/Stability-AI/generative-models
Tech report: https://stability.ai/s/SV3D_report.pdf
Video summary: https://youtu.be/Zqw4-1LcfWg
Project page: https://sv3d.github.io
arXiv page: https://arxiv.org/abs/2403.12008

Training Dataset

We use renders from the Objaverse dataset, utilizing our enhanced rendering method that more closely replicate the distribution of images found in the real world, significantly improving our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license.

Usage

For usage instructions, please refer to our generative models GitHub repository

Out-of-Scope Use

The model was not trained to be factual or true representations of people or events, and therefore using the model to generate such content is out-of-scope for the abilities of this model. The model should not be used in any way that violates Stability AI's Acceptable Use Policy.