| --- |
| license: other |
| license_name: sv3d-nc-community |
| license_link: LICENSE |
| datasets: |
| - allenai/objaverse |
| pipeline_tag: image-to-video |
| extra_gated_prompt: >- |
|   By clicking "Agree", you agree to the [License Agreement](https://huggingface.co/stabilityai/sv3d/blob/main/LICENSE.md) and acknowledge Stability AI's [Privacy Policy](https://stability.ai/privacy-policy). |
| extra_gated_fields: |
|   Name: text |
|   Email: text |
|   Country: country |
|   Organization or Affiliation: text |
|   Receive email updates and promotions on Stability AI products, services, and research?: |
|     type: select |
|     options: |
|       - Yes |
|       - No |
| --- |
| # Stable Video 3D |
|  |
| **Stable Video 3D (SV3D)** is a generative model based on [Stable Video Diffusion](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt) that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object. |
|
|
| Please note: For commercial use, please refer to https://stability.ai/license. |
|
|
| ## Model Details |
|
|
| This model was finetuned from SVD Image-to-Video and trained to generate 21 frames at a resolution of 576x576, given a context frame of the same size. Please see our [tech report](https://stability.ai/s/SV3D_report.pdf) and [video summary](https://youtu.be/Zqw4-1LcfWg) for details. |
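
Because the context frame is expected at 576x576, an input image of another size has to be brought to that shape first. The snippet below is a minimal sketch of one way to work out the geometry, assuming the image is scaled to fit and then centred on a square canvas. The helper name `fit_to_square` and the pad-rather-than-crop choice are illustrative assumptions, not the preprocessing used in the official repository.

```python
def fit_to_square(width, height, target=576):
    """Compute resize and centring geometry for a target x target canvas.

    Hypothetical helper: scales the longer side to `target` and centres
    the result. The official pipeline may preprocess inputs differently.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    # Scale so that the longer side exactly matches the target size.
    scale = target / max(width, height)
    new_w = max(1, round(width * scale))
    new_h = max(1, round(height * scale))

    # Offsets that centre the resized image on the square canvas.
    pad_left = (target - new_w) // 2
    pad_top = (target - new_h) // 2
    return {"size": (new_w, new_h), "offset": (pad_left, pad_top)}


if __name__ == "__main__":
    # A 1152x576 landscape image is halved and centred vertically.
    print(fit_to_square(1152, 576))
```

The returned size and offset can be passed to any image library's resize and paste operations to build the 576x576 conditioning frame.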
|
|
| We release two variants of the model: |
| 1. **SV3D_u**: This variant generates orbital videos based on single image inputs without camera conditioning. |
| 2. **SV3D_p**: Extending the capability of SV3D_u, this variant accommodates both single images and orbital views, allowing for the creation of 3D video along specified camera paths. |
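
To make the idea of a "specified camera path" concrete, the sketch below builds a simple circular orbit with one pose per output frame. It is only an illustration: the function name `orbit_camera_path`, the evenly spaced azimuths, the fixed elevation, the radius, and the z-up axis convention are all assumptions made for this example and are not taken from the SV3D code or tech report.

```python
import math


def orbit_camera_path(num_frames=21, elevation_deg=10.0, radius=2.0):
    """Return one (azimuth_deg, elevation_deg, (x, y, z)) tuple per frame.

    Hypothetical helper: a circular orbit at constant elevation around
    an object at the origin, using a z-up convention (an assumption).
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")

    elevation = math.radians(elevation_deg)
    path = []
    for i in range(num_frames):
        # Spread the frames evenly over one full turn, starting at 0 degrees.
        azimuth_deg = 360.0 * i / num_frames
        azimuth = math.radians(azimuth_deg)

        # Spherical-to-Cartesian conversion for the camera position.
        x = radius * math.cos(elevation) * math.cos(azimuth)
        y = radius * math.cos(elevation) * math.sin(azimuth)
        z = radius * math.sin(elevation)
        path.append((azimuth_deg, elevation_deg, (x, y, z)))
    return path


if __name__ == "__main__":
    for azimuth_deg, elevation_deg, position in orbit_camera_path()[:3]:
        print(azimuth_deg, elevation_deg, position)
```

A non-uniform path, for example one with varying elevation, could be described the same way by supplying a per-frame elevation instead of a single value.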
| |
| |
| ### Model Description |
| |
| * **Developed by**: [Stability AI](https://stability.ai/) |
| * **Model type**: Generative image-to-video model |
| * **License**: [Stability AI Community License](https://huggingface.co/stabilityai/sv3d/raw/main/LICENSE.md). |
| * **Commercial License**: to use this model commercially, please refer to https://stability.ai/license |
| |
| |
| ### Model Sources |
| |
| * **Repository**: https://github.com/Stability-AI/generative-models |
| * **Tech report**: https://stability.ai/s/SV3D_report.pdf |
| * **Video summary**: https://youtu.be/Zqw4-1LcfWg |
| * **Project page**: https://sv3d.github.io |
| * **arXiv page**: https://arxiv.org/abs/2403.12008 |
|
|
| ### Training Dataset |
|
|
| We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, produced with our enhanced rendering method, which more closely replicates the distribution of images found in the real world and significantly improves our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license. |
|
|
|
|
| ## Usage |
|
|
| For usage instructions, please refer to our [generative models GitHub repository](https://github.com/Stability-AI/generative-models). |
|
|
|
|
| ### Out-of-Scope Use |
|
|
| The model was not trained to produce factual or true representations of people or events, |
| and therefore using it to generate such content is outside the scope of its abilities. |
| The model should not be used in any way that violates Stability AI's [Acceptable Use Policy](https://stability.ai/use-policy). |