---
license: other
license_name: apple-ml-research
license_link: LICENSE
tags:
  - 3d-gaussian-splatting
  - view-synthesis
  - monocular-3d
  - gsplat
  - sharp
  - apple
library_name: sharp
pipeline_tag: image-to-3d
---

# SHARP — Single-Image 3D Gaussian View Synthesis

**Mirror of Apple's [SHARP](https://github.com/apple/ml-sharp) model weights**, converted to safetensors format for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA).

## Model Description

SHARP predicts 3D Gaussian Splat (3DGS) parameters from a single photograph in under one second on a GPU, then renders camera trajectory videos using [gsplat](https://github.com/nerfstudio-project/gsplat).

- **Input:** Single RGB image
- **Output:** 3D Gaussian splat representation → camera trajectory video or `.ply` export
- **Speed:** <1s prediction on GPU
- **Resolution:** 1536×1536 internal processing

## Usage in ComfyUI-FFMPEGA

1. Set `llm_model` to `none`
2. Set `no_llm_mode` to `sharp`
3. Connect an image to `image_a` or `image_path_a`
4. Run — the model will auto-download on first use

### Parameters

| Parameter | Default | Description |
|-----------|---------|-------------|
| `sharp_trajectory` | `rotate_forward` | Camera motion: `rotate_forward`, `swipe`, `shake`, or `rotate` |
| `sharp_num_frames` | `60` | Number of video frames (10–300) |
| `sharp_max_disparity` | `0.08` | Lateral camera range |
| `sharp_max_zoom` | `0.15` | Zoom intensity |
| `sharp_save_ply` | `false` | Export a `.ply` Gaussian splat file |
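
If you build workflows programmatically, it can help to check these values before queuing a run. The following is a minimal sketch, not part of ComfyUI-FFMPEGA: the `SharpSettings` class and its `as_node_inputs` helper are hypothetical names introduced here for illustration. The defaults, the trajectory names, and the 10–300 frame range are taken from the table above, and the `llm_model` / `no_llm_mode` values from the usage steps.

```python
from dataclasses import asdict, dataclass

# Trajectory names copied from the parameter table.
TRAJECTORIES = ("rotate_forward", "swipe", "shake", "rotate")


@dataclass(frozen=True)
class SharpSettings:
    """Hypothetical container for the SHARP parameters (illustration only)."""

    sharp_trajectory: str = "rotate_forward"
    sharp_num_frames: int = 60
    sharp_max_disparity: float = 0.08
    sharp_max_zoom: float = 0.15
    sharp_save_ply: bool = False

    def __post_init__(self) -> None:
        # Reject trajectory names that the table does not list.
        if self.sharp_trajectory not in TRAJECTORIES:
            raise ValueError(f"unknown trajectory: {self.sharp_trajectory!r}")
        # The table documents a 10-300 frame range.
        if not 10 <= self.sharp_num_frames <= 300:
            raise ValueError("sharp_num_frames must be between 10 and 300")

    def as_node_inputs(self) -> dict:
        """Merge the validated parameters with the two mode settings from the usage steps."""
        inputs = {"llm_model": "none", "no_llm_mode": "sharp"}
        inputs.update(asdict(self))
        return inputs


settings = SharpSettings(sharp_trajectory="swipe", sharp_num_frames=120)
node_inputs = settings.as_node_inputs()
```

How the resulting dictionary is passed to the node depends on your workflow tooling; the sketch only validates and collects the values.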

## License

> **⚠️ Research Use Only — Non-Commercial**

This model is licensed under the [Apple Machine Learning Research License](LICENSE). Model weights are restricted to **Research Purposes only** — non-commercial scientific research and academic development.

See [LICENSE](LICENSE) for the full terms.

## Citation

```bibtex
@article{stier2025sharp,
  title={SHARP: Synthesizing 3D Gaussians from a Single Monocular Image with High Fidelity and Accurate Geometry},
  author={Stier, Nikolai and Wadhwa, Neal and Szeliski, Richard},
  year={2025},
  url={https://github.com/apple/ml-sharp}
}
```

## Original Repository

- **Code:** [github.com/apple/ml-sharp](https://github.com/apple/ml-sharp) (BSD-like license)
- **Paper:** See repository for links
- **Copyright:** © 2025 Apple Inc. All Rights Reserved.