> inference itself is a few seconds on a recent Mac
This is impressive as hell
Very cool demo. It works in about ~9 seconds on my machine.
A few asks if you're going to devote more time to the project: can you make a full orbital camera - it seems to not be able to orbit 360? Also, can you use double click drag to move the camera in non-orbiting mode for view refinement? (Super minor nitpicks - this demo is really cool.)
> Caveats: SHARP's released weights are research-use only (Apple's model license, not the code's).
Nobody should GAF about this. We have all the major players distilling each other in the open. This gives Apple the ability to slap you with lawyers, but in practice you'll often get more done if you just break the rules.
Do you know of any other image-to-splat models? WorldLabs has a few versions of their Marble model, and the Tencent Hunyuan team just released HyWorld as open weights:
https://github.com/Tencent-Hunyuan/HY-World-2.0
HyWorld looks to be SOTA and better than all the other players.
Apple's Sharp is awesome in that it is fast, but it only generates a small depth sample from the image. There are no back faces or splats, so if you move the camera even slightly from the original perspective, you'll see lots of holes.