Transformers
Safetensors
Inference Endpoints
File size: 2,693 Bytes
583de57
 
edbf290
 
 
 
 
 
 
67fedb1
17c73af
edbf290
67fedb1
 
edbf290
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
---
license: cc-by-nc-4.0
---

# Model Card for Tailor3D

## ๐Ÿ  Overview

This model card is for the **Tailor3D** project. You can see the more information below.
- **Github Code**: [Link to Github](https://github.com/Qi-Zhangyang/Tailor3D/)
- **Arxiv Paper**: [Link to Arxiv](https://arxiv.org/abs/2407.06191)
- **Project Page**: [Link to Project](https://tailor3d-2024.github.io/)
- **Youtube Video**: [Link to Youtube](https://youtu.be/z9w5CN8mUJQ/)
- **Hugging Face Demo**: [Link to Hugging Face Demo](https://huggingface.co/spaces/alexzyqi/Tailor3D)

## ๐Ÿ—๏ธ Model Details

- Pretrained Models

    | Model | Pretained Model | Training Data |
    | :---: | :---: | :---: |
    | [tailor3d-small-1.0](https://huggingface.co/alexzyqi/tailor3d-small-1.0) | [openlrm-mix-small-1.1](https://huggingface.co/zxhezexin/openlrm-mix-small-1.1) | Objaverse + MVImgNet |
    | [tailor3d-base-1.0](https://huggingface.co/alexzyqi/tailor3d-base-1.0) | [openlrm-mix-base-1.1](https://huggingface.co/zxhezexin/openlrm-mix-base-1.1) | Objaverse + MVImgNet |
    | [tailor3d-large-1.0](https://huggingface.co/alexzyqi/tailor3d-large-1.0)| [openlrm-mix-large-1.1](https://huggingface.co/zxhezexin/openlrm-mix-large-1.1) | Objaverse + MVImgNet |

- Model architecture (same as [OpenLRM](https://huggingface.co/zxhezexin/openlrm-mix-large-1.1)).

    | Type  | Layers | Feat. Dim | Attn. Heads | Triplane Dim. | Input Res. | Image Encoder     |
    | :---: | :----: | :-------: | :---------: | :-----------: | :--------: | :---------------: |
    | small |   12   |    512    |      8      |      32       |    224     | dinov2_vits14_reg |
    | base  |   12   |    768    |     12      |      48       |    336     | dinov2_vitb14_reg |
    | large |   16   |   1024    |     16      |      80       |    448     | dinov2_vitb14_reg |

- Training settings (same as [OpenLRM](https://huggingface.co/zxhezexin/openlrm-mix-large-1.1)).

    | Type  | Rend. Res. | Rend. Patch | Ray Samples |
    | :---: | :--------: | :---------: | :---------: |
    | small |    192     |     64      |     96      |
    | base  |    288     |     96      |     96      |
    | large |    384     |    128      |    128      |

## ๐Ÿ“š OpenLRM as the Pretained Model
This repo, we use the LRM as the pretrained model. These are the link of its [arxiv paper](https://arxiv.org/abs/2311.04400) and [open-source implementation](https://github.com/3DTopia/OpenLRM).

All regulations are the same as the OpenLRM.

## ๐Ÿ“„ License

- The model weights are released under the Creative Commons Attribution-NonCommercial 4.0 International License.
- They are provided for research purposes only, and CANNOT be used commercially.