## <p align="center">LION: Latent Point Diffusion Models for 3D Shape Generation<br><br> NeurIPS 2022 </p>
<div align="center">
<a href="https://www.cs.utoronto.ca/~xiaohui/" target="_blank">Xiaohui&nbsp;Zeng</a> &emsp;
<a href="http://latentspace.cc/" target="_blank">Arash&nbsp;Vahdat</a> &emsp;
<a href="https://www.fwilliams.info/" target="_blank">Francis&nbsp;Williams</a> &emsp;
<a href="https://zgojcic.github.io/" target="_blank">Zan&nbsp;Gojcic</a> &emsp;
<a href="https://orlitany.github.io/" target="_blank">Or&nbsp;Litany</a> &emsp;
<a href="https://www.cs.utoronto.ca/~fidler/" target="_blank">Sanja&nbsp;Fidler</a> &emsp;
<a href="https://karstenkreis.github.io/" target="_blank">Karsten&nbsp;Kreis</a>
<br> <br>
<a href="https://arxiv.org/abs/2210.06978" target="_blank">Paper</a> &emsp;
<a href="https://nv-tlabs.github.io/LION" target="_blank">Project&nbsp;Page</a>
</div>
<p align="center">
<img width="750" alt="Animation" src="assets/animation.gif"/>
</p>
## Update
* Added the point cloud rendering code used for the paper figures; see `utils/render_mitsuba_pc.py`
* When opening an issue, please tag @ZENGXH so that I can respond faster!
## Install
* Dependencies:
    * CUDA 11.6
* Setup the environment: install from the conda file
```
mamba env create -f environment.yml
# mamba env update -f environment.yml
conda activate LION

# install some additional packages (use a proxy if needed)
pip install git+https://github.com/openai/CLIP.git

# build some packages first (optional)
export CUDA_HOME=/usr/local/cuda  # set this if CUDA is not found automatically
# on a cluster you may also need to load the compiler toolchain, e.g.:
module load compilers
module load mpfr
python build_pkg.py
```
Tested with conda version 22.9.0
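To quickly verify the environment, here is a minimal sanity check (just a suggestion; it assumes the PyTorch build installed by `environment.yml` targets CUDA 11.6):
```
import torch

print(torch.__version__)          # installed PyTorch version
print(torch.version.cuda)         # CUDA version PyTorch was built with, e.g. "11.6"
print(torch.cuda.is_available())  # True if a GPU is visible
```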
* Using Docker
    * build the Docker image with `bash ./docker/build_docker.sh`
    * launch the container with `bash ./docker/run.sh`
## Demo
Run `python demo.py`; it will load the released text2shape model from Hugging Face and generate a chair point cloud. Download the checkpoints from the [Hugging Face Hub](https://huggingface.co/xiaohui2022/lion_ckpt).
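If you prefer to fetch the checkpoints ahead of time, a minimal sketch using the `huggingface_hub` package (assuming it is installed; the repo id is taken from the link above):
```
from huggingface_hub import snapshot_download

# download the released LION checkpoints into ./lion_ckpt
ckpt_dir = snapshot_download(repo_id="xiaohui2022/lion_ckpt", local_dir="./lion_ckpt")
print(f"checkpoints downloaded to {ckpt_dir}")
```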
## Released checkpoint and samples
* will be released soon
* after downloading, verify the checksum with `python ./script/check_sum.py ./lion_ckpt.zip`
* put the downloaded file under `./lion_ckpt/`
## Training
### data
* ShapeNet can be downloaded [here](https://github.com/stevenygd/PointFlow#dataset).
* Put the downloaded data as `./data/ShapeNetCore.v2.PC15k` *or* edit the `pointflow` entry in `./datasets/data_path.py` for the ShapeNet dataset path.
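To double-check the dataset location, a quick sanity check (this assumes the PointFlow release layout of `<synset_id>/<split>/<shape_id>.npy` with each file an `(N, 3)` point array; adjust if your layout differs):
```
import numpy as np
from pathlib import Path

root = Path("./data/ShapeNetCore.v2.PC15k")
sample = next(root.glob("*/train/*.npy"))   # grab one training point cloud
points = np.load(sample)
print(sample, points.shape)                 # expect roughly (15000, 3)
```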
### train VAE
* run `bash ./script/train_vae.sh $NGPU` (the released checkpoint is trained with `NGPU=4` on A100)
* if you want to use Comet to log the experiment, add a `.comet_api` file under the current folder and write the API key into it as `{"api_key": "${COMET_API_KEY}"}`
### train diffusion prior
* requires the VAE checkpoint trained above
* run `bash ./script/train_prior.sh $NGPU` (the released checkpoint is trained with `NGPU=8` across 2 nodes on V100)
### train diffusion prior with CLIP features
* this script trains a model for the single-view-reconstruction or text2shape task
* the idea is that we take the encoder and decoder trained on the data as usual (without conditioning input), and when training the diffusion prior we feed the CLIP image embedding as conditioning input: the shape-latent prior model takes the CLIP embedding through its AdaGN layers (see the sketch after this list)
* requires the VAE checkpoint trained above
* requires the rendered ShapeNet data; you can render it yourself or download it from [here](https://github.com/autonomousvision/occupancy_networks#preprocessed-data)
* put the rendered data under `./data/shapenet_render/` or edit the `clip_forge_image` entry in `./datasets/data_path.py`
* the image data is read in `./datasets/pointflow_datasets.py` via the `render_img_path` variable; you may need to customize this variable depending on the folder structure
* run `bash ./script/train_prior_clip.sh $NGPU`
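As a rough illustration of the conditioning mechanism described above (a hedged sketch only, not the exact layer used in this repo; the module and argument names here are made up for the example):
```
import torch
import torch.nn as nn

class ClipAdaGN(nn.Module):
    """Adaptive group norm conditioned on a CLIP embedding (illustrative only)."""
    def __init__(self, num_channels, clip_dim=512, num_groups=8):
        super().__init__()
        self.norm = nn.GroupNorm(num_groups, num_channels, affine=False)
        # predict a per-channel scale and shift from the CLIP embedding
        self.to_scale_shift = nn.Linear(clip_dim, 2 * num_channels)

    def forward(self, h, clip_emb):
        # h: (B, C, N) latent point features; clip_emb: (B, clip_dim)
        scale, shift = self.to_scale_shift(clip_emb).chunk(2, dim=-1)
        return self.norm(h) * (1 + scale.unsqueeze(-1)) + shift.unsqueeze(-1)
```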
### (Optional) monitor experiments
* (tested) use comet-ml: add a `.comet_api` file under this `LION` folder; example `.comet_api` file:
```
{"api_key": "...", "project_name": "lion", "workspace": "..."}
```
* (not tested) use wandb: add a `.wandb_api` file and set the env variable `export USE_WB=1` before training; example `.wandb_api` file:
```
{"project": "...", "entity": "..."}
```
* (not tested) use tensorboard, set the env variable `export USE_TFB=1` before training
* see the `utils/utils.py` file for the details of the experiment logger; I usually use comet-ml for my experiments
### evaluate a trained prior
* download the test data (Table 1) from [here](https://drive.google.com/file/d/1uEp0o6UpRqfYwvRXQGZ5ZgT1IYBQvUSV/view?usp=share_link), unzip and put it as `./datasets/test_data/`
* download the released checkpoint from above
```
checkpoint="./lion_ckpt/unconditional/airplane/checkpoints/model.pt"
bash ./script/eval.sh $checkpoint # will take 1-2 hours
```
#### other test data
* ShapeNet-Vol test data:
* please check [here](https://github.com/nv-tlabs/LION/issues/20#issuecomment-1436315100) before using this data
* [all category](https://drive.google.com/file/d/1QXrCbYKjTIAnH1OhZMathwdtQEXG5TjO/view?usp=sharing): 1000 shapes are sampled from the full validation set
* [chair, airplane, car](https://drive.google.com/file/d/11ZU_Bq5JwN3ggI7Ffj4NAjIxxhc2pNZ8/view?usp=share_link)
* Table 20 and Table 21 (PointFlow test data):
* check [here](https://github.com/nv-tlabs/LION/issues/26#issuecomment-1466915318) before using this data
* [mug](https://drive.google.com/file/d/1lvJh2V94Nd7nZPcRqsCwW5oygsHOD3EE/view?usp=share_link) and [bottle](https://drive.google.com/file/d/1MRl4EgW6-4hOrdRq_e2iGh348a0aCH5f/view?usp=share_link)
* 55-category [data](https://drive.google.com/file/d/1Rbj1_33sN_S2YUbcJu6h922tKuJyQ2Dm/view?usp=share_link)
## Evaluate the samples with the 1-NNA metrics
* download the test data from [here](https://drive.google.com/file/d/1uEp0o6UpRqfYwvRXQGZ5ZgT1IYBQvUSV/view?usp=share_link), unzip and put it as `./datasets/test_data/`
* run `python ./script/compute_score.py` (Note: for the ShapeNet-Vol data and Tables 20/21, you need to set `norm_box=True`); a minimal sketch of the metric itself is shown below
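For reference, 1-NNA is the accuracy of a 1-nearest-neighbor classifier that tries to tell generated samples from reference shapes; values close to 50% are best. A minimal NumPy sketch, assuming the pairwise (e.g. Chamfer) distance matrices are already computed (this is not the repo's implementation in `./script/compute_score.py`):
```
import numpy as np

def one_nna(d_ss, d_rr, d_sr):
    """1-NNA from pairwise distances: d_ss (S x S), d_rr (R x R), d_sr (S x R)."""
    d_ss = d_ss.copy()
    d_rr = d_rr.copy()
    np.fill_diagonal(d_ss, np.inf)  # exclude self-matches
    np.fill_diagonal(d_rr, np.inf)
    # a shape is classified correctly if its nearest neighbor comes from its own set
    s_correct = d_ss.min(axis=1) < d_sr.min(axis=1)
    r_correct = d_rr.min(axis=1) < d_sr.min(axis=0)
    return (s_correct.sum() + r_correct.sum()) / (len(d_ss) + len(d_rr))
```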
## Citation
```
@inproceedings{zeng2022lion,
  title={LION: Latent Point Diffusion Models for 3D Shape Generation},
  author={Xiaohui Zeng and Arash Vahdat and Francis Williams and Zan Gojcic and Or Litany and Sanja Fidler and Karsten Kreis},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2022}
}
```