
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks

Xiang Xu, Karl D.D. Willis, Joseph G. Lambourne, Chin-Yi Cheng, Pradeep Kumar Jayaraman, Yasutaka Furukawa

ICML 2022
project | paper | youtube

Installation

Environment

Linux
Python >= 3.8
PyTorch >= 1.10

Dependencies

Install PyTorch 1.10 with the correct CUDA version.
Install other dependencies:
```
pip install -r requirements.txt
```
Install pythonocc following the instructions here.

Docker

We also provide a Docker image for running SkexGen. You can download it from Docker Hub (~10GB).
Note: only tested on CUDA 11.4.

Data

Download the raw JSON data from DeepCAD. Unzip it into the data folder in the root of this repository. Also download train_val_test_split.json and place it in the data folder as well.

Follow these steps to convert DeepCAD data to SkexGen format:

# Under utils folder:

# parse DeepCAD json to a simple obj format 
  python convert.py --data_folder ../data/cad_json --output_folder ../data/cad_obj

# normalize CAD and update the obj file
  python normalize.py --data_folder ../data/cad_obj --out_folder ../data/cad_norm

# parse obj to primitive sequence 
  python parse.py --input ../data/cad_norm --output ../data/cad_data --bit 6

# remove duplicated sketch data
  python deduplicate.py --datapath ../data/cad_data --hash_type s

# remove duplicated extrude data
  python deduplicate.py --datapath ../data/cad_data --hash_type e

# find all the invalid CAD models
  python invalid.py --datapath ../data/cad_data --bit 6
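
The `--bit 6` flag passed to parse.py and invalid.py (and later to the training scripts) suggests that coordinates are quantized to 6 bits, i.e. 64 discrete levels. The snippet below is a minimal sketch of that idea, assuming coordinates normalized to [-1, 1]. The function names, the value range, and the rounding rule are illustrative assumptions and may differ from what parse.py actually does.

```python
def quantize(value, bit=6, lo=-1.0, hi=1.0):
    """Map a float in [lo, hi] to an integer bin in 0 .. 2**bit - 1 (assumed scheme)."""
    levels = 2 ** bit
    clipped = min(max(value, lo), hi)        # keep out-of-range values inside [lo, hi]
    scaled = (clipped - lo) / (hi - lo)      # now in [0, 1]
    return min(int(scaled * levels), levels - 1)


def dequantize(token, bit=6, lo=-1.0, hi=1.0):
    """Map an integer bin back to the centre of that bin."""
    levels = 2 ** bit
    return (token + 0.5) / levels * (hi - lo) + lo


token = quantize(0.3)          # one of 64 possible bins
approx = dequantize(token)     # close to 0.3, but not exact
```

With 6 bits over a range of width 2, the round-trip error is at most half a bin, i.e. 1/64.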

When running convert.py, some files in the DeepCAD dataset fail to generate valid solid models. Use the --verbose option to see additional details about the problem files. If the convert.py script hangs during processing, it can be safely restarted and will continue from where it left off.
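
For intuition about the two deduplicate.py passes above (`--hash_type s` for sketches, `--hash_type e` for extrudes), here is a minimal sketch of hash-based deduplication. The record layout (`name`, `sketch`, `extrude` keys) and the use of SHA-256 over a JSON dump are assumptions made for illustration, not the repository's actual data format or hashing scheme.

```python
import hashlib
import json


def content_hash(record):
    """Hash a JSON-serializable record; identical content gives identical digests."""
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def deduplicate(models, key):
    """Keep the first model seen for each distinct value of model[key]."""
    seen = set()
    unique = []
    for model in models:
        digest = content_hash(model[key])
        if digest not in seen:
            seen.add(digest)
            unique.append(model)
    return unique


# Hypothetical toy records: "a" and "b" share a sketch, "a" and "c" share an extrude.
models = [
    {"name": "a", "sketch": [[0, 0], [10, 0]], "extrude": [5]},
    {"name": "b", "sketch": [[0, 0], [10, 0]], "extrude": [7]},
    {"name": "c", "sketch": [[3, 3], [9, 9]], "extrude": [5]},
]
by_sketch = deduplicate(models, "sketch")    # analogous to --hash_type s
by_extrude = deduplicate(models, "extrude")  # analogous to --hash_type e
```

Running the two passes separately yields two different training subsets, which is consistent with the separate train_deduplicate_s.pkl and train_deduplicate_e.pkl files used in the training commands.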

You can also download the pre-processed data.

Training

Train sketch branch (topology encoder, geometry encoder, sketch decoder):

  python train_sketch.py --train_data data/cad_data/train_deduplicate_s.pkl \
                         --output proj_log/exp_sketch \
                         --invalid data/cad_data/train_invalid.pkl \
                         --val_data data/cad_data/val.pkl \
                         --bit 6 --maxlen 200 --batchsize 128 --device 0

maxlen: sketch sequence length (default 200)

Train extrude branch (extrude encoder, extrude decoder):

  python train_extrude.py --train_data data/cad_data/train_deduplicate_e.pkl \
                          --val_data data/cad_data/val.pkl \
                          --output proj_log/exp_extrude \
                          --bit 6 --maxlen 5 --batchsize 128 --device 0

maxlen: number of extrudes (default 5)

Extract codes:

  python extract_code.py --sketch_weight proj_log/exp_sketch \
                         --ext_weight proj_log/exp_extrude \
                         --device 0 --maxlen 200 --bit 6 \
                         --output proj_log/exp_code \
                         --data data/cad_data/train.pkl \
                         --invalid data/cad_data/train_invalid.pkl

Train code selector (random generation):

  python train_code.py --input proj_log/exp_code/code.pkl \
                       --output proj_log/exp_code \
                       --batchsize 512 --device 0 \
                       --code 1000 --seqlen 10

seqlen: number of codes per model (4 topology + 2 geometry + 4 extrude = 10)

code: maximum codebook size (1000)
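
To make the `--seqlen 10` and `--code 1000` settings concrete, the sketch below splits a 10-code sequence into its three groups. The ordering (topology, then geometry, then extrude), the 0-based index range, and all names here are assumptions for illustration. The actual layout in code.pkl may differ.

```python
# Assumed ordering of the 10 codes: topology, then geometry, then extrude.
CODE_LAYOUT = (("topology", 4), ("geometry", 2), ("extrude", 4))
CODEBOOK_SIZE = 1000  # mirrors --code 1000


def split_codes(codes):
    """Split a flat code sequence into named groups following CODE_LAYOUT."""
    expected = sum(count for _, count in CODE_LAYOUT)
    if len(codes) != expected:
        raise ValueError(f"expected {expected} codes, got {len(codes)}")
    if any(not 0 <= c < CODEBOOK_SIZE for c in codes):
        raise ValueError("code index outside the codebook")
    groups = {}
    start = 0
    for name, count in CODE_LAYOUT:
        groups[name] = list(codes[start:start + count])
        start += count
    return groups


groups = split_codes([12, 7, 403, 88, 951, 3, 40, 41, 42, 999])
```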

Download our pretrained models

Evaluation

Random generation:

# sample the codes and autoregressively decode them to sketch and extrude
  python sample.py --sketch_weight proj_log/exp_sketch \
                      --ext_weight proj_log/exp_extrude \
                      --code_weight proj_log/exp_code \
                      --device 1 --bit 6 \
                      --output proj_log/samples

Visualization:

# Under utils folder:

# convert generated sketch-and-extrude to stl format (timeout prevents occ from hanging)
  timeout 180 python visual_obj.py --data_folder ../proj_log/samples 

# render and visualize to images 
  python cad_img.py  --input_dir ../proj_log/samples --output_dir ../proj_log/samples_visual

Evaluate the CAD models (after running visual_obj.py):

# Under utils folder:

# uniformly sample 2000 points 
  python sample_points.py --in_dir ../proj_log/samples --out_dir pcd

# evaluate performance 
  python eval_cad.py --fake ../proj_log/samples \
                     --real ../data/test_eval

Download test_eval and unzip it under the data folder. It contains the point clouds from the DeepCAD test set.
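
As background for the "uniformly sample 2000 points" step, the following is a minimal, dependency-free sketch of uniform surface sampling on a triangle mesh: pick triangles with probability proportional to area, then draw a uniform point inside each chosen triangle. It is an illustration of the general technique under the assumption that the input is a triangle mesh (such as the generated stl files). It is not the code in sample_points.py, and the function names are hypothetical.

```python
import math
import random


def triangle_area(a, b, c):
    """Area of a 3D triangle from the cross product of two edge vectors."""
    ab = [b[i] - a[i] for i in range(3)]
    ac = [c[i] - a[i] for i in range(3)]
    cross = (
        ab[1] * ac[2] - ab[2] * ac[1],
        ab[2] * ac[0] - ab[0] * ac[2],
        ab[0] * ac[1] - ab[1] * ac[0],
    )
    return 0.5 * math.sqrt(sum(x * x for x in cross))


def sample_surface(vertices, faces, n_points=2000, seed=0):
    """Draw n_points uniformly from the surface of a triangle mesh."""
    rng = random.Random(seed)
    triangles = [tuple(vertices[i] for i in face) for face in faces]
    areas = [triangle_area(*tri) for tri in triangles]
    # Larger triangles are chosen proportionally more often.
    chosen = rng.choices(triangles, weights=areas, k=n_points)
    points = []
    for a, b, c in chosen:
        # Square-root trick gives uniform barycentric coordinates.
        s = math.sqrt(rng.random())
        r = rng.random()
        u, v, w = 1.0 - s, s * (1.0 - r), s * r
        points.append(tuple(u * a[i] + v * b[i] + w * c[i] for i in range(3)))
    return points


# Toy mesh: a unit square in the z = 0 plane, split into two triangles.
vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0)]
faces = [(0, 1, 2), (0, 2, 3)]
points = sample_surface(vertices, faces)
```

Area weighting matters because sampling triangles with equal probability would over-represent finely tessellated regions of the model.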

Citation

If you find our work useful in your research, please cite our paper SkexGen:

@inproceedings{xu2022skexgen, 
title     = {SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks},
author    = {Xu, Xiang and Willis, Karl DD and Lambourne, Joseph G and Cheng, Chin-Yi and Jayaraman, Pradeep Kumar and Furukawa, Yasutaka},
booktitle = {International Conference on Machine Learning},
pages={24698--24724},
year={2022},
organization={PMLR}
}

License

Please see the license for further details.