OpenVDB Fog Volume Compression using VQ-VAE Neural Network

ZephirFXEC/VQVDB

Overview

VQVDB is a deep-learning-powered compressor for volumetric data stored in OpenVDB. It uses a Vector Quantized Variational Autoencoder (VQ-VAE) to learn a compact latent space, achieving up to 32x compression of float voxel grids while remaining nearly lossless at the visual level.

VQVDB is designed for GPU-accelerated decoding via CUDA but also supports CPU encoding and decoding, enabling real-time decompression of large volumes, with native support for integration into Houdini.
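The 32x figure follows from the block sizes involved: an 8x8x8 leaf of float32 voxels occupies 2048 bytes, while its compressed form is a 4x4x4 grid of uint8 codebook indices, i.e. 64 bytes (the codebook itself is a fixed, amortized cost shared by all leaves). A quick back-of-envelope check:

```python
# Compression ratio for a single OpenVDB leaf node.
leaf_bytes = 8 * 8 * 8 * 4   # 8x8x8 float32 voxels -> 2048 bytes
index_bytes = 4 * 4 * 4 * 1  # 4x4x4 uint8 codebook indices -> 64 bytes
ratio = leaf_bytes // index_bytes
print(ratio)  # 32
```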

File Format

Each .vqvdb file stores:

| Section       | Description                               |
|---------------|-------------------------------------------|
| Header        | Magic, version, codebook size, shape info |
| Codebook      | 256 x 128 float matrix                    |
| Index Tensors | [B x 4 x 4 x 4] uint8 values              |
| Origins       | Per-leaf grid coordinates                 |

How it Works

Training Pipeline

I'm very bad at programming and couldn't compile pyopenvdb, so I had to extract the VDB data to .npy files instead.

  1. Leaf Extraction
    Extract every non-empty 8x8x8 voxel block, along with the origin coordinates of each leaf (needed for reconstruction), from a dataset of VDB volumes.

  2. VQ-VAE Model
    A PyTorch-based encoder compresses each leaf into a latent vector of size D, in this case 128 dimensions.
    The quantizer maps this vector to the closest of K learned codebook entries; with K = 256, each index fits in a uint8_t.

  3. Loss Function
    Optimized using the following objective:

    $$\mathcal{L} = \|x - \hat{x}\|^2 + \beta \cdot \| z_e(x) - \text{sg}[e] \|^2$$

    with Exponential Moving Average (EMA) updates to the embedding table. The stop-gradient sg sits on the codebook entry e, since the codebook is updated by EMA rather than by this loss; the remaining beta term is the commitment loss.
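Steps 2 and 3 above hinge on a nearest-neighbour lookup into the codebook. A minimal numpy sketch of that quantization step and the commitment term (the encoder/decoder networks and EMA updates are omitted; shapes follow the K = 256, D = 128 setup, with K small enough that indices fit in a uint8):

```python
import numpy as np


def quantize(z_e, codebook):
    """Map each latent vector to its nearest codebook entry.

    z_e:      (N, D) encoder outputs
    codebook: (K, D) learned embeddings, K <= 256 so indices fit in uint8
    Returns (indices, z_q): uint8 codes and the quantized vectors.
    """
    # Squared Euclidean distance from every latent to every codebook entry.
    d2 = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)  # (N, K)
    indices = d2.argmin(axis=1).astype(np.uint8)
    return indices, codebook[indices]


def commitment_loss(z_e, z_q, beta=0.25):
    # beta * || z_e - sg[z_q] ||^2 ; the stop-gradient is a no-op here
    # because numpy does not track gradients.
    return beta * ((z_e - z_q) ** 2).mean()
```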

Runtime Decompression (C++/CUDA)

  1. Load .vqvdb file, which contains:

    • Codebook (float32)
    • Per-leaf index tensors (4x4x4xuint8_t = 64 bytes)
    • Leaf origins (openvdb::Coord)
  2. GPU Decoding

    • Launch CUDA kernel to decode latent codes into dense voxel blocks
    • Allocate leaf nodes in a new openvdb::FloatGrid
    • Write the reconstructed 8x8x8 voxel blocks
  3. Streaming-Friendly

    • Decompression supports lazy loading in batches
    • Low VRAM footprint when streaming large scenes
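The first stage of the GPU decode is essentially a gather: each uint8 index selects a codebook row, rebuilding the latent tensor that the decoder network then expands back into an 8x8x8 voxel block. A numpy sketch of that lookup (the decoder itself and the CUDA kernel are omitted):

```python
import numpy as np


def indices_to_latents(indices, codebook):
    """Expand per-leaf index tensors back into latent vectors.

    indices:  (B, 4, 4, 4) uint8 codes, 64 bytes per leaf
    codebook: (K, D) float32 embedding table
    Returns a (B, 4, 4, 4, D) latent tensor, ready for the decoder
    that reconstructs each 8x8x8 voxel block.
    """
    # Fancy indexing gathers one codebook row per index.
    return codebook[indices]
```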

Future Work

  • Hierarchical VQ-VAE (multi-res compression)
  • Residual VAE
  • Transformer-based latent upsampling
  • Real-time decompression via Vulkan compute
  • VDB segmentation / semantic-aware encoding

Citation

If you use VQVDB in academic work, please cite the project:

@article{Crema2025,
author = "Enzo Crema",
title = "{VQVDB : VDB Compression using VQ-VAEs}",
year = "2025",
month = "7",
url = "https://figshare.com/articles/dataset/VQVDB_VDB_Compression_using_VQ-VAEs/29469083",
doi = "10.6084/m9.figshare.29469083.v1"
}

License

BSD-3-Clause license