LIVE: Towards Layer-wise Image Vectorization
CVPR 2022 (Oral presentation)

¹Northeastern University
²UIUC
³Adobe Research
⁴Picsart AI Research (PAIR)

Demo Video

Abstract

Image rasterization is a mature technique in computer graphics, while image vectorization, the reverse path of rasterization, remains a major challenge. Recent advanced deep learning-based models achieve vectorization and semantic interpolation of vector graphs and demonstrate a better topology of generating new figures. However, deep models cannot be easily generalized to out-of-domain testing data. The generated SVGs also contain complex and redundant shapes that are not quite convenient for further editing. Specifically, the crucial layer-wise topology and fundamental semantics in images are still not well understood and thus not fully explored. In this work, we propose Layer-wise Image Vectorization, namely LIVE, to convert raster images to SVGs and simultaneously maintain its image topology. LIVE can generate compact SVG forms with layer-wise structures that are semantically consistent with human perspective. We progressively add new bezier paths and optimize these paths with the layer-wise framework, newly designed loss functions, and component-wise path initialization technique. Our experiments demonstrate that LIVE presents more plausible vectorized forms than prior works and can be generalized to new images. With the help of this newly learned topology, LIVE initiates human editable SVGs for both designers and other downstream applications.

Overview

We present a new method to progressively generate a SVG that fits the raster image in a layer-wise fashion. Given an arbitrary input image, LIVE recursively learns the visual concepts by adding new optimizable closed bezier paths and optimizing all these paths.

More examples of layer-wise representation. Given a simple image, our LIVE is able to learn each component in the image in a layer-wise fashion. Here we show the learning progress using 8 paths, where each output appends a new path to the previous result.

Comparsions

From left to right are (1)input raster image, (2)output SVGs of DiffVG (path=5), (3)output SVGs of DiffVG (path=256), and (4)output of our LIVE (path=5). With only 5 paths, DiffVG cannot reconstruct the input image. When increasing the path number to 256 (which is significantly larger than the number of necessary paths), DiffVG is able to reconstruct the input. Differently, our LIVE is able to reconstruct the input smiling face by only 5 paths, and shows a compact layer-wise representation (We re-scale the speed to match the three gifs.).

BibTeX

If you find our project useful in your research, please cite:

                    
@InProceedings{xu2022live,
    author    = {Ma, Xu and Zhou, Yuqian and Xu, Xingqian and Sun, Bin and Filev, Valerii and  Orlov, Nikita and Fu, Yun and Shi, Humphrey},
    title     = {Towards Layer-wise Image Vectorization},
    booktitle = {Proceedings of the IEEE conference on computer vision and pattern recognition},
    year      = {2022}
}

Acknowledgments

The website template was borrowed from CurveNet.

LIVE: Towards Layer-wise Image Vectorization
CVPR 2022 (Oral presentation)

Main Paper

Supp. Materials

Demo Video

Hugging face Space

Colab Demo

Github Code

BibTeX Citation

Demo Video

Abstract

Overview

Comparsions

BibTeX

Acknowledgments

LIVE: Towards Layer-wise Image Vectorization CVPR 2022 (Oral presentation)

Main Paper

Supp. Materials

Demo Video

Hugging face Space

Colab Demo

Github Code

BibTeX Citation

Demo Video

Abstract

Overview

Comparsions

BibTeX

Acknowledgments

LIVE: Towards Layer-wise Image Vectorization
CVPR 2022 (Oral presentation)