# Fast Patch-based Style Transfer of Arbitrary Style
Paper: https://arxiv.org/abs/1612.04337
Code is written in Torch. CUDA and CPU modes are available.
## Examples
<div align='center'>
  <img src='images/content/bike.jpg' width="280px">
  <img src='images/examples/bike_starry_3x3_max.jpg' width="280px">
  <img src='images/examples/bike_swo_3x3_max.jpg' width="280px">
</div>

(3x3 Patches) Content - w/ Starry Night - w/ Small Worlds I

<div align='center'>
  <img src='images/examples/bike_starry_3x3_avg.jpg' width="280px">
  <img src='images/examples/bike_starry_3x3_max_invnet.jpg' width="280px">
  <img src='images/examples/bike_swo_3x3_max_invnet.jpg' width="280px">
</div>

with AvgPooling - using Inverse Network - using Inverse Network

<div align='center'>
  <img src='images/content/brad_pitt.jpg' width="210px">
  <img src='images/examples/brad_pitt_compx_5x5.jpg' width="210px">
  <img src='images/examples/brad_pitt_compx_9x9.jpg' width="210px">
  <img src='images/examples/brad_pitt_compx_15x15.jpg' width="210px">
</div>

(w/ Composition X) Original - 5x5 Patch - 9x9 Patch - 15x15 Patch

<div align='center'>
  <img src='images/content/brad_pitt.jpg' width="210px">
  <img src='images/examples/brad_pitt_lamuse_3x3_max.jpg' width="210px">
  <img src='images/examples/brad_pitt_lamuse_5x5_max.jpg' width="210px">
  <img src='images/examples/brad_pitt_lamuse_9x9_max.jpg' width="210px">
</div>

(w/ La Muse) Original - 3x3 Patch - 5x5 Patch - 9x9 Patch

## Download Pretrained VGG-19
git clone https://github.com/rtqichen/style-swap
cd style-swap/models
sh download_models.sh
cd ..
## Usage
Stylizing a single image:
th style-swap.lua --content images/content/bike.jpg --style images/style/starry_night.jpg
More options:
th style-swap.lua --help
E.g., increase `--patchSize` for more abstract stylization:
th style-swap.lua --content images/content/brad_pitt.jpg --style images/style/la_muse.jpg --patchSize 7 --patchStride 3
E.g., use `--contentBatch` to stylize all images in a directory:
th style-swap.lua --contentBatch images/content --style images/style/starry_night.jpg
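To apply several styles to the same content directory, a simple shell loop over one `style-swap.lua` run per style works. This is only a sketch: the loop, the `stylized_<name>` output directories, and the glob over `images/style/*.jpg` are illustrative, not part of the provided scripts.

```sh
# Hypothetical sketch: run style-swap.lua once per style image,
# saving each batch of results into its own directory via --save.
for style in images/style/*.jpg; do
  name=$(basename "$style" .jpg)
  th style-swap.lua --contentBatch images/content \
                    --style "$style" \
                    --save "stylized_$name"
done
```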
## Training an Inverse Network
Install the `nninit` module:
luarocks install nninit
Train:
th train-vgg-decoder.lua --contentDir /path/to/dir --styleDir /path/to/dir
More options:
th train-vgg-decoder.lua --help
For training the network in our paper, we used images from MS COCO and the Painter by Numbers competition hosted by Kaggle. A trained network can be downloaded here.
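As a concrete (hypothetical) example, assuming MS COCO images were unpacked to `datasets/coco/images` and the Painter by Numbers images to `datasets/painters/train`, the training call would look like:

```sh
# Hypothetical dataset locations; substitute your own paths.
th train-vgg-decoder.lua --contentDir datasets/coco/images \
                         --styleDir datasets/painters/train
```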
## Video
Frame-by-frame stylization can be done using the `--contentBatch` option.
An example using `ffmpeg` to extract frames, stylize them, and re-encode the video:
mkdir video_frames
ffmpeg -i /path/to/video -qscale:v 2 video_frames/video_%04d.jpg
th style-swap.lua --contentBatch video_frames --style /path/to/style/file --save stylized_frames
ffmpeg -i stylized_frames/video_%04d_stylized.jpg -c:v libx264 -pix_fmt yuv420p stylized_video.mp4
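A small wrapper script can keep these steps together. The script below is only a sketch: the `VIDEO`, `STYLE`, and `FPS` variables are hypothetical, and `-framerate` is added because ffmpeg otherwise treats an image sequence as 25 fps, so set `FPS` to match the source video.

```sh
#!/usr/bin/env bash
# Hypothetical wrapper around the three steps above; adjust paths and FPS.
VIDEO=/path/to/video
STYLE=/path/to/style/file
FPS=30   # frame rate of the source video

mkdir -p video_frames stylized_frames
ffmpeg -i "$VIDEO" -qscale:v 2 video_frames/video_%04d.jpg
th style-swap.lua --contentBatch video_frames --style "$STYLE" --save stylized_frames
ffmpeg -framerate "$FPS" -i stylized_frames/video_%04d_stylized.jpg \
       -c:v libx264 -pix_fmt yuv420p stylized_video.mp4
```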
Examples of stylized videos are placed in the videos folder. (Original video by TimeLapseHD.)
## Reducing GPU Memory Usage
A few ways to reduce memory usage for `style-swap.lua` (a combined example follows the list):

- Decrease `--maxStyleSize` and `--maxContentSize`. The latter changes the size of the resulting image.
- Increase `--patchStride`. This extracts fewer patches to use for style swap. It is best to also use a larger `--patchSize` so the patches still overlap.
- Last resort: use CPU-only mode by specifying `--cpu`.
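For example, a single invocation combining these flags might look like the following; the sizes and patch settings are illustrative values to tune per image, not recommended defaults.

```sh
# Hypothetical settings: smaller maximum image sizes plus a sparser, larger patch grid.
th style-swap.lua --content images/content/bike.jpg \
                  --style images/style/starry_night.jpg \
                  --maxContentSize 384 --maxStyleSize 384 \
                  --patchSize 7 --patchStride 3
```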