aligner_gpu_version

Author: Enliang Li

Main translation of the forward algorithm (GPU version)

Latest version: 1.2 (Apr. 15th, 2019)

Build:

Run the "compile.sh" script to build the sources.

Performance Details (TITAN Xp) [HOLMES flow (27 states)]

forward_matrix [x_dim+1][y_dim+1][batch][states-1]
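As a small illustration of the layout above, the sketch below derives the tensor shape from the problem sizes and computes the flat offset of one entry. The helper names `forward_matrix_shape` and `flat_index` are hypothetical and not part of this repository, and the row-major (contiguous) memory order is an assumption.

```python
def forward_matrix_shape(x_dim, y_dim, batch, states):
    """Shape of forward_matrix as [x_dim+1][y_dim+1][batch][states-1]."""
    return (x_dim + 1, y_dim + 1, batch, states - 1)


def flat_index(shape, x, y, b, s):
    """Offset of element [x][y][b][s], assuming row-major (contiguous) storage."""
    _, y_len, b_len, s_len = shape
    return ((x * y_len + y) * b_len + b) * s_len + s


# HOLMES flow with 27 states, x_dim = 5, y_dim = 6, batch = 1
shape = forward_matrix_shape(5, 6, 1, 27)
print(shape)                           # (6, 7, 1, 26)
print(flat_index(shape, 5, 6, 0, 25))  # 1091, the last element
```

With these inputs the shape matches the first HOLMES measurement listed in this section.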

forward_matrix: torch.Size([6, 7, 1, 26])

10.79748 ms / call (float)

11.009 ms / call (double)

forward_matrix: torch.Size([16, 19, 1, 26])

11.50272 ms / call (float)

11.9557 ms / call (double)

forward_matrix: torch.Size([61, 127, 1, 26])

30.30934 ms / call (float)

33.30686 ms / call (double)
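This section does not describe how the "ms / call" figures were collected. The sketch below shows one common way to obtain an average of that kind; the helper `ms_per_call` and its parameters are hypothetical and not part of this repository. For GPU code, the timed callable would have to block until the kernel has finished (for example by ending with `torch.cuda.synchronize()`), otherwise only the launch overhead is measured.

```python
import time


def ms_per_call(fn, repeats=100, warmup=5):
    """Average wall-clock time of fn() in milliseconds over `repeats` calls."""
    # Warm-up calls are excluded from the measurement (caches, lazy init).
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    elapsed = time.perf_counter() - start
    return elapsed * 1000.0 / repeats


# Usage with a CPU-only stand-in workload:
print(ms_per_call(lambda: sum(range(10000)), repeats=20))
```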

Performance Details (TITAN Xp) [regular flow (3 states)]

forward_matrix [x_dim+1][y_dim+1][batch][states-1]

forward_matrix: torch.Size

([16, 19, 1, 3])

12.6679 ms / call (float)

13.0121 ms / call (double)

forward_matrix: torch.Size

([16, 19, 3, 3])

12.9385 ms / call (float)

13.7454 ms / call (double)

forward_matrix: torch.Size

([159, 149, 1, 3])

69.3239 ms / call (float)

72.4838 ms / call (double)

PyCUDA Support:

Copy the kernel function definition into your own flow and call it with PyCUDA.

For examples, please refer to:

https://wiki.tiker.net/PyCuda/Examples

Copyright reserved by Andreas Kloeckner
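The copy-and-call pattern described under "PyCUDA Support" can be sketched as follows. The kernel `scale` is a hypothetical placeholder, not the aligner kernel from this repository; its name, arguments, and launch configuration are assumptions made only to show how a pasted CUDA function is compiled and launched. Running it needs PyCUDA, NumPy, and a CUDA-capable GPU.

```python
# Placeholder CUDA source; paste the real kernel definition here instead.
KERNEL_SRC = r"""
__global__ void scale(float *out, const float *in, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = in[i] * factor;
}
"""


def scale_on_gpu(values, factor):
    """Compile KERNEL_SRC with PyCUDA and run it on `values`."""
    # Imported lazily so this module can be loaded on machines without a GPU.
    import numpy as np
    import pycuda.autoinit  # noqa: F401  (creates the CUDA context)
    import pycuda.driver as drv
    from pycuda.compiler import SourceModule

    data = np.asarray(values, dtype=np.float32)
    out = np.empty_like(data)
    func = SourceModule(KERNEL_SRC).get_function("scale")

    threads = 256
    blocks = (data.size + threads - 1) // threads  # ceiling division
    func(drv.Out(out), drv.In(data), np.float32(factor), np.int32(data.size),
         block=(threads, 1, 1), grid=(blocks, 1))
    return out
```

On a machine with a GPU, `scale_on_gpu([1.0, 2.0, 3.0], 2.0)` would return the input scaled by the factor. Scalar kernel arguments are passed as NumPy types so that PyCUDA knows their size.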