Note: Currently, we only have the Torch C version of batch-input sparse linear and sparse convolution.

## Implementation of (c)DSSM in Torch

## Dependencies:

## Data Preprocessing

### Related Functions:

## Training

1. Generate the data from the dataset. The data format follows the C# implementation: each query and its document are on the same line, separated by a tab.
2. Generate the vocabulary for questions and answers using WordHash.Pair2Voc(). You should get output like this:

        Creating Voc file form ...
        srcVoc contains vocabulary: 5584
        tgtVoc contains vocabulary: 10876

3. Create the Pair2Seq features and save them to a text file using WordHash.Pair2SeqFea().
4. Convert the sequence features to a binary file using WordHash.SeqFea2Bin(). The batch size is set at this step and cannot be changed after you train the model; for the original data, the batch size is 1024. See the function itself for more information.
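
The four training steps above can be pictured with a small, self-contained sketch. It is written in Python purely for illustration and is not the repository's Lua code: the function names (`parse_pairs`, `build_vocab`, `to_seq_feature`, `to_batches`) are hypothetical, and it assumes the vocabulary is built from letter trigrams as in the usual DSSM word-hashing scheme, which may differ from what WordHash.Pair2Voc() actually does. Writing the binary file is left out because its layout is not described here.

```python
def letter_trigrams(word):
    """Return the letter trigrams of a word padded with '#' boundary marks."""
    padded = "#" + word.lower() + "#"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]

def parse_pairs(lines):
    """Step 1: split each 'query<TAB>document' line into a (query, document) pair."""
    pairs = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip empty lines
        query, document = line.split("\t", 1)
        pairs.append((query, document))
    return pairs

def build_vocab(texts):
    """Step 2: map every letter trigram seen in texts to an integer id."""
    grams = set()
    for text in texts:
        for word in text.split():
            grams.update(letter_trigrams(word))
    # Sort so that ids are deterministic across runs.
    return {gram: idx for idx, gram in enumerate(sorted(grams))}

def to_seq_feature(text, vocab):
    """Step 3: one list of trigram ids per word; unknown trigrams are dropped."""
    return [[vocab[g] for g in letter_trigrams(w) if g in vocab]
            for w in text.split()]

def to_batches(features, batch_size):
    """Step 4: group features into fixed-size batches; the last may be smaller."""
    return [features[i:i + batch_size]
            for i in range(0, len(features), batch_size)]

# Toy data in the tab-separated format (assumed example sentences).
lines = ["cat food\tbest food for a cat", "red car\ta fast red car"]
pairs = parse_pairs(lines)
src_vocab = build_vocab(q for q, _ in pairs)
tgt_vocab = build_vocab(d for _, d in pairs)
src_features = [to_seq_feature(q, src_vocab) for q, _ in pairs]
batches = to_batches(src_features, batch_size=1024)
print("srcVoc size:", len(src_vocab), "tgtVoc size:", len(tgt_vocab))
```

Because the batch size shapes how the features are grouped on disk, it makes sense that it has to stay fixed once a model has been trained on those batches.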

### Related functions

## Predicting

1. Generate the feature file; refer to PreProcess.lua for details.

## To-do List