Home

Awesome

ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing

ViText2SQL is a dataset for the Vietnamese Text-to-SQL semantic parsing task, consisting of about 10K question and SQL query pairs. The construction of ViText2SQL is detailed in our EMNLP-2020 Findings paper:

@inproceedings{vitext2sql,
    title     	= {{A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese}},
    author    	= {Anh Tuan Nguyen and Mai Hoang Dao and Dat Quoc Nguyen},
    booktitle   = {Findings of the Association for Computational Linguistics: EMNLP 2020},
    year      	= {2020},
    pages       = {4079--4085}
}  

By downloading the ViText2SQL dataset, USER agrees:

The user can also download from here our pre-trained Word2Vec syllable and word embeddings for Vietnamese that we used in our experiments.

Copyright (c) 2020 VinAI

THE DATA IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE DATA OR THE USE OR OTHER DEALINGS IN THE
DATA.