Home

Awesome

A-Paper-List-of-Awesome-Tabular-LLMs

Different types of tables are widely used to store and present information. To automatically process numerous tables and gain valuable insights, researchers have proposed a series of deep-learning models for various table-based tasks, e.g., table question answering (TQA), table-to-text (T2T), text-to-sql (NL2SQL) and table fact verification (TFV). Recently, the emerging Large Language Models (LLMs) and more powerful Multimodal Large Language Models (MLLMs) have opened up new possibilities for processing the tabular data, i.e., we can use one general model to process diverse tables and fulfill different tabular tasks based on the user natural language instructions. We refer to these LLMs speciallized for tabular tasks as Tabular LLMs. In this repository, we collect a paper list about recent Tabular (M)LLMs and divide them into the following categories based on their key idea.


<font size=8><center><b> Table of Contents: </b> </center></font>

  1. Survey of Tabular LLMs and table understanding
  2. Prompting LLMs for different tabular tasks, e.g., in-context learning, prompt engineering and integrating external tools.
  3. Training LLMs for better table understanding ability, e.g., training existing LLMs by instruction fine-tuning or post-pretraining.
  4. Developing Agents for tabular data, e.g., devolping copilot for processing excel tables.
  5. RAG with tabular data, e.g., devolping RAG systems for understanding long tables.
  6. Empirical study or benchmarks for evaluating LLMs' table understanding ability, e.g., exploring the influence of various table types or table formats.
  7. Multimodal table understanding, e.g., training MLLMs to understand diverse table images and textual user requests.
  8. Table Understanding datasets, e.g., valuable datasets for model training and evaluation.
  9. Evaluation Metrics for Table Understanding, e.g., devising better evaluation method for table understanding.

<font size=8><center><b> Task Names and Abbreviations: </b> </center></font>

Task NamesAbbreviationsTask Descriptions
Table Question AnsweringTQAAnswering questions based on the table(s), e.g., answer look-up or computation questions about table(s).
Table-to-TextTable2Text or T2TGenerate a text based on the table(s), e.g., generate a analysis report given a financial statement.
Text-to-TableText2TableGenerate structured tables based on input text, e.g., generate a statistical table based on the game summary.
Table Fact VerificationTFVJudging if a statement is true or false (or not enough evidence) based on the table(s)
Text-to-SQLNL2SQLGenerate a SQL statement to answer the user question based on the database schema
Tabular Mathematical ReasoningTMRSolving mathematical reasoning problems based on the table(s), e.g., solve math word problems related to a table
Table-and-Text Question AnsweringTAT-QAAnswering questions based on both table(s) and their related texts, e.g., answer questions given wikipedia tables and their surrounding texts.
Table InterpretationTIInterpreting basic table content and structure information, e.g., column type annotation, entity linking, relation extraction, cell type classification et al.
Table AugmentationTAAugmenting existing tables with new data, e.g., schema augmentation, row population, et al.

1. Survey of Tabular LLMs and Table Understanding

TitleConferenceDatePages
Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolutionarxiv2024-08-2049
Large Language Model for Table Processing: A Surveyarxiv2024-02-049
A Survey of Table Reasoning with Large Language Modelsarxiv2024-02-139
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Surveyarxiv2024-03-0141
Transformers for Tabular Data Representation: A Survey of Models and ApplicationsTACL 202323
Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream TasksIJCAI 20222022-01-2415

2. Prompting LLMs for Different Tabular Tasks

TitleConferenceDateTaskCode
Retrieval & Fine-Tuning for In-Context Tabular ModelsNIPS 20242024-06-07Machine learning tasks with tabular data
Star <br> GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question AnsweringCOLING 20252024-12-02TQA
PoTable: Programming Standardly on Table-based Reasoning Like a Human Analystarxiv2024-12-05TQA, TFV
Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table SummarizationEMNLP 2024 Findings2024-06-18Table Summarization
TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMsEMNLP 2024Text2Table
Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple ExtractionEMNLP 20242024-04-22Text2TableGithub
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoningarxiv2024-09-18TQAGithub
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQAEMNLP 20242024-09-25TQA
Star <br> FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formatsarxiv2024-08-16TQA, TFVGithub
Learning Relational Decomposition of Queries for Question Answering from TablesACL 2024TQA
TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based ReasoningACL 2024TQA
Enhancing Temporal Understanding in LLMs for Semi-structured Tablesarxiv2024-07-22Temporal TQA
Star <br> ALTER: Augmentation for Large-Table-Based Reasoningarxiv2024-07-03TQAGithub
TrustUQA: A Trustful Framework for Unified Structured Data Question Answeringarxiv2024-06-27TQA
Adapting Knowledge for Few-shot Table-to-Text Generationarxiv2024-03-27T2T
Graph Reasoning Enhanced Language Models for Text-to-SQLSIGIR 2024NL2SQL
NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalizationarxiv2024-06-25TQA,TFV
Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTToNAACL 20242024-04-05T2T
TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table DecompositionNAACL 2024TQA,TFV
Star <br> E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and ExtrapolateNAACL 2024TQA on hierarchical tablesGithub
OpenTE: Open-Structure Table Extraction From TextICASSP 2024Text-to-Table Extraction
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQLNAACL 20242024-04-03NL2SQL
MFORT-QA: Multi-hop Few-shot Open Rich Table Question Answeringarxiv2024-03-28TQA
Star <br> OpenTab: Advancing Large Language Models as Open-domain Table ReasonersICLR 20242024-02-22TQA,TFVGithub
CABINET: Content Relevance based Noise Reduction for Table Question AnsweringICLR 20242024-02-02TQA
Star <br> Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansionarxiv2024-01-24TQAGithub
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table UnderstandingICLR 20242024-01-09TQA,TFV
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model ReasoningEMNLP 2024 Findings2023-12-14TQA,TAT-QA,TFV,T2TGithub
Large Language Models are Complex Table ParsersEMNLP 20232023-12-13TQA
API-Assisted Code Generation for Question Answering on Varied Table StructuresEMNLP 20232023-10-23TQA
Star <br> TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answeringarxiv2023-10-23TQA,NL2SQLGithub
Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategiesarxiv2023-05-21NL2SQL
Star <br>StructGPT: A General Framework for Large Language Model to Reason over Structured DataEMNLP 20232023-05-16TQA, TFVGithub
Star <br> Chameleon:Plug-and-Play Compositional Reasoning with Large Language ModelsNIPS 20232023-04-19TMRGithub
Generate, Transform, Answer: Question Specific Tool Synthesis for Tabular DataEMNLP 20232023-03-17TQA,NL2SQL
DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language ModelsSIGMOD 20242023-03-12Table Transformation
Star <br> Large Language Models are Versatile Decomposers:Decompose Evidence and Questions for Table-based ReasoningSIGIR 20232023-01-13TQA, TFVGithub
Star <br> Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning TasksTMLR 20232022-11-22TMR, TAT-QAGithub
Star <br> Large Language Models are few(1)-shot Table ReasonersEACL 2023 Findings2022-10-13TQA, TFVGithub
Star <br> Binding Language Models in Symbolic LanguagesICLR 20232022-10-06TQA, TFVGithub
Star <br> Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningICLR 20232022-09-29TMR (Tabular Mathematical Reasoning)Github

3. Training LLMs for Better Table Understanding Ability

TitleConferenceDateTaskLLM BackboneCode
TableGPT2: A Large Multimodal Model with Tabular Data Integrationarxiv2024-11-04TQA, TFV, et al.Qwen2.5 model family with a special pre-trained table encoder.Github
Large Scale Transfer Learning for Tabular Data via Language ModelingNIPS 20242024-06-17tabular data prediction (classification and binned regression)Llama 3-8B
ProTrix: Building Models for Planning and Reasoning over Tables with Sentence ContextEMNLP 2024 Findings2024-03-04TQA, TFVLlama-2Github
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure RecognitionEMNLP 2024 Findings2024-09-20Table Recognition
Table Question Answering for Low-resourced Indic LanguagesEMNLP 20242024-10-04Indian TQAmBARTGithub
TabMoE: A General Framework for Diverse Table-Based Reasoning with Mixture-of-ExpertsMathematics2024-08-16TQA, TFV, T2TBART
Star <br/>rLLM: Relational Table Learning with LLMsarxiv2024-07-29multi-table joint learning tasksa PyTorch library designed for Relational Table Learning (RTL) with Large Language Models (LLMs).Github
Star <br> Mambular: A Sequential Model for Tabular Deep Learningarxiv2024-08-12ML Classification and Regression tasks like California HousingMambaGithub
MambaTab: A Plug-and-Play Model for Learning Tabular DataMIPR 20242024-01-16ML Classification tasksMamba
SpreadsheetLLM: Encoding Spreadsheets for Large Language Modelsarxiv2024-07-12Excel Manipulation
Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Sciencearxiv2024-03-29Predictive Tabular TasksLlama2 7BHuggingFace
HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understandingarxiv2024-03-28TI,TQAVicuna-1.5 7B
Star <br> TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenariosarxiv2024-03-28Table ManipulationCodeLlama 7B, 13BGithub
Star <br> StructLM: Towards Building Generalist Models for Structured Knowledge GroundingCoLM 20242024-02-26TQA,TFV,T2T,NL2SQLCodeLlama 7B-34BGithub
Star <br> TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Dataarxiv2024-01-24TQALlama2 7B, 13B, 70BGithub
Star <br> TableLlama: Towards Open Large Generalist Models for TablesNAACL 20242023-11-15TQA,TFV,T2T,TA,TILlama2 7BGithub
HELLaMA: LLaMA-based Table to Text Generation by Highlighting the Important Evidencearxiv2023-11-15T2TLlama2 7B-13B
Table-GPT: Table-tuned GPT for Diverse Table Tasksarxiv2023-10-13TQAGPT-3.5, ChatGPT

Pre-trained Tabular Language Models (non-LLM)

TitleConferenceDateTaskCode
Star <br> HYTREL: Hypergraph-enhanced Tabular Data Representation LearningNIPS 20232023-07-14TA, TIGithub
FLAME: A small language model for spreadsheet formulasAAAI 20242023-01-31Generating Excel FormulasGithub

4. Developing Agents for Processing Tabular Data

TitleConferenceDateTaskCode
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Modelsarxiv2024-03-06Manipulating Excels with LLMGithub
Star <br> EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Recordsarxiv2024-01-13TQAGithub
Star <br> InfiAgent-DABench: Evaluating Agents on Data Analysis Tasksarxiv2024-01-10Data AnalysisGithub
Star <br> DB-GPT: Empowering Database Interactions with Private Large Language Modelsarxiv2023-12-29Data AnalysisGithub
ReAcTable: Enhancing ReAct for Table Question Answeringarxiv2023-10-01TQA
Star <br>SheetCopilot: Bringing Software Productivity to the Next Level through Large Language ModelsNIPS 20232023-05-30Manipulating Excels with LLMGithub
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPTarxiv2023-07-17Manipulating CSV table with LLM

5. RAG with Tabular Data

TitleConferenceDateTaskCode
TableRAG: Million-Token Table Understanding with Language ModelsNIPS 20242024-10-07TQA for extremely long tables
Evaluation of Table Representations to Answer Questions from Tables in Documents : A Case Study using 3GPP Specificationsarxiv2024-08-30how to represent tables for better retrieval within RAG systems
THoRR: Complex Table Retrieval and Refinement for RAGIR-RAG 2024 workshopRAG with large and complex tables

6. Empirical Study or Benchmarks for Evaluating LLMs' Table Understanding Ability

TitleConferenceDateTaskCode
Rethinking Tabular Data Understanding with Large Language ModelsNAACL 20242023-12-27TQA
On the Robustness of Language Models for Tabular Question Answeringarxiv2024-06-18TQA
FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question AnsweringNAACL 20242024-04-29TQA
How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Datasetarxiv2024-03-20TQA
Star <br> InstructExcel: A Benchmark for Natural Language Instruction in ExcelFindings of EMNLP 20232023-10-23Excel operationsGithub
Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMsarxiv2023-10-16Fact-Finding Tasks, Transformation Tasks
Star <br> Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking ScenariosEMNLP 20232023-05-24T2TGithub
Star <br> TABLET: Learning From Instructions For Tabular Dataarxiv2023-04-25Github
Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical StudyWSDM 20242023-05-22TQA,TFV,T2T
Evaluating the Text-to-SQL Capabilities of Large Language Modelsarxiv2022-03-15NL2SQL
Star <br> A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capabilityarxiv2023-03-12NL2SQLGithub
Star <br> RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial PerturbationsACL 20232023-06-25TQAGithub

7. Multimodal Table Understanding

TitleConferenceDateTaskCode
Knowledge-Aware Reasoning over Multimodal Semi-structured TablesEMNLP 2024 Findings2024-08-25Understanding table images with visual elements like symbols and icons
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasksarxiv2024-10-02Multi Table Image QAGithub
Star <br> PixT3: Pixel-based Table-To-Text GenerationACL 20242023-11-16T2TGithub
TabPedia: Towards Comprehensive Visual Table Understanding with Concept SynergyNIPS 20242024-06-03TQA,TI
Star <br> TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domainsarxiv2024-04-30TQA, TFVGithub
Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMsACL 20242024-02-19TQA,TFV,T2T
Star <br> Multimodal Table UnderstandingACL 20242024-02-15TQA, TFV, T2T, TI, TAT-QA, TMRGithub

8. Table Understanding Datasets

8.1 Recent Datasets for LLMs

TitleConferenceDateTaskData VolumeDomainTable TypeData and Code
MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table ReasoningCOLING 20242024-12-16TQA,T2T,Table manipulation, Data analysis1,719 (spreadsheet, question, answer) triplets from 428 different spreadsheetsMultiple domainsFlat and hierarchical tablesGithub
ENTRANT: A Large Financial Dataset for Table UnderstandingSci Data2024-07-04Cell Type Classification, Header Extraction, et alMillions of tables with cell attributes, as well as positional and hierarchical informationFinancialFlat tables and hierarchical tablesGithub
TableBench: A Comprehensive and Complex Benchmark for Table Question Answeringarxiv2024-08-17TMR, TFV, Trend Forecasting and Chart Generation3681 tables and 20K samplesCollect tables from academic datasets like WTQ and FeTaQAFlat tables and a small number of hierarchical tablesGithub
DocTabQA: Answering Questions from Long Documents Using Tablesarxiv2024-08-21Table Generation based on question and document300 documents and 1.5k question-table pairsFinancialFlat tables and hierarchical tablesGithub

8.2 Classic Datasets of Downstream Table Tasks

9. Designing Evaluation Metrics for Table Understanding

TitleConferenceDateTaskCode
Revisiting Automated Evaluation for Long-form Table Question Answering in the Era of Large Language ModelsEMNLP 2024TQA
Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from TextEMNLP 20242024-06-21Text2Table