Must-read Papers on Legal Intelligence
Contributed by Chaojun Xiao, Haoxi Zhong, Yutao Sun
Overview of Legal Intelligence
-
How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence.
Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun. ACL 2020. [pdf]
Datasets
Dataset | Task | Language | Size |
---|---|---|---|
Gamper (2000) | Parallel Corpus | Italian, German | 5m words |
Grover et al. (2004) | Summarization | English | 40 documents, 12k sentences |
Hoekstra et al. (2007) | Ontology | English | 2378 concepts |
Demenko et al. (2008) | Speech | Polish | 2h vocal material |
Cvrcek et al. (2012) | Dictionary | Czech | 10k entries, 20k terms |
Fawei et al. (2016) | Question Answering | English | 400 questions |
Locke et al. (2018) | Information Retrieval | English | 3m decisions, 2572 assessments |
Araujo et al. (2018) | Named Entity Recognition | Portuguese | 70 documents |
Kano et al. (2018) | IR and QA | Japanese | 285 queries, 651 questions |
Xiao et al. (2018) | Judgment Prediction | Chinese | 2.68m documents |
Manor et al. (2019) | Summarization | English | 505 sets, 175 documents |
Chalkidis et al. (2019a) | Judgment Prediction | English | 11.5k documents |
Chalkidis et al. (2019b) | Classification | English | 57k documents, 4.3k labels |
Duan et al. (2019) | Reading Comprehension | Chinese | 50k questions, 10k documents |
Xiao et al. (2019) | Similar Case Matching | Chinese | 9k triplets of documents |
Zhong et al. (2020) | Question Answering | Chinese | 30k questions, 80k articles |
-
<span id="Gamper">A parallel corpus of Italian/German legal texts.</span>
Johann Gamper. LREC 2000. [pdf]
-
<span id = "Grover">The HOLJ corpus: supporting summarisation of legal texts.</span>
Claire Grover, Ben Hachey, Ian Hughson. COLING 2004. [pdf]
-
<span id = "Hoekstra">The LKIF Core ontology of basic legal concepts.</span>
Rinke Hoekstra, Joost Breuker, Marcello Di Bello, Alexander Boer. 2007. [pdf]
-
<span id = "Demenko">JURISDIC: Polish speech database for taking dictation of legal texts.</span>
Grazyna Demenko, Stefan Grocholewski, Katarzyna Klessa, Jerzy Ogorkiewicz, Agnieszka Wagner, Marek Lange, Daniel Sledzinski, Natalia Cylwik. LREC 2008. [pdf]
-
<span id = "Cvrcek">Legal electronic dictionary for Czech.</span>
Frantisek Cvrcek, Karel Pala, Pavel Rychly. LREC 2012. [pdf]
-
<span id = "Fawei">Passing a USA national bar exam: a first corpus for experimentation.</span>
Biralatei Fawei, Adam Wyner, Jeff Pan. LREC 2016. [pdf]
-
<span id = "Locke">A Test Collection for Evaluating Legal Case Law Search.</span>
Daniel Locke, Guido Zuccon. SIGIR 2018. [pdf]
-
<span id = "Kano">COLIEE-2018: Evaluation of the competition on legal information extraction and entailment.</span>
Yoshinobu Kano, Mi-Young Kim, Masaharu Yoshioka, Yao Lu, Juliano Rabelo, Naoki Kiyota, Randy Goebel, Ken Satoh. JSAI 2018. [pdf]
-
<span id = "Lenerbr">LeNER-Br: A dataset for named entity recognition in Brazilian legal text.</span>
Pedro Henrique Luz de Araujo, Teófilo E. de Campos, Renato R. R. de Oliveira, Matheus Stauffer, Samuel Couto, Paulo Bermejo. PROPOR 2018. [pdf]
-
<span id = "XiaoCAIL2018">CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction.</span>
Chaojun Xiao, Haoxi Zhong, Zhipeng Guo, Cunchao Tu, Zhiyuan Liu, Maosong Sun, Yansong Feng, Xianpei Han, Zhen Hu, Heng Wang, Jianfeng Xu. [pdf]
-
<span id = "Manor">Plain English summarization of contracts.</span>
Laura Manor, Junyi Jessy Li. Natural Legal Language Processing Workshop 2019. [pdf]
-
<span id = "ChalkidisNeural">Neural Legal Judgment Prediction in English.</span>
Ilias Chalkidis, Ion Androutsopoulos, Nikolaos Aletras. ACL 2019. [pdf]
-
<span id = "ChalkidisLargeScale">Large-Scale Multi-Label Text Classification on EU Legislation.</span>
Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Ion Androutsopoulos. ACL 2019. [pdf]
-
<span id = "Duan">CJRC: A reliable human-annotated benchmark dataset for Chinese judicial reading comprehension.</span>
Xingyi Duan, Baoxin Wang, Ziyue Wang, Wentao Ma, Yiming Cui, Dayong Wu, Shijin Wang, Ting Liu, Tianxiang Huo, Zhen Hu. CCL 2019. [pdf]
-
<span id = "XiaoCAIL2019">CAIL2019-SCM: A dataset of similar case matching in legal domain.</span>
Chaojun Xiao, Haoxi Zhong, Zhipeng Guo, Cunchao Tu, Zhiyuan Liu, Maosong Sun, Tianyang Zhang, Xianpei Han, Heng Wang, Jianfeng Xu. [pdf]
-
<span id = "ZhongJECQA">JEC-QA: A legal-domain question answering dataset.</span>
Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun. AAAI 2020. [pdf]
Legal Judgment Prediction
-
Learning to predict charges for criminal cases with legal basis.
Bingfeng Luo, Yansong Feng, Jianbo Xu, Xiang Zhang, Dongyan Zhao. EMNLP 2017. [pdf]
-
Few-shot charge prediction with discriminative legal attributes.
Zikun Hu, Xiang Li, Cunchao Tu, Zhiyuan Liu, Maosong Sun. COLING 2018. [pdf]
-
Legal Judgment Prediction via Topological Learning.
Haoxi Zhong, Zhipeng Guo, Cunchao Tu, Chaojun Xiao, Zhiyuan Liu, Maosong Sun. EMNLP 2018. [pdf]
-
Interpretable Rationale Augmented Charge Prediction System.
Xin Jiang, Hai Ye, Zhunchen Luo, Wenhan Chao, Wenjia Ma. COLING 2018. [pdf]
-
Legal Article-Aware End-To-End Memory Network for Charge Prediction.
Yatian Shen, Jun Sun, Xiaopeng Li, Lei Zhang, Yan Li, Xiajiong Shen. CSAE 2018. [pdf]
-
SECaps: A Sequence Enhanced Capsule Model for Charge Prediction.
Congqing He, Li Peng, Yuquan Le, Jiawei He, Xiangyu Zhu. [pdf]
-
Automatic Judgment Prediction via Legal Reading Comprehension.
Shangbang Long, Cunchao Tu, Zhiyuan Liu, Maosong Sun. [pdf]
-
A Markov Logic Networks Based Method to Predict Judicial Decisions of Divorce Cases.
Jiajing Li, Guoying Zhang, Hongfei Yan, Longxue Yu, Tao Meng. IEEE SmartCloud. [pdf]
-
Legal Judgment Prediction via Multi-Perspective Bi-Feedback Network.
Wenmian Yang, Weijia Jia, Xiaojie Zhou, Yutao Luo. IJCAI 2019. [pdf]
-
Law text classification using semi-supervised convolutional neural networks.
Penghua Li, Fen Zhao, Yuanyuan Li, Ziqin Zhu. CCDC. [pdf]
-
Exploring the Use of Text Classification in the Legal Domain.
Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith. [pdf]
-
Predicting the Law Area and Decisions of French Supreme Court Cases.
Octavia-Maria Sulea, Marcos Zampieri, Mihaela Vela, Josef van Genabith. RANLP 2017. [pdf]
-
JUMPER: Learning When to Make Classification Decisions in Reading.
Xianggen Liu, Lili Mou, Haotian Cui, Zhengdong Lu, Sen Song. IJCAI 2018. [pdf]
-
Generalize Symbolic Knowledge With Neural Rule Engine.
Shen Li, Hengru Xu, Zhengdong Lu. [pdf]
-
An External Knowledge Enhanced Multi-label Charge Prediction Approach with Label Number Learning.
Duan Wei, Li Lin. [pdf]
-
Machine learning for explaining and ranking the most influential matters of law.
Max R. S. Marques, Tommaso Bianco, Maxime Roodnejad, Thomas Baduel, Claude Berrou. ICAIL 2019. [pdf]
-
Charge-Based Prison Term Prediction with Deep Gating Network.
Huajie Chen, Deng Cai, Wei Dai, Zehui Dai, Yadong Ding. EMNLP-IJCNLP 2019. [pdf]
-
Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction.
Haoxi Zhong, Yuzhong Wang, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun. AAAI 2020. [pdf]
-
Distinguish Confusing Law Articles for Legal Judgment Prediction.
Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang, Junzhou Zhao. ACL 2020. [pdf]
Court Views Generation
-
Interpretable Charge Predictions for Criminal Cases: Learning to Generate Court Views from Fact Descriptions.
Hai Ye, Xin Jiang, Zhunchen Luo, Wenhan Chao. NAACL-HLT 2018. [pdf]
-
De-Biased Court’s View Generation with Causality.
Yiquan Wu, Kun Kuang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Jun Xiao, Yueting Zhuang, Luo Si, Fei Wu. EMNLP 2020. [pdf]
Information Extraction
Named Entity Recognition
-
Named entity recognition in the legal domain for ontology population.
Mirian Bruckschen, Caio Northfleet, Paulo Bridi, Roger Granada, Renata Vieira, Prasad Rao, Tomas Sander. 2010. [pdf]
-
Legal NERC with ontologies, Wikipedia and curriculum learning.
Cristian Cardellino, Milagro Teruel, Laura Alonso Alemany, Serena Villata. EACL 2017. [pdf]
-
A Low-cost, High-coverage Legal Named Entity Recognizer, Classifier and Linker.
Cristian Cardellino, Milagro Teruel, Laura Alonso Alemany, Serena Villata. 2017. [pdf]
-
Legal Entity Extraction with NER Systems.
Ines Badji. 2018. [pdf]
-
Deep Learning for Named-Entity Linking with Transfer Learning for Legal Documents.
Ahmed Elnaggar, Robin Otto, Florian Matthes. AICCC 2018. [pdf]
-
Neural Entity Reasoner for Global Consistency in Named Entity Recognition.
Xiaoxiao Yin, Daqi Zheng, Zhengdong Lu, Ruifang Liu. 2018. [pdf]
-
Fine-Grained Named Entity Recognition in Legal Documents.
Elena Leitner, Georg Rehm, Julian Moreno-Schneider. SEMANTiCS 2019. [pdf]
Event Extraction
-
Event extraction and temporal reasoning in legal documents.
Frank Schilder. 2005. [pdf]
-
Event extraction for legal case building and reasoning.
Nikolaos Lagos, Frederique Segond, Stefania Castellani, Jacki O'Neill. IIP 2010. [pdf]
-
Event Identification as a Decision Process with Non-linear Representation of Text.
Yukun Yan, Daqi Zheng, Zhengdong Lu, Sen Song. [pdf]
-
Apply event extraction techniques to the judicial field.
Chuanyi Li, Yu Sheng, Jidong Ge, Bin Luo. 2019. [pdf]
Others
-
Semantic mark-up of Italian legal texts through NLP-based techniques.
Roberto Bartolini, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli, Claudia Soria. LREC 2004. [pdf]
-
Legal aspects of text mining.
Maarten Truyens and Patrick Van Eecke. LREC 2014. [pdf]
-
Litigation Analytics: Case Outcomes Extracted from US Federal Court Dockets.
Thomas Vacek, Ronald Teo, Dezhao Song, Conner Cowling, Frank Schilder, Timothy Nugent. NAACL Workshop 2019. [pdf]
-
A Sequence Approach to Case Outcome Detection.
Tom Vacek, Frank Schilder. ICAIL 2017. [pdf]
-
Extracting the Gist of Chinese Judgments of the Supreme Court.
Chaolin Liu, Kuanchun Chen. ICAIL 2019. [pdf]
Information Retrieval
-
Analyzing the extraction of relevant legal judgments using paragraph-level and citation information.
Raghav K, Reddy P K, Reddy V B. ECAI 2016. [pdf]
-
On the concept of relevance in legal information retrieval.
Marc Van Opijnen, Cristiana Santos. ECAI 2016. [pdf]
-
Building legal case retrieval systems with lexical matching and summarization using a pretrained phrase scoring model.
Vu Tran, Minh Le Nguyen, Ken Satoh. [pdf]
-
Legal document retrieval using document vector embeddings and deep learning.
Keet Sugathadasa, Buddhi Ayesha, Nisansa de Silva, Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, Madhavi Perera. [pdf]
Legal Text Summarization
-
Automatic summarisation of legal documents.
Claire Grover, Ben Hachey, Ian Hughson, Chris Korycinski. ICAIL 2003. [pdf]
-
Summarising legal texts: Sentential tense and argumentative roles.
Claire Grover, Ben Hachey, Chris Korycinski. NAACL 2003. [pdf]
-
A Rhetorical Status Classifier for Legal Text Summarisation.
Ben Hachey, Claire Grover. ACL Workshop 2004. [pdf]
-
Sentence extraction for legal text summarisation.
Ben Hachey, Claire Grover. IJCAI 2005. [pdf]
-
Legal Document Summarization using Latent Dirichlet Allocation.
Ravi Kumar V, K. Raghuveer. IJCST 2012. [pdf]
-
Text summarization from legal documents: a survey.
Ambedkar Kanapala, Sukomal Pal, Rajendra Pamula. Artificial Intelligence Review 2019. [pdf]
-
A Comparative Study of Summarization Algorithms Applied to Legal Case Judgments.
Paheli Bhattacharya, Kaustubh Hiware, Subham Rajgaria, Nilay Pochhi, Kripabandhu Ghosh, Saptarshi Ghosh. ECIR 2019. [pdf]
-
A Novel Approach of Augmenting Training Data for Legal Text Segmentation by Leveraging Domain Knowledge.
Rupali Sunil Wagh, Deepa Anand. Technologies and Applications 2020. [pdf]
Legal Question Answering
-
Lexical-Morphological Modeling for Legal Text Analysis.
Danilo S. Carvalho, Minh-Tien Nguyen, Chien-Xuan Tran, Minh-Le Nguyen. COLIEE 2017. [pdf]
-
Legal Question Answering using Ranking SVM and Deep Convolutional Neural Network.
Phong-Khac Do, Huy-Tien Nguyen, Chien-Xuan Tran, Minh-Tien Nguyen, Minh-Le Nguyen. COLIEE 2017. [pdf]
-
Multi-Task CNN for Classification of Chinese Legal Questions.
Guangyi Xiao, Jiqian Mo, Even Chow, Hao Chen, Jingzhi Guo, Zhiguo Gong. ICEBE 2017. [pdf]
-
Chinese Questions Classification in the Law Domain.
Guangyi Xiao, Even Chow, Hao Chen, Jiqian Mo, Jingzhi Guo, Zhiguo Gong. ICEBE 2017. [pdf]
-
Answering Legal Questions by Learning Neural Attentive Text Representation.
Phi Manh Kien, Ha-Thanh Nguyen, Ngo Xuan Bach, Vu Tran, Minh Le Nguyen, Tu Minh Phuong. ACL 2020. [pdf]
Semantic Parsing
-
Object-oriented Neural Programming (OONP) for Document Understanding.
Zhengdong Lu, Xianggen Liu, Haotian Cui, Yukun Yan, Daqi Zheng. ACL 2018. [pdf]