Awesome
Knowledge Graphs and Data Integration in Database Conferences (2020)
You can check the conference prograpm webpages and attend the sessions that you are interested in.
VLDB 2020 [Conference Program, you can listen to the talk of each paper on Youtube or Bilibili]
Researh Papers:
- Mining an "Anti-Knowledge Base" from Wikipedia Updates with Applications to Fact Checking and Beyond
- KBPearl: A Knowledge Base Population System Supported by Joint Entity and Relation Linking
- Knowledge Translation [Technical Report]
- A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs
- Effective and Efficient Relational Community Detection and Search in Large Dynamic Heterogeneous Information Networks
- Obi-Wan: Ontology-Based RDF Integration of Heterogeneous Data
- Selecting Data to Clean for Fact Checking: Minimizing Uncertainty vs. Maximizing Surprise
Other Intersting Papers:
- Optimizing DNN Computation Graph using Graph Substitutions
Demo Papers:
- RDFFrames: Knowledge Graph Access for Machine Learning Tools
- SPHINX: A System for Metapath-based Entity Exploration in Heterogeneous Information Networks
Workshops:
- Fast Entity Resolution With Mock Labels and Sorted Integer Sets
- Entity Resolution on Camera Records without Machine Learning
- CheetahER: A Fast Entity Resolution System for Heterogeneous Camera Data
- An Extensible Block Scheme-Based Method for Entity Matching
- Spread the good around! Information Propagation in Schema Matching and Entity Resolution for Heterogeneous Data
- Intermediate Training of BERT for Product Matching
- Towards Guaranteeing Global Consistency for Peer-based Data Integration Architecture
- Integration of Fast-Evolving Data Sources Using A Deep Learning Approach
- Reliable Clustering with Applications to Data Integration
Recommend Sessions (Tokyo time):
- W2_3-6 Tuesday, September 1st 2020, 11:00 am - Knowledge Graphs
- Day3-Block1 Thursday, September 3rd 2020, 6:00 pm - Knowledge Bases
- Day2-Block3 Thursday, September 3rd 2020, 7:00 am - Knowledge Graphs & Hypergraphs
- Pre-Conference-Workshop Monday, August 31st 2020, 4:00 pm - DI2KG (1)
- Pre-Conference-Workshop Monday, August 31st 2020, 11:00 pm - DI2KG (2) (Same as DI2KG(1))
You can also go to the graph sessions, where a lot of novel graph algorithms (e.g., subgraph seraching) are proposed...
ICDE 2020 [Conference Program, videos and slides avaliable]
Research Papers:
- Optimizing Knowledge Graphs through Voting-based User Feedback [Video][Slides][Paper]
- AutoSF: Searching Scoring Functions for Knowledge Graph Embedding [Video][Slides][Paper]
- Semantic Guided and Response Times Bounded Top-k Similarity Search over Knowledge Graphs [Video][Slides][Paper]
- Online Indices for Predictive Top-k Entity and Aggregate Queries on Knowledge Graphs [Video][Slides][Paper]
- Sya: Enabling Spatial Awareness inside Probabilistic Knowledge Base Construction [Video][Slides][Paper]
- Crowdsourced Collective Entity Resolution with Relational Match Propagation [Video][Slides][Paper]
- Improving Neural Relation Extraction with Implicit Mutual Relations [Video][Slides][Paper]
- Dataset Discovery in Data Lakes [Video][Slides][Paper]
- TransN: Heterogeneous Network Representation Learning by Translating Node Embeddings [Video][Slides][Paper]
Recommened Sessions:
- R02: Data Integration and Machine Learning (Tuesday 21st April, 10:00-11:30)
- R08: Graph and Social Networks 2 (Tuesday 21st April, 13:30-15:00)
- R13: Data Cleaning, Curation and Analytics (Wed 22nd April, 12:00-13:30)
- R18: Search and Information Extraction (Wed 22nd April, 14:00-15:30)
SIGMOD 2020 [Conference Program (user account required, you can ask Xin for the account)]
Research Papers:
- A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching [Paper]
- ZeroER: Entity Resolution using Zero Labeled Examples [Paper]
- Towards Interpretable and Learnable Risk Analysis for Entity Resolution [Paper]
- SLIM: Scalable Linkage of Mobility Data [Paper]
- Learning Over Dirty Data Without Cleaning [Paper]
- Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks [Paper]
- SPARQL Rewriting: Towards Desired Results [Paper]
- Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study [Paper]
Demos:
Tutorials:
- State of the Art and Open Challenges in Natural Language Interfaces to Data [PDF], Tuesday 1:30 PM – 3:00 PM
- Automating Exploratory Data Analysis via Machine Learning: An Overview [PDF], Thursday 10:30 AM – 12:00 PM
Industries:
- AliCoCo: Alibaba E-commerce Cognitive Concept Net [Paper]
- An Ontology-Based Conversation System for Knowledge Bases [Paper]
- GIANT: Scalable Creation of a Web-scale Ontology [Paper]
- Entity Matching in the Wild: a Consistent and Versatile Framework to Unify Data in Industrial Applications [Paper]
Other interesting papers:
- Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach [Paper]
- Minimization of Classifier Construction Cost for Search Queries [Paper]
- Complaint-driven Training Data Debugging for Query 2.0 [Paper]
- Densely Connected User Community and Location Cluster Search in Location-Based Social Networks [Paper]
- Reliable Data Distillation on Graph Convolutional Network [Paper]
- DB4ML – An In-Memory Database Kernel with Machine Learning Support [Paper]
- Efficient Algorithms for Densest Subgraph Discovery on Large Directed Graphs [Paper]
- Organizing Data Lakes for Navigation [Paper]
- Finding Related Tables in Data Lakes for Interactive Data Science [Paper]
- Web Data Extraction using Hybrid Program Synthesis: A Combination of Top-down and Bottom-up Inference [Paper]
- Cleaning Denial Constraint Violations through Relaxation [Paper]
- SCODED: Statistical Constraint Oriented Data Error Detection [Paper]
- A Method for Optimizing Opaque Filter Queries [Paper]
Recommened Sessions:
- Research 13: Data Matching, Wednesday 10:30 AM – 12:00 PM
- Research 15: Machine Learning for Cleaning, Integration, and Search, Wednesday 10:30 AM – 12:00 PM
- Research 9: Data Cleaning, Wednesday 4:30 AM – 6:00 PM (I think this should be 4:30PM, the official website has a typo)
- Research 22: Data Lakes, Web, and Knowledge Graph, Thursday 10:30 AM – 12:00 PM
- Industry 3: Graph Databases and Knowledge Bases, Wednesday 4:30 PM -6:00 PM
KDD 2020 [Videos avaliable (you can ask Professor Chen for the account)]
Research Papers and Applied Data Science Track Papers:
- Dynamic Knowledge Graph based Multi-Event Forecasting
- Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion
- Incremental Mobile User Profiling: Reinforcement Learning with Spatial Knowledge Graph for Modeling Event Streams
- MultiImport: Inferring Node Importance in a Knowledge Graph from Multiple Input Signals
- REA: Robust Cross-lingual Entity Alignment Between Knowledge Graphs
- AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types (Applied Data Science Track)
- Domain Specific Knowledge Graphs as a Service to the Public (Applied Data Science Track)
- BOND: Bert-Assisted Open-Domain Named Entity Recognition with Distant Supervision
- CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring
- Representing Temporal Attributes for Schema Matching
- Automatic Validation of Textual Attribute Values in ECommerce Catalog by Learning with Limited Labeled Data (Applied Data Science Track)
VLDBJ 2020
- A survey of community search over big graphs
- An analytical study of large SPARQL query logs
- Snorkel: rapid training data creation with weak supervision
- Automatic weighted matching rectifying rule discovery for data repairing
- Diversified spatial keyword search on RDF data
- RDF graph summarization for first-sight structure discovery [Paper]
TKDE 2020
- r-HUMO: A Risk-Aware Human-Machine Cooperation Framework for Entity Resolution with Quality Guarantees
- Bayesian Networks for Data Integration in the Absence of Foreign Keys
- Efficient Entity Resolution on Heterogeneous Records
- Generalized Translation-Based Embedding of Knowledge Graph
- Joint Learning of Question Answering and Question Generation