Title | Authors |
--- | --- |
An API oriented open-source Python framework for unsupervised learning on graphs | Benedek Rozemberczki, Olivér Kiss and Rik Sarkar |
Little Ball of Fur: A Python Library for Graph Sampling | Benedek Rozemberczki, Olivér Kiss and Rik Sarkar |
MindReader: Recommendation over Knowledge Graph Entities with Explicit User Ratings | Anders Brams, Anders Jakobsen, Theis Jendal, Matteo Lissandrini, Peter Dolog and Katja Hose |
Event-QA: A Dataset for Event-Centric Question Answering over Knowledge Graphs | Tarcísio Souza Costa, Simon Gottschalk and Elena Demidova |
Fine-Grained Relevance Annotations for Multi-Task Document Ranking and Question Answering | Sebastian Hofstätter, Markus Zlabinger, Mete Sertkan, Michael Schröder and Allan Hanbury |
GeoFlink: A Distributed and Scalable Framework for the Real-time Processing of Spatial Streams | Salman Shaikh, Komal Mariam, Hiroyuki Kitagawa and Kyoung-Sook Kim |
CC-News-En: A Large English Newswire Corpus | Joel Mackenzie, Rodger Benham, Matthias Petri, Johanne Trippas, J. Shane Culpepper and Alistair Moffat |
CauseNet: Towards a Causality Graph Extracted from the Web | Stefan Heindorf, Yan Scholten, Henning Wachsmuth, Axel-Cyrille Ngonga Ngomo and Martin Potthast |
A Multidimensional Dataset for Analyzing and Detecting News Bias based on Crowdsourcing | Michael Färber, Victoria Burkard, Adam Jatowt and Sora Lim |
A Dataset of Journalists’ Interactions With Their Readership: When Should Article Authors Reply to Reader Comments? | Julian Risch and Ralf Krestel |
TweetsCOV19 – A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic | Dimitar Dimitrov, Erdal Baran, Pavlos Fafalios, Ran Yu, Xiaofei Zhu, Matthäus Zloch and Stefan Dietze |
Web Page Segmentation Revisited: Evaluation Framework and Dataset | Johannes Kiesel, Florian Kneist, Lars Meyer, Kristof Komlossy, Benno Stein and Martin Potthast |
The Newspaper Navigator Dataset: Extracting Headlines and Visual Content From 16 Million Historic Newspaper Pages in Chronicling America | Benjamin Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter, Chris Adams, Nathan Yarasavage, Deborah Thomas, Kate Zwaard and Daniel Weld |
Enslaved Dataset: A Real-world Complex Ontology Alignment Benchmark using Wikibase | Lu Zhou, Cogan Shimizu, Pascal Hitzler, Alicia Sheill, Seila Gonzalez Estrecha, Catherine Foley, Duncan Tarr and Dean Rehberger |
Argo Lite: Open-Source Interactive Graph Exploration and Visualization in Browsers | Siwei Li, Zhiyan Zhou, Anish Upadhayay, Omar Shaikh, Scott Freitas, Haekyu Park, Zijie J. Wang, Susanta Routray, Matthew Hull and Duen Horng Chau |
MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities | Jason Armitage, Endri Kacupaj, Golsa Tahmasebzadeh, Swati, Maria Maleshkova, Ralph Ewerth and Jens Lehmann |
MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction | Linyi Yang, Jiazheng Li, Barry Smyth and Ruihai Dong |
GeoLink Cruises: A Non-Synthetic Benchmark for Co-Reference Resolution on Knowledge Graphs | Reihaneh Amini, Lu Zhou and Pascal Hitzler |
A Large Test Collection for Entity Aspect Linking | Jordan Ramsdell and Laura Dietz |
PrivacyFL: A simulator for privacy-preserving and secure federated learning | Vaikkunth Mugunthan, Lalana Kagal and Anton Peraire-Bueno |
A Large-Scale Search Clarification Data Collection | Hamed Zamani, Gord Lueck, Everest Chen, Rodolfo Quispe, Flint Luu and Nick Craswell |
Feature Extraction for Large-Scale Text Collections | Luke Gallagher, Antonio Mallia, Shane Culpepper, Torsten Suel and Barla Cambazoglu |
ContentWise Impressions: An industrial dataset with impressions included | Fernando Benjamín Pérez Maurera, Maurizio Ferrari Dacrema, Lorenzo Saule, Mario Scriminaci and Paolo Cremonesi |
ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research | Xinyi Zhou, Apurva Mulay, Emilio Ferrara and Reza Zafarani |
ReQue: A Configurable Workflow and Dataset Collection for Query Refinement | Mahtab Tamannaee, Hossein Fani, Fattane Zarrinkalam, Jamil Samouh, Samad Paydar and Ebrahim Bagheri |
SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs | Samaneh Jozashoori, Enrique Iglesias, David Chaves-Fraga, Diego Collarana and Maria-Esther Vidal |
BioKG: A Knowledge Graph for Relational Learning On Biological Data | Brian Walsh, Sameh Mohamed and Vit Novacek |
Falcon 2.0: An Entity and Relation Linking Tool over Wikidata | Ahmad Sakor, Kuldeep Singh, Anery Patel and Maria-Esther Vidal |
LensKit for Python: Next-Generation Software for Recommender Systems Experiments | Michael Ekstrand |
ORCAS: 20 Million Clicked Query-Document Pairs for Analyzing Search | Nick Craswell, Daniel Campos, Bhaskar Mitra, Emine Yilmaz and Bodo Billerbeck |
Flexible IR Pipelines with Capreolus | Andrew Yates, Kevin Martin Jose, Xinyu Zhang and Jimmy Lin |
Profiling Entity Matching Benchmark Tasks | Anna Primpeli and Christian Bizer |