PrefLib

PrefLib is a reference library of preference data maintained by Nicholas Mattei and Simon Rey. The original version of this website was developed by Nicholas Mattei and Toby Walsh.

We aim at providing a comprehensive resource for the multiple research communities that deal with preferences: computational social choice, recommender systems, data mining, machine learning, combinatorial optimization, to name just a few.

The strength of PrefLib is to provide carefully curated data, formatted in a unified format. We encourage the users to read the detailed explanations that we provide regarding the format and the modelisation choices. Once everything is clear, feel free to explore the datasets we are hosting, or to search for specific files that may interest you.

PrefLib-Tools

Providing data is only the first step, the next one being actually using the data. To help you with that, we provide a Python library specifically designed to work with PrefLib instances: the PrefLib-Tools. This library is distributed in PyPi.

Have a look at the preflibtools repository, where you can download the code; and check out the documentation!

Data Usage and Citation Policy

Constructing and maintaining this website and its database requires a lot of work. We ask that you provide a reference to our website when publishing research based on data gathered here. Here are some references you can use.

Nicholas Mattei and Toby Walsh. PrefLib: A Library of Preference Data. Proceedings of Third International Conference on Algorithmic Decision Theory (ADT 2013) — PDF — Bibtex.
Nicholas Mattei and Toby Walsh. A Preflib.org Retrospective: Lessons Learned and New Directions. Trends in Computational Social Choice — PDF — Bibtex.

In addition, many dataset have specific citation requirements. Make sure to always include them whenever you used a file taken from such a dataset (especially if you downloaded aggregated data files).

Contributing to PrefLib

We rely on the support of the community in order to increase the usefulness and coverage of this site. If you want to donate a new dataset, report an issue with an existing dataset, or suggest changes to the website, several GitHub repositories are at your disposal.

PrefLib-Contrib: used as a discussion platform around PrefLib.
PrefLib-Data: hosts the (raw) data and the related scripts.
PrefLib-Django hosts the Django project for the website.
PrefLib-Tools hosts the code for the PrefLib-Tools.

If you need anything, have a look at those repositories, open new issues, comments, like, subscribe and share the word!

In Brief

We currently host:

8 types of data
62 datasets
17775 data files
More than 4.4 GB of data

The community

We want to thank all who participated in the development of PrefLib. This list may not be exhaustive, but we hope it is. Contact us if you feel unjustly treated.

Haris Aziz and Omer Lev: who provided great help at the launch of the website, discussing the design of the first version of the website, and troubleshooting it.
Niclas Boehmer: who contributed a large amount of data, the datasets summarized on this page (together with Nathan Schaar), the Polkadot Network and the Kusama Network datasets.
Dominik Peters: who provided useful feedback, created the amazing new logo, committed some code, and contributed the Breakfast dataset.
Jeffrey O'Neill: maintainer of the website OpenSTV.org. (now OpaVote) who contributed a lot of datasets: irish, Burlington, Glasgow, Aspen, Berkley, Minneapolis, Oakland, Pierce, San Francisco, San Leandro, Takoma Park.
Robert Bredereck: who contributed the F1 and Skiing, Web Search, and the Web Search Clean datasets.
Jean-Francois Laslier: who contributed the datasets French Presidental Election, Proto French Election, and its rating counterpart; together with Karine Van der Straeten, and Michel Balinski.
Andrew Mao: who contributed the Dots, and Puzzle datasets.
Nicolaus Tideman and Florenz Plassmann: who contributed the ERS dataset.
Piotr Faliszewski: who contributed the AGH dataset.
Carleton Coffrin: who contributed the T-Shirt dataset.
Lihi Dery: who contributed the Social Recommendation dataset.
Toshihiro Kamishima: who contributed the Sushi dataset.
Michal Regenwetter and Anna Popova: who contributed the APA dataset.
Jeremy A. Hansen: who contributed the Vermont District Races dataset.
Alejandro Rosete Suarez and Milton Garcia Borroto: who contributed the Cujae dataset.
Ulle Endriss: who contributed the Poster Competition dataset and part of the AAMAS dataset.
Ioannis Caragiannis: who contributed the Cities Survey dataset.
John P. Dickerson: who contributed the Kidney dataset.
Rafael Bordini, Edith Elkind, John Thangarajah, and David Shield: who contributed part of the AAMAS dataset.
David Manlove: who contributed the Project Bidding dataset.
Hongning Wang: who contributed the Trip Advisor dataset.
Martin Lackner: who contributed the Parliamentary Elections dataset.
Andrzej Kaczmarczyk: who contributed the Camp Songs dataset.

These Papers are Using PrefLib

Below is a list of papers that have made use of or directly referenced data stored here at PrefLib. The papers have been automatically added from Google Scholar, if there is a problem with a paper or if you want to add one paper, please contact us.

« 1 2 3 4 5 6 7 8 9 »

Analyzing the practical relevance of the Condorcet loser paradox and the agenda contraction paradox. F Brandt, C Geist, M Strobel, Evaluating Voting Systems with Probability Models:Essays by and in Honor of William Gehrlein and Dominique Lepelley (2020).
Branch-and-cut-and-price for the cardinality-constrained multi-cycle problem in kidney exchange. E Lam, V Mak-Hau, Computers & Operations Research (2020).
Modelling rankings in R: The PlackettLuce package. HL Turner, J van Etten, D Firth, I Kosmidis, Computational Statistics (2020).
PLMIX: an R package for modelling and clustering partially ranked data. C Mollica, L Tardella, Journal of Statistical Computation and Simulation (2020).
Preference elicitation and robust winner determination for single-and multi-winner social choice. T Lu, C Boutilier, Artificial Intelligence (2020).
Stable matching with uncertain linear preferences. H Aziz, P Biró, S Gaspers, R De Haan, N Mattei et al., Algorithmica (2020).
Strategic voting in the lab: compromise and leader bias behavior. R Meir, K Gal, M Tal, Autonomous Agents and Multi-Agent Systems (2020).
Algorithms for destructive shift bribery. A Kaczmarczyk, P Faliszewski, Autonomous Agents and Multi-Agent Systems (2019).
An experimental view on committees providing justified representation. R Bredereck, P Faliszewski, A Kaczmarczyk et al., Proceedings of the International Joint Conference on Artificial Intelligence (2019).
Asymptotic properties of Lee distance. NI Nikolov, E Stoimenova, Metrika (2019).
Between Discord and Deadlock: Consensus Under a Deadline. M Bannikova, L Dery, S Obraztsova et al., arXiv preprint (2019).
Bottom-up approaches to achieve Pareto optimal agreements in group decision making. V Sanchez-Anguix, R Aydoğan, T Baarslag et al., Knowledge and Information Systems (2019).
Building ethically bounded AI. F Rossi, N Mattei, Proceedings of the AAAI Conference on Artificial Intelligence (2019).
Complexity of manipulating and controlling approval-based multiwinner voting. Y Yang, Proceedings of the International Joint Conference on Artificial Intelligence (2019).
Exploring the no-show paradox for Condorcet extensions using Ehrhart theory and computer simulations. F Brandt, J Hofbauer, M Strobel, Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (2019).

Even more references, papers, and tutorials can be found in the proceedings of the EXPLORE Workshops:

Some Tools to Work with Preferences

Empirical experiments with real data are becoming a more fundamental part of work in computational social choice. In addition to a lightweight set of tools for working with data from PrefLib we also host documentation for several of these project. Please contact Nick if you have code that you would like to share with the community.

Iterative Voting Simulator

This is a voting simulator built for the paper A Local-Dominance Theory of Voting Equilibria. We are releasing its source code to be expanded and enhanced by the community. However, it is quite versatile in its current construction, and can be used for various simulations "as is".

Learn more.

Kidney Dataset Generator

Kidney failure is a life-threatening health issue that affects hundreds of thousands of people worldwide. In the US alone, the waitlist for a kidney transplant has over 100,000 patients. This list is growing: demand far outstrips supply.This codebase includes: structural elements of kidney exchange like "pools", "hospitals", and "pairs", a couple of kidney exchange graph generators, a couple of kidney exchange solvers (max weight, failure-aware, fairness-aware, individually rational), and a dynamic kidney exchange simulator.

Learn more.

CRISNER: A Qualitative Preference Reasoner for CP-nets, TCP-nets, CP-Theories

CRISNER stands for Conditional and Relative Importance Statement Network PrEference Reasoner. It can reason about ceteris paribus preference languages such as CP-nets, TCP-nets and CP-theories. Given a preference specification (a set of preference statements) in one of these languages, CRISNER succinctly encodes its induced preference graph (IPG) into a Kripke structure model in the language of the NuSMV model checker. This Kripke model is reachability-equivalent to the induced preference graph. CRISNER generates the model only once, and then translates each query posed against this preference specification into a temporal logic formula in computation-tree temporal logic (CTL) such that the formula is verified in the Kripke model if and only if the query holds true according to the ceteris paribus semantics of the preference language. The model checker either affirms the query or returns with a counterexample. For answering queries related to equivalence and subsumption checking of two sets of preferences, CRISNER constructs a combined IPG and uses temporal queries in CTL to identify whether every dominance that holds in one also holds in the other, and vice-versa.

Learn more.

PrefLib

PrefLib-Tools

Data Usage and Citation Policy

Contributing to PrefLib

In Brief

Other Links

The community

These Papers are Using PrefLib

Some Tools to Work with Preferences

Iterative Voting Simulator

Kidney Dataset Generator

CRISNER: A Qualitative Preference Reasoner for CP-nets, TCP-nets, CP-Theories