Latent space models for opinion mapping and vote prediction
Which latent space models from machine learning or item-response theory can most accurately predict user responses in political questionnaires based on a few initial answers (Bachmann et al., 2024)?
How do the sparsity of the data and the dimensionality of the latent space affect reconstruction and imputation error?
How do voters’ and candidates’ answers in political surveys differ, and how does this difference impact the distribution of their positions in the latent space?
More information will follow soon.
2024
ECML/PKDD
Fast and Adaptive Questionnaires for Voting Advice Applications
Fynn Bachmann, Cristina Sarasua, and Abraham Bernstein
In Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track
The effectiveness of Voting Advice Applications is often compromised by the length of their questionnaires. To address user fatigue and incomplete responses, some applications (such as the Swiss Smartvote) offer a condensed version of their questionnaire. However, these condensed versions cannot ensure the accuracy of recommended parties or candidates, which we show to remain below 40%. To address these limitations, this work introduces an adaptive questionnaire approach that selects subsequent questions based on users’ previous answers, aiming to enhance recommendation accuracy while reducing the number of questions posed to the voters. Our method uses an encoder and decoder module to predict missing values at any completion stage, leveraging a two-dimensional latent space that is reflective of the traditional methods used in political science for visualizing ideology. Additionally, a selector module is proposed to determine the most informative subsequent question based on the voter’s current position in the latent space and the remaining unanswered questions. We validated our approach using the Smartvote dataset from the Swiss Federal elections in 2019, testing various spatial models and selection methods to optimize the system’s predictive accuracy. Our findings indicate that employing the IDEAL model both as encoder and decoder, combined with a PosteriorRMSE method for question selection, significantly improves the accuracy of recommendations, achieving 74% accuracy after asking the same number of questions as in the condensed version.