In this short film, Barry Schouten discusses connection between surveys and AI-ML models with Stephanie Eckman, Christoph Kern and Bolei Ma. The following discussion points are covered in this short movie:
– What is Social Data Science
– Differences and similarities between Social Science and Social Data Science
– Similarity between annotators labeling cases for Data Science datasets and survey respondents or interviewers
– Sources for biases of data annotators
– How to account for biases while working with models in Data Science
– Annotation sensitivity
– Conceptual shifts in Data Science
– Robustness of models in Data Science
– Future of representation error research in annotation sensitivity research
– Potential of using ML models in cross-cultural research
– Two cases of using ML models in surveys (Supermarket receipts, Hate-speech)
– Potential of using AI for data annotation
References:
Kern, C., Eckman, S., Beck, J., Chew, R., Ma, B., & Kreuter, F. (2023). Annotation sensitivity: Training data collection methods affect model performance. arXiv preprint arXiv:2311.14212.
Beck, J., Eckman, S., Ma, B., Chew, R., & Kreuter, F. (2024, March). Order effects in annotation tasks: Further evidence of annotation sensitivity. In Proceedings of the 1st Workshop on Uncertainty-Aware NLP (UncertaiNLP 2024) (pp. 81-86).
Eckman, S., Ma, B., Kern, C., Chew, R., Plank, B., & Kreuter, F. (2025). Correcting Annotator Bias in Training Data: Population-Aligned Instance Replication (PAIR). arXiv preprint arXiv:2501.06826.
Affiliations:
Dr. Stephanie Eckman – Data Scientist at Social Data Science Center, University of Maryland
Prof. Dr. Christoph Kern – Junior Professor of Social Data Science and Statistical Learning, Chair of Statistics and Data Science in Social Sciences and the Humanities (SODA) at the Ludwig-Maximilians-University of Munich; Project Director at the Mannheim Centre for European Social Research (MZES)
Bolei Ma – Researcher and PhD student at Social Data Science and AI Lab at the Ludwig-Maximilians-University of Munich
Prof. Dr. Barry Schouten – Senior Methodologist at Centraal Bureau voor de Statistiek (CBS), the Netherlands.