Cultural Heritage Datasets, Artificial Intelligence and the Ethics of Non-Intervention

It is a well-known fact that machine-learning algorithms exacerbate biases inherent in the datasets on which they were trained. In recent times, this fact has found ample evidence, e.g. in Cathy O’Neill’s book “Weapons of Math Destruction” (2016), in Kate Crawford’s and Trevor Paglen’s “Excavating AI” (2019), or in articles such as “Data and its […]

Perspectives for machine-assisted subject indexing at the Berlin State Library

Implications for the library With the constant increase in publication numbers, the question arises how the growing and extensive collections will be indexed in the future. How can an acceptable level of quality be ensured, if the available human resources do not increase at the same rate? To answer this challenge, one option is to […]

On the Use of Licences in Times of Large Language Models

It could all be so simple: cultural heritage institutions and other public sector bodies provide high-quality data on a large scale and, wherever possible, under a permissive licence such as CC0 or Public Domain Mark 1.0. This is in line with the idea that cultural heritage institutions are funded by taxes, therefore everyone should also […]