Web & User Data Processing

Publication

Authors Macko, D., Pulver, A.

PerQ: Efficient Evaluation of Multilingual Text Personalization Quality

Since no metrics are available to evaluate specific aspects of a text, such as its personalization quality, the researchers often rely solely on large language models to meta-evaluate such texts. Due to internal biases of individual language models, it is recommended to use multiple of them for combined evaluation, which directly increases costs of such metaevaluation. In this paper, a computationally efficient method for evaluation of personalization quality of a given text (generated by a language model) is introduced, called PerQ. A case study of comparison of generation capabilities of large and small language models shows the usability of the proposed metric in research, effectively reducing the waste of resources.

Cite: Macko, D., & Pulver, A. (2025). PerQ: Efficient Evaluation of Multilingual Text Personalization Quality. arXiv preprint arXiv:2509.25903.

Web & User Data Processing

PerQ: Efficient Evaluation of Multilingual Text Personalization Quality

Authors

Dominik Macko

Andrew Pulver

Why partner with KInIT