Publications

MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts

Macko, D., Kopál, J., Moro, R., Srba, I. – Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Revisiting Algorithmic Audits of TikTok: Poor Reproducibility and Short-term Validity of Findings

Mosnar, M., Skurla, A., Pecher, B., Tibensky, M., Jakubčík, J., Bindas, A., Sakalík, P., Srba, I. – SIGIR ’25: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Evaluation of LLM Vulnerabilities to Being Misused for Personalized Disinformation Generation

Zugecova, A., Macko, D., Srba, I., Moro, R., Kopál, J., Marcinčinová, K., Mesarčík, M. – Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?

Čegiň, J., Šimko, J., Brusilovsky, P. – Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2025

RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces

Leon-Martinez, S., Kang, J., Moro, R., de Rijke, M., Kveton, B., Oosterhuis, H., Bielikova, M. – SIGIR ’25: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Model-based Algorithmic Auditing of Social Media AI Algorithms

Srba, I., Pecher, B., Simko, J., Moro, R., Bielikova, M. – Proceedings of the Fourth European Workshop on Algorithmic Fairness, PMLR, 2025

Multilingual vs Crosslingual Retrieval of Fact-checked Claims: A Tale of Two Approaches

Ramponi, A., Rovera, M., Moro, R., Tonelli, S. – Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Long Papers, 2025

A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages

Anikina, T., Cegin, J., Simko, J., Ostermann, S. – Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Long Papers, 2025

Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance

Pecher, B., Srba, I., Bielikova, M. – Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Long Papers, 2025

Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification

Cegin, J., Pecher, B., Simko, J., Srba, I., Bielikova, M., Brusilovsky, P. – Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Learning action embeddings for off-policy evaluation

Cief, M., Golebiowski, J., Schmidt, P., Abedjan, Z., Bekasov, A. – 46th European Conference on Information Retrieval, 2024

Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation

Cegin, J., Pecher, B., Simko, J., Srba, I., Bielikova, M., Brusilovsky, P. – Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) – ACL 2024, 2024

A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness

Pecher, B., Srba, I., Bielikova, M. – ACM Computing Surveys (ACM CSUR), 2024

A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts

Tripto, N. I., Venkatraman, S., Macko, D., Moro, R., Srba, I., Uchendu, A., Le, T., Lee, D. – Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) – ACL 2024, 2024

Disinformation Capabilities of Large Language Models

Vykopal, I., Pikuliak, M., Srba, I., Moro, R., Macko, D., Bielikova, M. – Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) – ACL 2024, 2024

IMGTB: A Framework for Machine-Generated Text Detection Benchmarking

Spiegel, M., Macko, D. – Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2024

KInIT at SemEval-2024 Task 8: Fine-tuned LLMs for Multilingual Machine-Generated Text Detection

Spiegel, M., Macko, D. – Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024

EAMT 2024: Multilinguality in the VIGILANT project

Spillane, B., Scarton, C., Moro, R., Ivanov, P., Tagarev, A., Simko, J., Abu Farha, I., Munnelly, G., Uhlárik, F., Heppell, F. – The 25th Annual Conference of The European Association for Machine Translation, 2024

Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank

Leon-Martinez, S. – WSDM ’24: Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices

Pecher, B., Srba, I., Bielikova, M. – Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation

Pecher, B., Cegin, J., Belanec, R., Simko, J., Srba, I., Bielikova, M. – Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AI Research is not Magic, it has to be Reproducible and Responsible: Challenges in the AI field from the Perspective of its PhD Students

Hrckova, A., Renoux, J., Calasanz, R. T., Chuda, D., Tamajka, M., Simko, J. – arXiv preprint, 2024

Automated, not Automatic: Needs and practices in European fact-checking organizations as a Basis for Designing Human-Centered AI Systems

Hrckova, A., Moro, R., Srba, I., Simko, J., Bielikova, M. – arXiv preprint, 2024

Auditing YouTube’s Recommendation Algorithm for Misinformation Filter Bubbles

Srba, I., Moro, R., Tomlein, M., Pecher, B., Simko, J., Stefancova, E., Kompan, M., Hrckova, A., Podrouzek, J., Gavornik, A., Bielikova, M. – ACM Transactions on Recommender Systems (ACM TORS), 2023

KInITVeraAI at SemEval-2023 Task 3: Simple yet Powerful Multilingual Fine-Tuning for Persuasion Techniques Detection

Hromadka, T., Smolen, T., Remis, T., Pecher, B., Srba, I. – Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 629–637, Toronto, Canada. Association for Computational Linguistics, 2023

Stance on the regulation of Generative Artificial Intelligence

Mesarcik, M., Slosiarova, N., Podrouzek, J., Bielikova, M. – KInIT report, 2023

Report on the Current State of Societal Biases in Slovak AI

Pikuliak, M., Oreško, Š., Burda, K., Gavorník, A., Mesarčík, M., Hrčková, A., Kottulová, J., Szapuová, M., Podroužek, J., Šimko, M. – 2023

ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness

Cegin, J., Simko, J., Brusilovsky, P. – Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

Macko, D., Moro, R., Uchendu, A., Lucas, J., Yamashita, M., Pikuliak, M., Srba, I., Le, T., Lee, D., Simko, J., Bielikova, M. – Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing – EMNLP 2023, 2023

Multilingual Previously Fact-Checked Claim Retrieval

Pikuliak, M., Srba, I., Moro, R., Hromadka, T., Smoleň, T., Melišek, M., Vykopal, I., Simko, J., Podroužek, J., Bielikova, M. – Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing – EMNLP 2023, 2023