{"id":8082,"date":"2021-02-16T22:37:26","date_gmt":"2021-02-16T21:37:26","guid":{"rendered":"https:\/\/kinit.sk\/publication\/cross-lingual-learning-for-text-processing-a-survey\/"},"modified":"2026-04-23T14:31:29","modified_gmt":"2026-04-23T12:31:29","slug":"cross-lingual-learning-for-text-processing-a-survey","status":"publish","type":"publication","link":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/","title":{"rendered":"Cross-lingual Learning for Text Processing: A&nbsp;Survey"},"content":{"rendered":"<div id=\"\" class=\"element core-paragraph\">\n<p><strong>Pikuliak, M., Simko, M., Bielikova, M.<\/strong><\/p>\n<\/div>\n\n<div id=\"\" class=\"element core-paragraph\">\n<p id=\"sp005\">Abstract: Many intelligent systems in business, government or academy process natural language as an input during inference or they might even communicate with users in natural language. Natural language processing is currently often done with machine learning models. However, machine learning needs training data and such data are often scarce for low-resource languages. The lack of data and the resulting poor performance of natural language processing can be solved with cross-lingual learning. Cross-lingual learning is a paradigm for transferring knowledge from one natural language to another. The transfer of knowledge can help us overcome the lack of data in the target languages and create intelligent systems and machine learning models for languages where this was not previously possible.<\/p>\n<\/div>\n\n<div id=\"\" class=\"element core-paragraph\">\n<p id=\"sp010\">Despite its increasing popularity and potential, no comprehensive survey on cross-lingual learning has been conducted so far. We survey 173 text processing cross-lingual learning papers and examine tasks, datasets and languages that were used. 
The most important contribution of our work is that we identify and analyze four types of cross-lingual transfer based on \u201cwhat\u201d is being transferred. Such insight might help other NLP researchers and practitioners to understand how to use cross-lingual learning for a wide range of problems. In addition, we identify what we consider to be the most important research directions that might help the community to focus their future work in cross-lingual learning. We present a comprehensive table of all the surveyed papers with various data related to the cross-lingual learning techniques they use. The table can be used to find relevant papers and compare the approaches to cross-lingual learning. To the best of our knowledge, no survey of cross-lingual text processing techniques has been done at this scope before.<\/p>\n<\/div>\n\n<div id=\"\" class=\"element core-paragraph\">\n<p>Attachment: <a href=\"https:\/\/kinit.sk\/public\/cll.html\" target=\"_blank\" rel=\"noreferrer noopener\">Interactive Table<\/a><\/p>\n<\/div>\n\n<div id=\"\" class=\"element core-paragraph\">\n<p>Cite: Pikuliak, M., Simko, M., Bielikova, M. Cross-lingual learning for text processing: A survey. Expert Systems with Applications 165 (2021). 
DOI: <a href=\"https:\/\/doi.org\/10.1016\/j.eswa.2020.113765\" target=\"_blank\" rel=\"noreferrer noopener\">10.1016\/j.eswa.2020.113765<\/a><\/p>\n<\/div>","protected":false},"featured_media":0,"template":"","meta":{"_acf_changed":false,"footnotes":""},"categories":[76,82,236],"class_list":["post-8082","publication","type-publication","status-publish","hentry","category-natural-language-processing-sk","category-2021-sk","category-bielikovam-sk"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Cross-lingual Learning for Text Processing: A&nbsp;Survey - KInIT<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/\" \/>\n<meta property=\"og:locale\" content=\"sk_SK\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cross-lingual Learning for Text Processing: A&nbsp;Survey - KInIT\" \/>\n<meta property=\"og:description\" content=\"Pikuliak, M., Simko, M., Bielikova, M. 
Abstract: Many intelligent systems in business, government or academy process natural language as an input during inference or they might even communicate with users...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/\" \/>\n<meta property=\"og:site_name\" content=\"KInIT\" \/>\n<meta property=\"article:modified_time\" content=\"2026-04-23T12:31:29+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/kinit.sk\/wp-content\/uploads\/2021\/03\/KINIT_Sharepic.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@kinit\" \/>\n<meta name=\"twitter:label1\" content=\"Predpokladan\u00fd \u010das \u010d\u00edtania\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 min\u00faty\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/publikacia\\\/cross-lingual-learning-for-text-processing-a-survey\\\/\",\"url\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/publikacia\\\/cross-lingual-learning-for-text-processing-a-survey\\\/\",\"name\":\"Cross-lingual Learning for Text Processing: A&nbsp;Survey - 
KInIT\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/kinit.sk\\\/#website\"},\"datePublished\":\"2021-02-16T21:37:26+00:00\",\"dateModified\":\"2026-04-23T12:31:29+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/publikacia\\\/cross-lingual-learning-for-text-processing-a-survey\\\/#breadcrumb\"},\"inLanguage\":\"sk-SK\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/kinit.sk\\\/sk\\\/publikacia\\\/cross-lingual-learning-for-text-processing-a-survey\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/publikacia\\\/cross-lingual-learning-for-text-processing-a-survey\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Natural Language Processing\",\"item\":\"https:\\\/\\\/kinit.sk\\\/sk\\\/category\\\/natural-language-processing-sk\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Cross-lingual Learning for Text Processing: A&nbsp;Survey\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/kinit.sk\\\/#website\",\"url\":\"https:\\\/\\\/kinit.sk\\\/\",\"name\":\"KInIT\",\"description\":\"Vyu\u017e\u00edvame v\u00fdskum pre \u013eud\u00ed a priemysel\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/kinit.sk\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"sk-SK\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Cross-lingual Learning for Text Processing: A&nbsp;Survey - KInIT","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/","og_locale":"sk_SK","og_type":"article","og_title":"Cross-lingual Learning for Text Processing: A&nbsp;Survey - KInIT","og_description":"Pikuliak, M., Simko, M., Bielikova, M. Abstract: Many intelligent systems in business, government or academy process natural language as an input during inference or they might even communicate with users...","og_url":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/","og_site_name":"KInIT","article_modified_time":"2026-04-23T12:31:29+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/kinit.sk\/wp-content\/uploads\/2021\/03\/KINIT_Sharepic.png","type":"image\/png"}],"twitter_card":"summary_large_image","twitter_site":"@kinit","twitter_misc":{"Predpokladan\u00fd \u010das \u010d\u00edtania":"2 min\u00faty"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/","url":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/","name":"Cross-lingual Learning for Text Processing: A&nbsp;Survey - 
KInIT","isPartOf":{"@id":"https:\/\/kinit.sk\/#website"},"datePublished":"2021-02-16T21:37:26+00:00","dateModified":"2026-04-23T12:31:29+00:00","breadcrumb":{"@id":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/#breadcrumb"},"inLanguage":"sk-SK","potentialAction":[{"@type":"ReadAction","target":["https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/kinit.sk\/sk\/publikacia\/cross-lingual-learning-for-text-processing-a-survey\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/kinit.sk\/sk\/"},{"@type":"ListItem","position":2,"name":"Natural Language Processing","item":"https:\/\/kinit.sk\/sk\/category\/natural-language-processing-sk\/"},{"@type":"ListItem","position":3,"name":"Cross-lingual Learning for Text Processing: A&nbsp;Survey"}]},{"@type":"WebSite","@id":"https:\/\/kinit.sk\/#website","url":"https:\/\/kinit.sk\/","name":"KInIT","description":"Vyu\u017e\u00edvame v\u00fdskum pre \u013eud\u00ed a 
priemysel","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/kinit.sk\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"sk-SK"}]}},"_links":{"self":[{"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/publication\/8082","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/publication"}],"about":[{"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/types\/publication"}],"version-history":[{"count":5,"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/publication\/8082\/revisions"}],"predecessor-version":[{"id":41948,"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/publication\/8082\/revisions\/41948"}],"wp:attachment":[{"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/media?parent=8082"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kinit.sk\/sk\/wp-json\/wp\/v2\/categories?post=8082"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}