Use this URL to cite this publication: https://cris.mruni.eu/cris/handle/007/17832
Corpora for Bilingual Terminology Extraction in Cybersecurity Domain
Type of publication
Article in peer-reviewed foreign conference proceedings (P1g)
Author(s)
Author | Affiliation |
---|---|
Utka, Andrius | Vytauto Didžiojo universitetas |
Rokas, Aivaras | Vytauto Didžiojo universitetas |
Bielinskienė, Agnė | Vytauto Didžiojo universitetas |
Title [en]
Corpora for Bilingual Terminology Extraction in Cybersecurity Domain
Date Issued
2021
Extent
p. 11-15
Is part of
Proceedings of CLARIN Annual Conference 2021, 27 – 29 September 2021 Virtual Edition / edited by Monica Monachini, Maria Eskevich. [Utrecht] : [Utrecht University], 2021.
Abstract (en)
The paper presents English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the DVITAS project. It argues that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it makes it possible to exploit the strengths and avoid the weaknesses of comparable and parallel resources. Special focus is given to the open nature of the data, which is achieved by publishing the data in the CLARIN-LT repository.
Type of document
type::text::conference output::conference proceedings::conference paper
ISSN (of the container)
2773-2177
eLABa
106450440
Coverage Spatial
Netherlands (NL)
Language
English (en)
Bibliographic Details
17
Project(s)
The research is carried out under the project “Bilingual Automatic Terminology Extraction” funded by the Research Council of Lithuania.
The project is also included as a use case in COST action “European Network for Web-Centred Linguistic Data Science”.
Access Rights
Mixed Open Access