Use this URL to cite the publication: https://cris.mruni.eu/cris/handle/007/18579
Building of Parallel and Comparable Cybersecurity Corpora for Bilingual Terminology Extraction
Type of publication
Article in peer-reviewed foreign conference proceedings (P1g)
Author(s)
Author | Affiliation |
---|---|
Utka, Andrius | |
Title [en]
Building of Parallel and Comparable Cybersecurity Corpora for Bilingual Terminology Extraction
Date Issued
2022
Extent
p. 126-138
Is part of
Selected Papers from the CLARIN Annual Conference 2021 Virtual Event, 2021, 27–29 September / edited by Monica Monachini and Maria Eskevich. [Utrecht] : [Linköping University Electronic Press], 2022. ISBN 9789179294441.
Series/Report no.
Linköping Electronic Conference Proceedings, ISSN 1650-3686; vol. 189
Abstract (en)
The paper presents English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the DVITAS project. It argues that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it makes it possible to combine the strengths of comparable and parallel resources efficiently while avoiding their weaknesses. Special attention is given to the availability of sources in the cybersecurity domain and to issues related to copyright-protected publications, as well as to the data curation performed for building the corpora and depositing them in the CLARIN-LT repository.
Type of document
type::text::conference output::conference proceedings::conference paper
ISBN (of the container)
9789179294441
eLABa
138186370
Coverage Spatial
Netherlands (NL)
Language
English (en)
Bibliographic Details
32
Project(s)
“Bilingual Automatic Terminology Extraction”
Funding(s)
The Research Council of Lithuania
Access Rights
Open Access