Proposing a Semantic Tagging Model on Bilingual English Vietnamese Corpus

Chủ nhật - 20/08/2023 04:12
In this article, we report on the results of the study on building a semantic tagging system using the English-Vietnamese bilingual corpus to create a lexical resource with lexical notes based on translation similarities, transfer of vocabulary, and classification schemes through bilingual connections. This tool plays an important role in building and developing natural languages processing systems such as automatic translation, text summary, text extraction, information retrieval, and question-answer automatic. In this context, our goal to see English as the source language is roughly translated into Vietnamese, we use an English-Vietnamese bilingual concept dictionary for the purpose of annotating the semantics of words. In our experiments, we used hand-annotated vocabulary sets, compared with the proposed model results, and the results were achieved based on an average vocabulary coverage of 83.05%, with accuracy reaching up to 82.40%. This test can be applied to build lexical la-bels for other languages than bilingual corpus whose source language is English.
Link: https://link.springer.com/chapter/10.1007/978-981-99-4725-6_63
 Tags: Fira, BDU, CNTT, NCKH, Tạp trí

Tổng số điểm của bài viết là: 0 trong 0 đánh giá

Click để đánh giá bài viết

  Ý kiến bạn đọc

Icon-Zalo Zalo Icon-Messager Messenger Icon-Youtube Youtube Icon-Instagram Maps
Bạn đã không sử dụng Site, Bấm vào đây để duy trì trạng thái đăng nhập. Thời gian chờ: 60 giây