robust word sense translation

Word sense disambiguation: scaling up, domain adaptation and application to machine translation

... English Words; 8.1.2 Word Sense Disambiguation for Machine Translation ... The process of identifying the correct meaning, or sense, of a word in context is known as word sense ... Disambiguation; 2.2 Domain Adaptation for Word Sense Disambiguation; 2.3 Word Sense Disambiguation for Machine Translation; 3 Our Word Sense Disambiguation System; 3.1 Knowledge ... English translations as word senses would be ideal, they instead predicted the HowNet (Dong, 2000) sense of a word and then used the English gloss of the HowNet sense as the WSD model’s predicted translation

Uploaded: 11/09/2015, 16:08

Scientific report: "Knowledge-rich Word Sense Disambiguation Rivaling Supervised Systems"

... Wikipedia to WordNet. During the first phase of our methodology we aim to establish links between Wikipages and WordNet senses. Formally, given the entire set of pages Senses_Wiki and WordNet senses Senses_WN, ... : Senses_Wiki → Senses_WN, such that, for each Wikipage w ∈ Senses_Wiki: µ(w) = s ∈ Senses_WN(w) if a link can be established, where Senses_WN(w) is the set of senses of the lemma of w in WordNet ... Senses_Wiki → Senses_WN 1: for each w ∈ Senses_Wiki 2: µ(w) := ε 3: for each w ∈ Senses_Wiki 4: if |Senses_Wiki(w)| = |Senses_WN(w)| = 1 then 5: µ(w) := w_n^1 6: for each w ∈ Senses_Wiki 7:

Uploaded: 17/03/2014, 00:20

Scientific report: "Word Sense Induction for Novel Sense Detection"

... to automatically induce word senses of a target word, and demonstrate that our word sense induction method can be used to automatically detect words with emergent novel senses, as well as token ... occurrence of the target word. As our baseline representation, we use a bag of words, where word frequency is kept but not word order. All words are lemmatised, and stopwords and low frequency terms ... Dong-Hong Ji, and Chew-Lim Tan. 2007. I2R: Three systems for word sense discrimination, Chinese word sense disambiguation, and English word sense disambiguation. In Proceedings of the Fourth International

Uploaded: 17/03/2014, 22:20

Scientific report: "Word Sense Disambiguation Improves Information Retrieval"

... This paper proposes the use of word senses to improve the performance of IR. We propose an approach to annotate the senses for short queries. We incorporate word senses into the language modeling ... of word senses because of the overall poor performances of their baseline method and their system. Instead of using fine-grained sense inventory, Kim et al. (2004) tagged words with 25 root senses ... http://lemurproject.org/indri/ 4 Word Sense Disambiguation. In this section, we first describe the construction of our WSD system. Then, we propose the method of assigning senses to query terms. 4.1 Word sense disambiguation

Uploaded: 23/03/2014, 14:20

Scientific report: "Latent Semantic Word Sense Induction and Disambiguation"

... overview of word sense induction research. A thorough survey on word sense disambiguation (including unsupervised induction algorithms) is presented in Navigli (2009). Algorithms for word sense induction ... assigning a sense label to a particular instance of a word in context, algorithms up till now use pre-defined sense inventories (such as WordNet) that often contain fine-grained sense distinctions, ... sense induction can roughly ... algorithms extract the different senses of a word on a per-word basis, i.e. the different senses for each word are determined separately. They can be further subdivided

Uploaded: 23/03/2014, 16:20

Domain adaptation and training data acquisition in wide coverage word sense disambiguation and its application to information retrieval

... ambiguous word can be determined by its context. A word sense is a representation of one meaning of a word. The task of identifying the correct sense of an ambiguous word in context is known as word sense ... Statistical Machine Translation; 2.5.2 Word Sense Disambiguation in Information Retrieval; 2.5.3 Word Sense Disambiguation in Other NLP Tasks; Chapter 3 An Open Source Word Sense Disambiguation ... Knowledge Resources for Word Sense Disambiguation; 1.3 SensEval Workshops; 1.4 Difficulties in Supervised Word Sense Disambiguation; 1.5 Applications of Word Sense Disambiguation

Uploaded: 09/09/2015, 10:06

Building word sense taxonomy and automatic annotation for Mandarin Chinese 2

... of our word sense taxonomy. We believe that the performance of word is decided by word meaning, and word sense is expressed in the form of the usage of the word. We study the issue of word sense ... study word sense class and word sense taxonomy. We created a word sense taxonomy for Chinese nouns, verbs and adjectives, in terms of natural language processing. We then conduct automatic word sense class ... distinguishes our word sense taxonomy from other word sense taxonomies built so far. Additionally, we propose the complementary relationship of internal and external features of word sense class in

Uploaded: 09/09/2015, 18:57

Application of generic sense classes in word sense disambiguation

... set of senses out of WORDNET fine-grained senses. This method is to eliminate some of the finer senses of a word by keeping only one sense per LF. For instance, the four senses of building in WORDNET are ... Sense Enumerative Lexicon, or the idea that words come with lists of senses, each list meant for a given individual word. In contrast, we propose generalizing senses across word boundaries, as sense ... 1.1 Word Sense Disambiguation. By definition, Word Sense Disambiguation (WSD) is the task of identifying the correct sense of a word in a given context. This definition involves the concept of sense

Uploaded: 11/09/2015, 14:22

EXPLOITING TAGGED AND UNTAGGED CORPORA FOR WORD SENSE DISAMBIGUATION

... using WordNet2 as sense inventory. When disambiguating the word “boot” in the phrase “boot sector”, the sense tagger will assign this instance with one of the senses of word “boot” listed in WordNet ... networks for word sense disambiguation. In the VLNNS, each word was linked to its senses, which were themselves linked to the words in their definitions, which were in turn linked to their senses, etc. ... obtain sense tagged corpus. They used the information from WordNet to formulate queries consisting of synonyms or definitions of word senses, and obtained additional training data for word senses

Uploaded: 12/09/2015, 11:05

Automatic generation of labelled data for word sense disambiguation

... the words have 3 senses on average, with a maximum of 5 senses and a minimum of 2 senses. 4.3 Sense Mapping from SENSEVAL to WordNet. Since we need to use the information of WordNet, we have to map the sense ... “not sure”. As for SENSEVAL-2, it’s easier to map the sense from SENSEVAL format to WordNet because WordNet 1.7 itself points out the mapping. Given a SENSEVAL2 or WordNet sense key K, lookup ... multiple translations for a polysemous word. If the correct sense can be determined, then we can find the corresponding translation for the word. For example, the word “accident” has two meanings. The translation

Uploaded: 30/09/2015, 14:24

Japanese-Vietnamese compound word machine translation

... Japanese compound word translation. The Japanese compound word translation is primarily the translation of JCNs because there is no good corpus for translating JCVs while the idiosyncratic translations ... sub-problem, compound word translation is no exception either. In fact, the translation of compound words, which mainly include compound nouns, is a major issue in machine translation due to their ... The context of translation often decides which compound word translation is correct among translation candidates. The following examples of the constructional variability in the translations of

Uploaded: 22/06/2016, 15:49

DSpace at VNU: Adaptively entropy-based weighting classifiers in combination using Dempster-Shafer theory for word sense disambiguation

... samples of Senseval-2 and Senseval-3 resulting in an improvement in overall accuracy. © 2009 Elsevier Ltd. All rights reserved. Keywords: Computational linguistics; Classifier combination; Word sense ... role in WSD. For predicting senses of a word, information usually used in previous studies is the topic context which is represented as bag of words. In for determining word sense in many studies later ... speaking, WSD is the task of associating a given word in a text or discourse with an appropriate sense among numerous possible senses of that word. This is only an “intermediate task” which necessarily

Uploaded: 16/12/2017, 09:18

Japanese-Vietnamese compound word machine translation

... Japanese compound word translation. The Japanese compound word translation is primarily the translation of JCNs because there is no good corpus for translating JCVs while the idiosyncratic translations ... sub-problem, compound word translation is no exception either. In fact, the translation of compound words, which mainly include compound nouns, is a major issue in machine translation due to their ... The context of translation often decides which compound word translation is correct among translation candidates. The following examples of the constructional variability in the translations of

Uploaded: 22/01/2021, 19:14

Document: Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation"

... context word which co-occurs with the translation rule. 3.1 Context Features. In the hierarchical phrase-based translation method, the translation rules are extracted by abstracting some words ... the sense similarity measure. 5 Experiments. We evaluate the algorithm of bilingual sense similarity via machine translation. The sense similarity scores are used as feature functions in the translation ... previous work that uses the VSM to compute sense similarity for terms from parallel corpora. The sense similarities, i.e. the translation probabilities in a translation model, for units from parallel

Uploaded: 20/02/2014, 04:20

Scientific report: "A Word-Class Approach to Labeling PSCFG Rules for Machine Translation"

... distinguishing between one-word, two-word, and multiple-word phrases as follows: Each one-word phrase with tag T simply receives the label T, instead of T-T. Two-word phrases with tag sequence ... Since consecutive words within a rule stem from consecutive words in the training corpus and thus are already consistent, the boundary word tags are more informative than tags of words between the ... the result of looking up the class of a word that is out of bounds (e.g., the left context of the first word of a sentence, or the second word of a one-word phrase). We now map our phrase pair

Uploaded: 23/03/2014, 16:20

Bilingual sentence alignment based on sentence length and word translation

... suitable for large corpora while Chen constructs a word-to-word translation model during alignment to assess the probability of an alignment. Word correspondence was further developed in IBM Model-1 ... Figure 3.3 English word clustering data; Figure 3.4 Vietnamese word clustering data; Figure 3.5 Bilingual dictionary; Figure 3.6 Looking up the probability of a word pair; Figure ... text and its translation (a bitext map) [14]. This is the first stage for Statistical Machine Translation. With aligned sentences, we can perform further analyses such as phrase and word alignment

Uploaded: 15/07/2015, 12:49

Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation

... novel approach translation model for detecting word errors. A phrase is considered as a contiguous sequence of words and is extracted from the word-aligned value of each target word is then computed ... source words which the target word is aligned to. To facilitate the alignment representation, we apply the BIO1 format: in case multiple target words are aligned with one source word, the first word’s ... target word is generated by a particular source word. We also apply them in our experiments, containing: • Source alignment context features: the combinations of the target word and one word before

Uploaded: 13/08/2015, 10:00

Reordering in statistical machine translation: a function word, syntax-based approach

... third line to the word alignment. The word alignment takes the format of (i:j) where i refers to the position of the English word while j to the position of the aligned Chinese word. Statistics ... lexical items (individual word and even multi-word). This characteristic allows Hiero to capitalize on the phrase-based approach’s strength of modeling multi-word translation. Lastly, the correspondences ... words as well as function words are modeled identically in a fine-grained manner. Unfortunately, in modeling content words, FSB models may run into data sparsity issues since unlike function words,

Uploaded: 14/09/2015, 08:47

Enhancing the quality of machine translation system using cross-lingual word embedding models

... word based context words as shown in Figure 2.1. In this architecture, the projection layer is common among all words, the input is a window of n future words and n history words of the current word ... approach to handle the translation of unknown words in NMT by encoding unknown words as a sequence of subword units. This method is based on the intuition that a variety of word classes are translated ... Neural Machine Translation System; 2.2 Word Embedding; 2.2.1 Monolingual Word Embedding Models; 2.2.2 Cross-Lingual Word Embedding Models; 3 Using Cross-Lingual Word Embedding

Uploaded: 14/10/2019, 23:57

Graduation thesis: strategies to deal with non-equivalence at word level in translation

... expressive word; 4.1.4 Translation by cultural substitution; 4.1.5 Translation using a loan word or loan word plus explanation; 4.1.6 Translation by paraphrasing; 4.1.7 Translation ... tackle non-equivalence at word level; 4.1.1 Translation by a more specific word (hyponym); 4.1.2 Translation by a more general word (superordinate); 4.1.3 Translation by a more neutral/less ... the extent of translation (full translation versus partial translation); the grammatical rank at which the translation equivalence is established (rank bound translation vs unbounded translation);

Uploaded: 06/07/2020, 10:56
