... integrated algorithm we are looking for has to be transition-based at the top level. The advantages of the graph-based approach – a more globally informed basis for the decision among different attachment ... in the beam are recalculated based on a scoring model inspired by the graph-based parsing approach, i.e., taking complete factors into account as they become incrementally available. As a consequence ... aspects of transition-based and graph-based parsing, and end up using a transition-based parser with a combined transition-based/second-order graph-based scoring model (Zhang and Clark, 2008, 567),
Uploaded: 17/03/2014, 22:20
... (MT06) and 2008 (MT08) data sets. All bilingual corpora available for the NIST 2008 constrained data track of Chinese-to-English MT task are used as training data, which contain 5.1M sentence pairs, ... language model is used as an additional feature. Phrasal rules are extracted on all bilingual data, hierarchical rules used in DHPB and reordering rules used in SCFG-HMD are extracted from a ... performance of HM decoding. We also plan to investigate more complicated reordering models in HM decoding. References: David Chiang. 2005. A Hierarchical Phrase-based Model for Statistical Machine
Uploaded: 07/03/2014, 22:20
Scientific report: "Improving Pronoun Translation for Statistical Machine Translation" docx
... translation (N-gram or syntactic tree), these agreement features can influence the translation. But this locality cannot be guaranteed in either phrase-based or syntax-based Statistical Machine Translation ... Machine Translation and Multilingual NLP. Machine Translation, 14:159–161. Franz J. Och and Hermann Ney. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational ... 311–318. Horacio Saggion and Ariadne Carvalho. 1994. Anaphora Resolution in a Machine Translation System. In Proceedings of the International Conference on Machine Translation: Ten Years On, November,
Uploaded: 17/03/2014, 22:20
Scientific report: "Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context" ppt
... Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context. Vassilina Nikoulina, Xerox Research Center Europe, vassilina.nikoulina@xrce.xerox.com ... on Statistical Machine Translation, pages 182–189. Association for Computational Linguistics. Caroline Brun, Vassilina Nikoulina, and Nikolaos Lagos. 2012. Linguistically-adapted structural ... on Statistical Machine Translation, pages 1–28, Athens, Greece, March. Association for Computational Linguistics. David Chiang, Yuval Marton, and Philip Resnik. 2008. Online large-margin training
Uploaded: 24/03/2014, 03:20
Document: Scientific report: "Modified Distortion Matrices for Phrase-Based Statistical Machine Translation" doc
... Sydney, Australia, July. Association for Computational Linguistics. Jacob Andreas, Nizar Habash, and Owen Rambow. 2011. Fuzzy syntactic reordering for phrase-based statistical machine translation. ... Chunk-lattices for verb reordering in Arabic-English statistical machine translation. Machine Translation, Published Online. David Chiang. 2005. A hierarchical phrase-based model for statistical ... Matrices for Phrase-Based Statistical Machine Translation. Arianna Bisazza and Marcello Federico, Fondazione Bruno Kessler, Trento, Italy, {bisazza,federico}@fbk.eu. Abstract: This paper presents a
Uploaded: 19/02/2014, 19:20
Scientific report: "Perplexity Minimization for Translation Model Domain Adaptation in Statistical Machine Translation" potx
... sennrich@cl.uzh.ch Abstract: We investigate the problem of domain adaptation for parallel data in Statistical Machine Translation (SMT). While techniques for domain adaptation of monolingual data can be ... borrowed for parallel data, we explore conceptual differences between translation model and language model domain adaptation and their effect on performance, such as the fact that translation models ... models An unadapted, out-of-domain language model trained on data sets provided for the WMT 2011 translation task, and an adapted language model which is the linear interpolation of all data
Uploaded: 17/03/2014, 22:20
Chemistry report: "Research Article: Flicker Compensation for Archived Film Sequences Using a Segmentation-Based Nonlinear Model" doc
... to account for spatial variability. In [3] it was observed that archive material typically has a limited dynamic range. Histogram stretching was applied to individual frames allowing the available ... spatially adaptive compensation techniques polynomials, hierarchical parameters estimation. Linear compensation: flicker is modelled as 2-parameter 2nd order polynomials, parameters estimation based ... incorporation of segmentation information enhances the accuracy and the robustness of flicker parameters estimation. Spatial adaptation requires mixed block-based/region-based frame partitioning
Uploaded: 22/06/2014, 01:20
Reordering in statistical machine translation a function word, syntax based approach
... Word, Syntax-based Approach. Hendra Setiawan. In this thesis, we investigate a specific area within Statistical Machine Translation (SMT): the reordering task, that is, the task of arranging translated words ... Chapter 2 Related Work; 2.1 Word-based Approach; 2.2 Phrase-based Approach; 2.3.1 Linguistically Syntax-based Approach; 2.3.2 Formally Syntax-based Approach ... Specifically, we focus on a class of syntax-based approaches, namely the formally syntax-based (FSB) approach. The FSB approach is unique, since it uses a syntactic formalism that is not necessarily
Uploaded: 14/09/2015, 08:47
2010 statistical power analysis a simple and general model for traditional and modern hypothesis tests
... power analysis for multifactor analysis of variance (ANOVA), including split-plot and randomized block factorial designs. Although conceptual issues for power analysis are similar in factorial ANOVA and ... However, power analysis is applicable to a very wide range of statistical tests, and the same simple and general model can be applied to virtually all of the statistical analyses you are likely to ... in small samples, there may be little reason to test for it statistically), and researchers are often unwilling to abandon the traditional criteria for statistical significance that are accepted
Uploaded: 09/08/2017, 10:28
DSpace at VNU: Dependency-based Pre-ordering For English-Vietnamese Statistical Machine Translation
... Machine Translation, Phrase-based Statistical Machine Translation. 1 Introduction. Phrase-based statistical machine translation [1] is the state of the art in SMT because of its power in modelling ... α: $d(a_i - b_{i-1}) = \alpha^{|a_i - b_{i-1} - 1|}$ (2) Moses [11] is an open-source toolkit for statistical machine translation that allows automatic training of translation models for any language pair ... Vietnamese. We evaluated our approach on English-Vietnamese machine translation tasks, and showed that it outperforms the baseline phrase-based SMT system. Keywords: Natural Language Processing, Machine
Uploaded: 11/12/2017, 11:14
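The distortion model quoted in the snippet above penalizes a phrase whose source position is far from where the previous phrase ended. A minimal Python sketch follows, assuming the usual phrase-based SMT reading in which a_i is the start position of the source phrase for the i-th target phrase and b_{i-1} is the end position of the previous source phrase. The function name and the value alpha = 0.5 are illustrative assumptions, not taken from the paper.

```python
def distortion_penalty(start_i: int, end_prev: int, alpha: float = 0.5) -> float:
    """Return alpha ** |start_i - end_prev - 1| (Eq. 2 in the snippet).

    start_i  : a_i, start position of the current source phrase (assumed meaning)
    end_prev : b_{i-1}, end position of the previous source phrase (assumed meaning)
    alpha    : illustrative decay parameter, expected in (0, 1]
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha is expected to lie in (0, 1]")
    return alpha ** abs(start_i - end_prev - 1)


# Monotone step: the phrase starts right after the previous one ended.
print(distortion_penalty(4, 3))  # 1.0
# Jump of three positions forward: 0.5 ** 3
print(distortion_penalty(7, 3))  # 0.125
```

With alpha below 1 the penalty shrinks exponentially with the jump distance, so monotone translation (no jump) is the only case that is not penalized.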
Dependency-based Pre-ordering For English-Vietnamese Statistical Machine Translation
... translation system that allows automatic training of translation models for any language pair. When we have a trained model, an efficient search algorithm quickly finds the highest probability translation ... Translation, Phrase-based Statistical Machine Translation. 1 Introduction. Phrase-based statistical machine translation [8] is the state of the art in SMT because of its power in modelling short ... based on a dependency parser in phrase-based statistical machine translation (SMT) to learn automatic and manual reordering rules from English to Vietnamese. The dependency parse trees and transformation
Uploaded: 29/01/2020, 23:17
A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
... survival analysis for all TCGA LUAD patients and GEO LUAD patients according to the 16-gene-based model stratified by clinical stage, gender, age, and smoking status. (a) TCGA LUAD patients; (b) GEO LUAD ... mechanisms of this prevalent and devastating disease. Methods. Data acquisition and preprocessing. The TCGA LUAD legacy level-3 RNA-Seq data, containing 515 tumor samples and 59 adjacent normal samples, ... promoter and result in a favorable prognosis [27]. Elevated (Table 2: Univariate and multivariate analyses of clinicopathological factors and risk model in TCGA and GEO LUAD cohorts) expression
Uploaded: 17/06/2020, 17:59
Dependency based pre ordering for english vietnamese statistical machine translation
... Machine Translation, Phrase-based Statistical Machine Translation. 1 Introduction. Phrase-based statistical machine translation [1] is the state of the art in SMT because of its power in modelling ... α: $d(a_i - b_{i-1}) = \alpha^{|a_i - b_{i-1} - 1|}$ (2) Moses [11] is an open-source toolkit for statistical machine translation that allows automatic training of translation models for any language pair ... Vietnamese. We evaluated our approach on English-Vietnamese machine translation tasks, and showed that it outperforms the baseline phrase-based SMT system. Keywords: Natural Language Processing, Machine
Uploaded: 17/03/2021, 20:26
A topic based approach for narrowing the
... top-20 accuracy results. As shown, for this dataset in ArgoUML, the accuracy achieves its highest point in the range of around 300 topics. That is, this particular data set might actually contain around ... with that of the state-of-the-art approaches: the Support Vector Machine (SVM)-based approach by Premraj et al. [19] and the approach by Lukins et al. [12] that combines LDA and Vector Space Model ... A Topic-based Approach for Narrowing the Search Space of Buggy Files from a Bug Report. Anh Tuan Nguyen, Tung Thanh Nguyen, Jafar Al-Kofahi, Hung Viet Nguyen, Tien N. Nguyen. Electrical and
Uploaded: 09/02/2022, 14:32
A finite volume SOFC model for coal based integrated gasification fuel cell systems analysis
... than activation loss and overall performance is much greater. This indicates that the activation loss parameters used by Campanari and Iora [12] and Costamagna et al. [8] are not appropriate for state-of-the-art ... pre-exponential factor and activation energy of Eqs. (7) and (8) are reported in the literature [8,11–13]. Values reported by Campanari and Iora [12] and Costamagna et al. [8] for simulating an electrolyte-supported ... 3.1.2 Activation Polarization. The activation polarization is estimated as the sum of activation polarization at each electrode-electrolyte interface: $\eta_{\mathrm{act}} = \eta_{\mathrm{act,an}}(j) + \eta_{\mathrm{act,cat}}(j)$ (4). The governing equation
Uploaded: 19/11/2022, 11:36
Document: GIVING CREDIT WHERE CREDIT IS DUE: CREATING A COMPETENCY-BASED QUALIFICATIONS FRAMEWORK FOR POSTSECONDARY EDUCATION AND TRAINING pdf
... perhaps the best illustration of the sort of broad-based, voluntary national standards organization that could serve as a model for creating a competency-based framework for noncredit occupational ... conversation about how to move the postsecondary and employment and training fields toward a qualifications framework for awarding educational credit for occupational education and training based ... can earn educational credit for technical instruction. What began as an effort to ensure that postsecondary credits can transfer has led to a process for awarding educational credit for occupational
Uploaded: 16/02/2014, 03:20
Document: Scientific report: Trophoblast-like human choriocarcinoma cells serve as a suitable in vitro model for selective cholesteryl ester uptake from high density lipoproteins pdf
... activities and regulation of transplacental transport and uptake mechanisms. To date, only limited information on binding and holoparticle-uptake of lipoproteins by choriocarcinoma cell lines is available ... Reverse-transcriptase-polymerase chain reaction. Total RNA from choriocarcinoma cell lines was isolated by using the RNeasy kit (Qiagen, Vienna, Austria). Three micrograms of total RNA were treated with ... blot analysis. Total RNA was isolated from choriocarcinoma and human liver tissues (used as a positive control) by the RNeasy kit (Qiagen) exactly as described [40]. A 553-bp fragment, amplified
Uploaded: 20/02/2014, 23:20
Document: Scientific report: "Extending the Entity-based Coherence Model with Multiple Ranks" pot
... coherence models has recently become an active research area. A particularly popular coherence model is the entity-based local coherence model of Barzilay and Lapata (B&L) (2005; 2008). This model ... of Human Language Technologies and North American Association for Computational Linguistics 2004: Short Papers, pages 1–4. Regina Barzilay and Mirella Lapata. 2005. Modeling local coherence: An entity-based ... datasets: news articles on the topic of earthquakes (Earthquakes) and narratives on the topic of aviation accidents (Accidents). A training data instance is constructed as a pair consisting of a source
Uploaded: 22/02/2014, 02:20
Scientific report: "Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish" ppt
... Columbia, Canada, October. Hany Hassan, Khalil Sima’an, and Andy Way. 2007. Supertagged phrase-based statistical machine translation. In Proceedings of the 45th ACL, pages 288–295, Prague, Czech ... the basics of phrase-based statistical machine translation (Koehn et al., 2003) and factored statistical machine translation (Koehn and Hoang, 2007). 2 Syntax-to-Morphology Mapping. In this section, ... ACL–demonstration session, pages 177–180. Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In MT Summit X. statistical machine translation In Proceedings of HLT/NAACL-2004
Uploaded: 07/03/2014, 22:20
Scientific report: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx
... character can be assigned one of two possible boundary tags: “B” for a character that begins a word and “I” for a character that occurs in the middle of a word. We denote a candidate character ... the features. We refer readers to the above paper for details. For parameter estimation, our work adopts the Passive-Aggressive (PA) framework (Crammer et al., 2006), a family of margin based ... kind of approach, the task is formulated as the classification of characters into POS tags with boundary information. Both the IOB2 representation (Ramshaw and Marcus, 1995) and the Start/End
Uploaded: 17/03/2014, 00:20
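The two-tag boundary scheme described in the snippet above ("B" begins a word, "I" continues it) can be decoded back into a segmentation with a single pass over the characters. The sketch below is only an illustration of that decoding step; the function name `tags_to_words` and the example sentence are assumptions, and it does not reproduce the paper's tagger or its POS labels.

```python
from typing import List


def tags_to_words(chars: str, tags: List[str]) -> List[str]:
    """Group characters into words using per-character B/I boundary tags."""
    if len(chars) != len(tags):
        raise ValueError("need exactly one tag per character")
    words: List[str] = []
    for ch, tag in zip(chars, tags):
        if tag not in ("B", "I"):
            raise ValueError(f"unknown boundary tag: {tag!r}")
        # "B" opens a new word; an "I" with no open word is treated as "B".
        if tag == "B" or not words:
            words.append(ch)
        else:
            words[-1] += ch
    return words


# Hypothetical example: four characters, the last two forming one word.
print(tags_to_words("我爱北京", ["B", "B", "B", "I"]))  # ['我', '爱', '北京']
```

Treating a leading "I" as a word start is one tolerant choice for malformed tag sequences; a stricter decoder could raise an error instead.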