... Generalized Algorithms for Constructing Statistical Language Models Cyril Allauzen, Mehryar Mohri, Brian Roark AT&T Labs – ... with other in- formation sources to rank alternative hypotheses by as- signing them some probabilities. There are classical techniques for constructing language models such as - gram models with ... finding all for a given is . Therefore, the total cost is . For all non-empty , we create a new state and for all we set . We create a transition , and for all such that , we set . For all such...
Ngày tải lên: 08/03/2014, 04:22
... known method for estimating N-gram language models. Kneser-Ney smoothing, however, requires nonstandard N-gram counts for the lower- order models used to smooth the highest- order model. For some ... 1998), and they use a different way of computing N-gram counts for all the lower-order models used for smooth- ing. For these lower-order models, the actual cor- pus counts C(w 1 . w n ) are replaced ... provide a non-zero probability for N-grams not observed in the training corpus. The best methods for smoothing N-gram lan- guage models all use a hierarchy of lower-order models to smooth the highest-order...
Ngày tải lên: 20/02/2014, 09:20
Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf
... only one over 40%. The curves for bigram and unigram models have similar shapes, but the trigram models outperform the lower-order models. Error rates for the bigram models range from 37-45% and ... using statistical language models. In this paper, we also use support vector machines to combine features from tradi- tional reading level measures, statistical language models, and other language ... the likelihood ratio for classi- fication, we can use scores from language models as features in another classifier (e.g. an SVM). For ex- ample, perplexity (P P) is an information-theoretic measure...
Ngày tải lên: 20/02/2014, 15:20
Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx
... decades of statistical language modeling: Where do we go from here? In Proceed- ings of IEEE:88(8). Rosenfeld R. 2000. Incorporating Linguistic Structure into Statistical Language Models. In ... with a comparison of in- grammar recognition performance. 3 Language modelling To generate the different trigram language models we used the SRI language modelling toolkit (Stol- cke, 2002) with ... Danieli M., Gerbino E., Moisa L. M., and Popovici C. 1997. Contextual Information and Spe- cific Language Models for Spoken Language Un- derstanding. In Proceedings of SPECOM’97, Cluj- Napoca, Romania,...
Ngày tải lên: 22/02/2014, 02:20
Báo cáo khoa học: "Annealing Techniques for Unsupervised Statistical Language Learning" ppt
Ngày tải lên: 23/03/2014, 19:20
Báo cáo khoa học: "Continuous Space Language Models for Statistical Machine Translation" pdf
Ngày tải lên: 31/03/2014, 01:20
Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx
... has effectively used n-gram word sequence models as language models. Modern phrase-based translation using large scale n-gram language models generally performs well in terms of lexical choice, ... to incorporate large- scale n-gram language models in conjunction with incremental syntactic language models. The added decoding time cost of our syntactic language model is very high. By increasing ... Association for Computational Linguistics, pages 620–631, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics Incremental Syntactic Language Models for Phrase-based...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx
... contain the acoustic models, language model and lexicon, but the LM makes up for most of the size. The availability of data varies for the different languages, and therefore the FST sizes are ... rates were 17.0 for En- glish, 18.7 for Spanish, and 22.5 for French. For English, we also created web mixture mod- els with KN smoothing. The error rates were 16.5, 15.9 and 15.7 for the 20 MB, ... data were selected for each language. The adaptation was thought to take place off-line on a server. 3.2.1 Data sets For each language, the adaptation takes place on two baseline models, which are...
Ngày tải lên: 22/02/2014, 02:20
Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt
... as language models for statistical machine translation. In Proceed- ings of AMTA. Sylvain Raybaud, Caroline Lavecchia, David Langlois, and Kamel Sma ă li. 2009. New condence measures for statistical ... Association for Computational Linguistics Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers Deyi Xiong, Min Zhang, Haizhou Li Human Language ... to standard n-gram language mod- els in statistical machine translation: a back- ward language model that augments the con- ventional forward language model, and a mu- tual information trigger...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx
... level for PO model and MLR model (on the test folds). 8 Discussion and future research This paper has proposed the first readability for- mula” for French as a foreign language using NLP and statistical ... measures for first and second language texts. In Proceedings of NAACL HLT, pages 460–467. M. Heilman, K. Collins-Thompson, and M. Eskenazi. 2008. An analysis of statistical models and fea- tures for ... for every learner is far from easy. In this context, automatic procedures can support the teacher’s work. Some tools exist for English, but at present there are none for French as a foreign language (FFL)....
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Cutting the Long Tail: Hybrid Language Models for Translation Style Adaptation" doc
... testing for statistical machine translation: Controlling for optimizer instability. In Proceedings of the Association for Computational Lingustics, ACL 2011, Portland, Oregon, USA. Associa- tion for ... ways: only for word selection, as a frequency measure, or also for word representation, as a mapping for common words. In the former, we preserve in- flected variants that may be useful to model the language ... Association for Computational Linguistics, pages 439–448, Avignon, France, April 23 - 27 2012. c 2012 Association for Computational Linguistics Cutting the Long Tail: Hybrid Language Models for Translation...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining" ppt
... syn- tactic information in both generative and discrimi- native language models. For generative LMs, the syntactic information must be part of the generative process. Structured language modeling ... Association for Computational Linguistics, pages 175–183, Jeju, Republic of Korea, 8-14 July 2012. c 2012 Association for Computational Linguistics Fast Syntactic Analysis for Statistical Language ... USA {ariya,mdredze,khudanpur}@jhu.edu Abstract Long-span features, such as syntax, can im- prove language models for tasks such as speech recognition and machine translation. However, these language models can be dif- ficult to use in practice because...
Ngày tải lên: 16/03/2014, 19:20
Báo cáo khoa học: "Phrase-based Statistical Language Generation using Graphical Models and Active Learning" potx
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "Discriminative Pruning of Language Models for Chinese Word Segmentation" ppt
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Utilizing Dependency Language Models for Graph-based Dependency Parsing Models" pptx
Ngày tải lên: 23/03/2014, 14:20
Báo cáo khoa học: " Exploring Asymmetric Clustering for Statistical Language Modeling" docx
Ngày tải lên: 23/03/2014, 20:20
Báo cáo khoa học: "Immediate-Head Parsing for Language Models£" potx
Ngày tải lên: 31/03/2014, 04:20
Báo cáo hóa học: " Research Article Hybrid Projection Algorithms for Generalized Equilibrium Problems and Strictly " docx
Ngày tải lên: 21/06/2014, 07:20
Báo cáo hóa học: " Research Article Perturbed Iterative Algorithms for Generalized Nonlinear Set-Valued Quasivariational " pptx
Ngày tải lên: 22/06/2014, 18:20