the semi supervised model for cross lingual approach

Báo cáo khoa học: "Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context" ppt

Báo cáo khoa học: "Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context" ppt

... compare these two approaches to a baseline approach and comment on their respective performance 4.2.1 Query-genre tuning approach For the CLEF-tuning experiments we used the same translation model ... explain the unstable results Finally, we can see that the syntax-based features can be beneficial for the final retrieval quality: the models with syntax features can outperform the model basd on the ... generative model produces a set of hypotheses, and the best hypothesis is chosen afterwards via the discriminative reranking model, which allows to enrich the baseline model with the new complex...

Ngày tải lên: 24/03/2014, 03:20

11 367 0
Báo cáo khoa học: "Semi-supervised Training for the Averaged Perceptron POS Tagger" potx

Báo cáo khoa học: "Semi-supervised Training for the Averaged Perceptron POS Tagger" potx

... always uses the same tagger for tagging the unsupervised data For both English and Czech, the selection of taggers, the best combination and the best overall setup has been optimized on the development ... Evaluation of the English taggers only) on the supervised training results, where the performance of the taggers using the morphological analyzer output and using the full list of tags are nearly the same, ... replicable state-of -the- art in POS tagging) have been significantly surpassed also by the semisupervised Morˇ e (at the 99 % confidence level) c In addition, the semi- supervised Morˇ e perc forms (on single...

Ngày tải lên: 17/03/2014, 22:20

9 450 0
Tài liệu Evolving the neural network model for forecasting air pollution time series pdf

Tài liệu Evolving the neural network model for forecasting air pollution time series pdf

... (Section 2.5) was used for evolving the MLP for the forecasting problem (Fig 3) The starting populations were initialised with the random set of MLP models (see the encoding in the Section 2.4) and ... adequate information for achieving the performance described in the results Results 3.2 Validation statistics 3.1 Evolved MLP models The validation statistics is presented for each model and the reference ... Convergences resulting in the evolutionary model design The performance assessment was performed in the context of general performance (IA) and exceedance performance The latter was calculated...

Ngày tải lên: 17/02/2014, 22:20

9 537 1
Tài liệu The Dynamic Retention Model for Air Force Officers- New Estimates and Policy Simulations of the Aviator Continuation Pay Program doc

Tài liệu The Dynamic Retention Model for Air Force Officers- New Estimates and Policy Simulations of the Aviator Continuation Pay Program doc

... describes the characteristics and logic of the DRM Chapter Three compares the DRM with the ACOL model, and Chapter Four presents the modeling results Appendix A gives the estimates produced by the model, ... depend on the specifics of the ACP program in effect during the period covered by the data used to estimate the parameters of the model (By specifics we mean, for example, the amount of the annual ... advance to either the officer or the analyst The shock affects the value the individual places on staying in the military until the next decision The shock can make an individual place either a higher...

Ngày tải lên: 17/02/2014, 23:20

85 468 0
Tài liệu Báo cáo khoa học: "A Graph-based Semi-Supervised Learning for Question-Answering" doc

Tài liệu Báo cáo khoa học: "A Graph-based Semi-Supervised Learning for Question-Answering" doc

... colored nodes, have the same label, the algorithm continues to search for the secondary k neighbors, the light blue colored nodes, i.e., the neighbors of the neighbors, to find out if there are any ... SSL can improve the QA task performance when more unlabeled data is used to learn the classifier model of the models As more labeled data is introduced, Hybrid SVM models’ performance increase ... spatial information that would help to improve the the performance of the graph summary models We gradually increase the number of unlabeled data samples as shown in Table to demonstrate the effects...

Ngày tải lên: 20/02/2014, 07:20

9 507 1
Báo cáo khoa học: "Using Bilingual Parallel Corpora for Cross-Lingual Textual Entailment" pptx

Báo cáo khoa học: "Using Bilingual Parallel Corpora for Cross-Lingual Textual Entailment" pptx

... results achieved on the cross- lingual scenario, we investigate the possibility to exploit bilingual parallel corpora in the traditional monolingual scenario Using the same approach discussed in ... phrase With the second method, phrasal matches between the text and the hypothesis are indirectly performed through paraphrases of the phrase table entries The final entailment decision for a T/H ... released for the WMT101 We run TreeTagger (Schmid, 1994) for tokenization, and used the Giza++ (Och and Ney, 2003) to align the tokenized corpora at the word level Subsequently, we extracted the bilingual...

Ngày tải lên: 17/03/2014, 00:20

10 286 0
Báo cáo khoa học: "Is Machine Translation Ripe for Cross-lingual Sentiment Classification" pdf

Báo cáo khoa học: "Is Machine Translation Ripe for Cross-lingual Sentiment Classification" pdf

... estimation for covariate shift adaptation Annals of the Institute of Statistical Mathematics, 60(4) Xiaojun Wan 2009 Co-training for cross- lingual sentiment classification In Proc of the Association for ... not necessarily the case Certainly, the domain mismatch for JP is larger than DE, but this could be due to phenomenon other than MT errors Where exactly is the domain mismatch? 4.1 Theory of Domain ... mismatch implies that the input feature vectors have different distribution (e.g one dataset uses the word “excellent” often, while the other uses the word “awesome”) This degrades performance because...

Ngày tải lên: 17/03/2014, 00:20

5 374 0
Báo cáo khoa học: "Using Bilingual Comparable Corpora and Semi-supervised Clustering for Topic Tracking" ppt

Báo cáo khoa học: "Using Bilingual Comparable Corpora and Semi-supervised Clustering for Topic Tracking" ppt

... motivation for using bilingual corpora: bilingual corpora helps to collect more information about the target topic We therefore extracted monolingual(Japanese) story pairs and added them to the training ... included in the EDR bilingual dictionary For example, ’ エンデバー (Endeavour)’ which is a key term for the topic ‘Shuttle Endeavour mission for space station’ from the TDT3 corpus is not included in the ... Dec.31, 1998, against the 60 topics Each story was labelled according to whether the story discussed the topic or not Not all the topics were present in the Japanese corpora We therefore coltest data...

Ngày tải lên: 17/03/2014, 04:20

8 256 0
Báo cáo khoa học: "Semi-supervised Learning for Automatic Prosodic Event Detection Using Co-training Algorithm" doc

Báo cáo khoa học: "Semi-supervised Learning for Automatic Prosodic Event Detection Using Co-training Algorithm" doc

... significantly better than that of other semi- supervised approaches of previous work and comparable with supervised approaches For the break index detection, the learning curve of most different ... random selection at the beginning, but the saturation point is much later and therefore outperforms the random selection at the later iterations We also evaluated the effect of the amount of initial ... detection task The first issue is how to assign possible labels accurately The general method is to let the two classifiers predict the class for a given sample, and if they agree, the hypothesized label...

Ngày tải lên: 23/03/2014, 16:21

9 323 1
Báo cáo khoa học: "Improving the Performance of the Random Walk Model for Answering Complex Questions" pptx

Báo cáo khoa học: "Improving the Performance of the Random Walk Model for Answering Complex Questions" pptx

... as the sum of its relevance to the question (i.e rel(s|q)) and the similarity to other sentences in the collection (i.e sim(s, v)) The denominators in both terms are for normalization C is the ... the relatedness between the query sentences and the document sentences is an important factor, the graph-based random walk model of ranking sentences would perform better if we could encode the ... subtrees makes the TK function appropriate for syntactic trees but at the same time makes it not well suited for the semantic trees (ST) defined in Section For instance, although the two STs of...

Ngày tải lên: 23/03/2014, 17:20

4 456 0
Báo cáo khoa học: "Semi-Supervised SimHash for Efficient Document Similarity Search" pptx

Báo cáo khoa học: "Semi-Supervised SimHash for Efficient Document Similarity Search" pptx

... computationally expensive, which limits their usage for high-dimensional data This paper proposes a novel (semi- )supervised hashing method, Semi- Supervised SimHash (S3 H), for high-dimensional data similarity ... improve hashing results via the kernel trick However, KLSH is unsupervised, thus designing a data-specific kernel remains a big challenge 2.3 Semi- Supervised Hashing Semi- Supervised Hashing (SSH) ... dimension, which limits its usage for high-dimensional data (Trefethen et al., 1997) Furthermore, the variance of directions obtained by PCA decreases with the decrease of the rank (Jolliffe, 1986) Thus,...

Ngày tải lên: 30/03/2014, 21:20

9 390 0
Báo cáo khoa học: "Semi-Supervised Modeling for Prenominal Modifier Ordering" ppt

Báo cáo khoa học: "Semi-Supervised Modeling for Prenominal Modifier Ordering" ppt

... left uniform In this case, test ordering is determined by the class label alone 5.2 Semi- supervised models We now evaluate performance of the models on the scaled up training data Using the Berkeley ... output the best scoring model Because of the constraints on transition probabilthat the full-sentence n-gram model performs similarly to Malouf’s bigram model; although the re- ities, straightforward ... demonstrate cross- domain appli- ments (over 1% absolute, statistically significant) by combining the four models, indicating a continued cability of the approaches benefit of the other models, even...

Ngày tải lên: 30/03/2014, 21:20

6 274 0
Báo cáo khoa học: "Co-Training for Cross-Lingual Sentiment Classification" pot

Báo cáo khoa học: "Co-Training for Cross-Lingual Sentiment Classification" pot

... compared in the figure We can see that the performance of the cotraining approach with the balanced growth can be improved after a few iterations And the performance of the co-training approach ... etc To the best of our knowledge, cotraining has not yet been investigated for crossdomain or cross- lingual text classification 236 3.1 The Co-Training Approach Overview The purpose of our approach ... from the remaining unlabeled set Finally, the performance of the approach does not change any more, because the algorithm runs out of all possible examples in the unlabeled set Fortunately, the...

Ngày tải lên: 30/03/2014, 23:20

9 243 0
Báo cáo khoa học: "Semi-Supervised Training for Statistical Word Alignment" docx

Báo cáo khoa học: "Semi-Supervised Training for Statistical Word Alignment" docx

... stemming The approximative stemming sub -model (sub -model 9) uses the first letters of each vocabulary item as the stem for English and French while for Arabic we use the full word as the stem ... therefore performing semi- supervised training We show that semi- supervised training leads to better word alignments than running unsupervised training followed by discriminative training Another ... iterations of Model 1, iterations of the HMM model (Vogel et al., 1996), and iterations of Model We quantify the quality of the resulting hypothesized alignments with F-measure using the manually...

Ngày tải lên: 31/03/2014, 01:20

8 194 0
Báo cáo khoa học: "Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction" potx

Báo cáo khoa học: "Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction" potx

... introduce the following symbols C i refers to the context of the i -th mention Pi refers to the entity for the i -th mention Namei refers to the name string of the i -th mention CS i , j refers to the ... integers The number of integers being used may impact the final performance of the system If the number is too small, significant information may be lost during the discretization process On the other ... occurring person names Therefore, the use of frequently mentioned names in the corpus construction process does not affect the effectiveness of the learned model to be applicable to all the person names...

Ngày tải lên: 31/03/2014, 03:20

8 334 0
Transductive Support Vector Machines for Cross-lingual Sentiment Classification

Transductive Support Vector Machines for Cross-lingual Sentiment Classification

... cost-effective solution There are a few novel models have been proposed as the same problem, for example, the information bottleneck approach (Ling et al., 2008), the multilingual domain models (Gliozzo ... our approach also perform well in comparison to the supervised techniques that only employ the labeled data to learn the model shown in line (3) Because the number of unlabeled data is small for ... machine learning, there are supervised learning, semi- supervised learning and unsupervised learning that have been wide applied for real application and give a good performance Supervised learning...

Ngày tải lên: 01/08/2014, 17:53

25 389 0
Báo cáo lâm nghiệp: "Evaluation of a semi-empirical model for predicting fine root biomass in compositionally complex woodland vegetation" ppsx

Báo cáo lâm nghiệp: "Evaluation of a semi-empirical model for predicting fine root biomass in compositionally complex woodland vegetation" ppsx

... incorporated in the model However, the issue of resource heterogeneity is generic Resource patchiness can therefore serve as an explanation for the differences in model performance between the xeric ... respectively These values fit data best, i.e the regressions between TrFRB based on these settings and measured FRB showed the highest R2 compared to other approaches All other settings of the model ... improve the prediction of the model at the mesic site (Tab III, Fig 3) This indicates that inability of the model to predict FRB is not because of failure to account for the direct contribution of the...

Ngày tải lên: 07/08/2014, 16:20

8 303 0
Báo cáo y học: " Advantages of the single delay model for the assessment of insulin sensitivity from the intravenous glucose tolerance test" doc

Báo cáo y học: " Advantages of the single delay model for the assessment of insulin sensitivity from the intravenous glucose tolerance test" doc

... significant for 1/HOMA-IR, HOMA2 and for KxgI both in the whole sample (P < 0.001 for the KxgI, P = 0.002 for the 1/HOMA-IR and P = 0.001 for the HOMA2) and in the reduced sub-sample (P < 0.001 for the ... errors for the Insulin Sensitivity Indices from the Single Delay Model (KxgI) and from the Minimal Model (SI) For the KxgI the average values were computed both in the Full Sample and in the reduced ... concentrations as the true forcing function for glucose kinetics Figures and show the performance of the two models in terms of their ability to describe the observed data The apparent better fit of the Minimal...

Ngày tải lên: 13/08/2014, 16:20

20 284 0
proposing the leadership buiding model for fpt

proposing the leadership buiding model for fpt

... thesis, the writer would like to propose the Leadership Building Model for FPT Corporation, the company in the strong need of leaders to prepare for the development in the coming years The reason for ... evaluate the assignment’s result based on the plan for personal development by the mentee Internal Assessor Participating in the evaluation before and after the training Giving requirements for the model ... number of theories and models The table on page 40 gives a broad overview of the evolution of leadership models and associated theories in the 20th century: The shift from ‘great man (trait) theories’...

Ngày tải lên: 14/10/2014, 01:13

69 262 4
w