large vocabulary speech recognition based on statistical methods

Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition

... M ONOPHONE AND T RIPHONE HMM A LIGNMENT L ABELS Alignment # Hidden Units Label Dev Accuracy Monophone 1.5K Monophone State 55.5% Triphone 1.5K Monophone State 59.1% TABLE IV C OMPARISON OF C ONTEXT ... the performance difference between using a mono-phone alignment and a tri-mono-phone alignment, using monomono-phone state labels and tri-phone senone labels, using 1.5K and 2K hidden units in ... additional benefits Our evaluation was done on LVSR instead of phoneme recognition tasks as was the case in [30]–[32], [59] It represents the first large vocabulary application of a pre-trained, deep

Ngày tải lên: 03/01/2023, 13:17

13 8 0

Báo cáo hóa học: " Anomalous heat transfer modes of nanofluids: a review based on statistical analysis" doc

... review based on statistical analysis Antonis Sergis*and Yannis Hardalupas Abstract This paper contains the results of a concise statistical review analysis of a large amount of publications regardingthe ... sec-tion contains the main conclusions reached by the cur-rent review Characteristics of nanofluids This section epitomizes the most common nanofluid preparation methods by providing information ... Experiments focusing on Convection heat transfer (Continued)lubrication inside HFC134a refrigerant fluid along with NPs.Conventionally Polyol- ester (POE) is used as a lubricant lubrication inside HFC134a

Ngày tải lên: 21/06/2014, 03:20

37 339 0

Emotion an attention recognition based on biological signals and images

... Emotion and Attention Recognition Based on Biological Signals and Images Seyyed Abed Hosseini Additional information is available at the end of the chapter 1 Emotion and attention recognition based ... applications The book Emotion and Attention Recognition Based on Biological Signals and Images attempts to introduce the different soft computing approaches and technologies for recognition of emotion, ... (fNIRS), and functional magnetic resonance imaging (fMRI) have a great help in understanding the mentioned cognitive processes Emotion, stress, and attention recognition systems based on different

Ngày tải lên: 16/01/2018, 08:55

94 131 0

Improving the Competitiveness for Enterprises in Brand Recognition Based on Machine Learning Approach45290

... of convolution and pooling occurring in an alternating fashion Convolution layers is one of the most layers in CNN structure It has two types, including Convolution Filter and Convolutional ... brand can consist of multiple logo classes There are some methods for brand recognition, in particular logo identification 2.2.1 Decision tree methods Decision tree is known as classification and ... and online marketing, etc It is considered to view as a task of multidimensional image classification but brand recognition cannot be solved directly by applying traditional image recognition

Ngày tải lên: 30/03/2022, 11:56

14 6 0

hybrid radar emitter recognition based on rough k means classifier and relevance vector machine

... radar emitter recognition model is proposed In Section 3, the primary recognition is introduced In Section 4, the advanced recognition is introduced In Section 5, the computational complexity ... in Section 6, and conclusions are given in Section 7 2 Radar Emitter Recognition System A combination of multiple classifiers is a powerful solution for difficult pattern recognition problems ... primary recognition and the accuracy of the advanced recognition The samples that the primary recognition rejects are classified by the advanced recognition So the estimate of recognition accuracy

Ngày tải lên: 02/11/2022, 11:38

18 2 0

Báo cáo hóa học: " Research Article Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction" pot

... work on automatic cued speech translation In this paper, we only address the problem of automatic cued speech manual gesture recognition Such a gesture recognition issue is really com-mon from ... during automatic recognition. Figure 1: French cued speech specifications: on the left, 5 diﬀerent hand locations coding vowels; on the right, 8 diﬀerent hand shapes coding consonants vowel of ... succession of consonants (which are coded as CV with invisible vow-els) the change of configuration is really fast On the contrary, at the end of a sentence, the constraints are less strong and

Ngày tải lên: 22/06/2014, 00:20

19 540 0

a novel voice activity detection based on phoneme recognition using statistical model

... detection based on phoneme recognition using statistical model Xulei Bao*and Jie Zhu Abstract In this article, a novel voice activity detection (VAD) approach based on phoneme recognition using ... method based on the recursive phoneme recognition and noise suppression methods is given in Section 3 The detail experiments and simulation results are shown in Section 4 Finally, the discussion ... employed to represent each speech/non-speech segment The main idea of this new method is regarding the non-speech as a new phoneme corresponding to the conventional phonemes in mandarin, and all

Ngày tải lên: 02/11/2022, 08:52

10 4 0

Luận văn thạc sĩ: Speech emotion recognition using fuzzy inference system based on fuzzy associative memory

... tools for speech emotion feature extraction and tools for emotion classification Speech Emotion Recognition Toolbox is an open toolbox It meansSERT allows users to integrate extra functions and ... using FAM-based Fuzzy Inference SystemThe system model of speech emotion recognition using fuzzy inference system based on fuzzy associative memory is showed in [Figure 3-11] The model consists ... Determination of | ! Inference phase membership fuzzy rule base l M function tà)[Figure 3-11] The model of SER using FAM-based fuzzy inference systemFor implementation, a toolbox for speech motion recognition

Ngày tải lên: 08/11/2024, 17:07

128 1 0

Báo cáo khoa học: "Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech" docx

... combination of the weights of insertion, deletion and substi- tution. The relation is shown in equation (2), where ins, del and sub are the appearance times of insertions, deletions and substitutions, ... Automatic Speech Recognition (ASR), in which we get the speech scoring features as well as the textual transcriptions of the speech- es. Then, the second step could grade the text-free transcription ... an (conventional) AES system. The present work is mainly about the AES system un- der the certain situation as the examination grading criterion is more concerned about the integrated con- tent

Ngày tải lên: 07/03/2014, 18:20

10 325 0

Báo cáo khoa học: "A Large-Scale Uniﬁed Lexical-Semantic Resource Based on LMF" docx

... application: domain-specific lex-icons are extracted from ontology specifications and merged with existing LSRs on demand As a consequence, there is no available large-scale in-stance of theLEMONmodel ... primarily contain information on lexical-semantic relations, such as synonymy, and use synsets (groups of lexemes that are synony-mous) as organizational units FN focuses on groups of lexemes that ... UBY-API without losing information As the conversion is largely performed automatically, systematic errors and information loss could be introduced by a faulty conversion routine In or-der to detect

Ngày tải lên: 17/03/2014, 22:20

11 483 0

A marketing science perspective on recognition-based heuristics (and the fast-and-frugal paradigm) ppt

... majority of consumers (76%) were making consideration decisions based on non-compensatory heuristics—not recognition alone but rather conjunctions (and disjunctions of conjunctions) of aspects ... might expect con-sumers to rely less on recognition 4.2 In new situations consumers learn deci-sion rules by self-reflection Hauser, Dong, and Ding (2011) sought to test three com-mon methods of ... good decisions They use other decision rules in other situations 3.2 Are recognition-based heuristics eco-logically rational in brand choice? We have argued that consumers use recognition as a

Ngày tải lên: 29/03/2014, 20:20

13 556 0

Báo cáo hóa học: " A broadly applicable method to characterize large DNA viruses and adenoviruses based on the DNA polymerase gene" doc

... families Several small regions of conservation were iden-tified Only one region with conservation of at least 5 con-secutive amino acids was found among nearly all sequences evaluated This was ... and Baculoviridae reveal two regions that display a high level of conservation [1] The upstream region showed two different contiguous sequences of conservation with potential for degenerate ... additional sequences also provided EcoRI and BglII restriction enzyme recognition sites to the lower primer and EcoRI and XbaI recognition sites to the upper primers that could be used for cloning

Ngày tải lên: 20/06/2014, 01:20

10 468 0

Báo cáo toán học: " DWT and LPC based feature extraction methods for isolated word recognition" pptx

... extraction and its normalization are described in Section 3 The various experiments and recognition results are given in Section 4 Section 5 gives the concluding remarks based on the experimentation ... normalization; hidden Markov model 1 Introduction A speech recognition system has two major components, namely, feature extraction and classification Feature extraction method plays a vital role in speech ... Provisional PDF corresponds to the article as it appeared upon acceptance Fully formattedPDF and full text (HTML) versions will be made available soon. DWT and LPC based feature extraction methods

Ngày tải lên: 20/06/2014, 20:20

31 580 1

Báo cáo hóa học: " Indoor localization based on cellular telephony RSSI fingerprints containing very large numbers of carriers" potx

... validation strategy Table 1 presents the classification results for SVMs with linear and Gaussian kernels, respectively (see Sec-tion 3.1) in one-vs-one and one-vs-all configuraSec-tions Results ... the classification results for linear and Gaussian SVMs in the one-vs-one and one-vs-all config-urations, with results from a non-PCA K-NN classifier also provided for comparison The meaning of ... all points in a large dataset is expensive and time consuming Finally, in Section 5, we present some conclusions and ideas for further study An appendix provides basic information on the machine

Ngày tải lên: 21/06/2014, 00:20

14 400 0

Báo cáo hóa học: " Research Article Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy" pdf

... adapted to other speech/nonspeech discrimination applications Two methods for speech/music classiﬁcation for multimedia applications were compared in [16] The ﬁrst method is based on a zero-crossing ... Focus conditions of the BNSI database FC F0 F1 F2 F3 F4 F5 Fx Description Read studio speech Spontaneous studio speech Clean telephone speech Speech with music background Read or spontaneous speech ... Gatica-Perez, “Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing

Ngày tải lên: 21/06/2014, 19:20

13 382 0

báo cáo hóa học:" Research Article Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition" pptx

... mainly focuses on speech analysis [23, 25–28], pitch estimation [4, 29], speech enhancement [30, 31], speech recognition [32], speaker recognition [33], and speech separation [34] These methods basically ... resolution than the traditional FFT-based method, a more accurate pitch estimate, and have shown to be beneficial for speech enhancement, speech recognition, speaker recognition, and monaural speech ... Consonant Challenge Speech Recognition Experiments 6.1 Speech Corpus The experiments were conducted on the intervocalic consonants (VCV) provided by the Interspeech 2008 Consonant Challenge [42]

Ngày tải lên: 21/06/2014, 20:20

14 385 0

báo cáo hóa học:" Decision tree-based acoustic models for speech recognition" potx

... 2Decision tree-based acoustic models for speech recognition Masami Akamine*1 and Jitendra Ajmera2 Trang 3Keywords: speech recognition; acoustic modeling; decision trees; probability estimation; ... the purpose of large vocabulary speech recognition [7] We propose various methods to improve DT-based acoustic models (DTAMs) In addition to the continuous acoustic feature questions previously ... Provisional PDF corresponds to the article as it appeared upon acceptance Fully formattedPDF and full text (HTML) versions will be made available soon Decision tree-based acoustic models for speech

Ngày tải lên: 21/06/2014, 20:20

32 289 0

Báo cáo hóa học: " Research Article Robust In-Car Speech Recognition Based on Nonlinear Multiple Regressions" pot

... outlined in Section In Section 5, we present the environmental adaptation and model compensation algorithms Then the performance evaluation on the adaptive regression -based speech recognition framework ... speakers, visor mic speech) Nonspeech R K-means Cluster ID clustering Test word Speech X(L) N(L) Estimation Recognition Figure 5: Diagram of adaptive regression -based speech recognition X(L) , N(L) ... stationary noise (e.g., air conditioner on) , but has some problems in the nonstationary noise (e.g., CD player on) CONCLUSIONS In this paper, we have proposed a nonlinear multiple-regression-based...

Ngày tải lên: 22/06/2014, 23:20

10 220 0

Báo cáo hóa học: " Speech/Non-Speech Segmentation Based on Phoneme Recognition Features" doc

... phoneme recognition features were designed to follow the basic concept of this kind of classification, where one class -speech defines another non -speech For this purpose, four measures based on ... which confirmed our expectations that probably the most suitable representation for SNS classification is a combination of acoustic- and recognition- based features The proposed phoneme recognition ... Section the phoneme recognition features are proposed We give the basic ideas behind introducing such a representation of audio signals for SNS segmentation and define four features based on consonant-vowel...

Ngày tải lên: 22/06/2014, 23:20

13 229 0

Tài liệu Báo cáo khoa học: "Learning Sub-Word Units for Open Vocabulary Speech Recognition" doc

... have a unique segmentation, shared by all common tokens Words are converted into phonetic representations according to their most likely dictionary pronunciation; non-dictionary words use the L2S ... F Wessel, R Schluter, K Macherey, and H Ney 2001 Conﬁdence measures for large vocabulary continuous speech recognition IEEE Transactions on Speech and Audio Processing, 9(3) Christopher White, ... pronunciation for each word It is straightforward to extend to multiple pronunciations by ﬁrst sampling a pronunciation for each word and then sampling a segmentation for that pronunciation Once an...

Ngày tải lên: 20/02/2014, 04:20

10 446 0