thêm tài liệu thành công vào corpus

Building ecotourism corpus and application of corpus based materials in vocabulary teaching master of tesol

Building ecotourism corpus and application of corpus based materials in vocabulary teaching master of tesol

... 2.2 Corpus linguistics 11 2.3 History of corpus 13 2.4 Types of corpora 14 2.5 Corpora in language teaching 16 2.6 The corpus approach (data-driven learning approach) 19 2.7 Corpus-based ... the corpus-based materials in vocabulary teaching 1.5 Terms and concepts Corpus “ A corpus is principally a collection of textbooks which is stored in a computer (McCarten, 2007 2).” Trang 10Corpus ... In this chapter, corpus linguistics, the historical backdrop of corpus, types of corpora, corpora in language teaching, and the corpus approach have been clarified, and then corpus-based vocabulary

Ngày tải lên: 12/08/2022, 23:01

87 7 0
Nghiên cứu phương pháp thu thập tập dữ liệu song song (parallel corpus)việt anh, việt pháp từ các nguồn tài liệu đa ngữ

Nghiên cứu phương pháp thu thập tập dữ liệu song song (parallel corpus)việt anh, việt pháp từ các nguồn tài liệu đa ngữ

... NGỮ LIỆU 1.2.1 Kho ngữ liệu (Corpus) 1.2.2 Kho ngữ liệu ña ngữ (Multilingual Corpora) 1.2.3 Kho ngữ liệu so sánh (Comparable Corpus) 1.2.4 Kho ngữ liệu song song (Parallel Corpus) Kho ngữ liệu ... ñược kiểm soát, chất lượng dịch của tài liệu trên Internet là rất khác nhau, và không phải tài liệu nào cũng ñược dịch chuẩn Hơn nữa, có nhiều tài liệu (ví dụ tài liệu tin tức song ngữ Việt – Anh, ... ngữ liệu song song: Cho phép con người duyệt qua các cặp câu song song và chỉnh sửa, thêm bớt nếu cần 2.5.1 Liên kết ở mức tài liệu: Phát hiện các cặp tài liệu song ngữ 2.5.1.1 Tải tài liệu

Ngày tải lên: 30/12/2013, 14:21

12 525 0
Tài liệu Báo cáo khoa học: "Creating a manually error-tagged and shallow-parsed learner corpus" pptx

Tài liệu Báo cáo khoa học: "Creating a manually error-tagged and shallow-parsed learner corpus" pptx

... Cambridge Learner Corpus Yes No 30 million No ICLE Corpus (Granger et al., 2009) No No 3.7 million+ Yes JEFLL Corpus (Tono, 2000) No No 1 million Partially Longman Learners’ Corpus No No 10 million ... obtained by transformation. parsed corpus as a test corpus and the other man-ually POS-tagged corpus created in the pilot study described in Subsect 3.2.1 as a training corpus We used POS-based and ... learner corpus that we created In Sect 2, we discuss the difficulties inherent in learner corpus creation Con-sidering the difficulties, in Sect 3, we describe our method for learner corpus creation,

Ngày tải lên: 20/02/2014, 04:20

10 471 0
Tài liệu Onomatopoeia in Spoken and Written English: Corpus- and Usage-based Analysis pot

Tài liệu Onomatopoeia in Spoken and Written English: Corpus- and Usage-based Analysis pot

... Spontaneous oration /Prepared but unscripted oration) The other corpus was the Lancaster-Oslo/BergenCorpus of British English (the LOB Corpus) This corpus contains a total of 1,000,000 Trang 12words from ... Onomatopoeia in Spoken and Written English : Corpus- and Usage-based Analysis Author(s) Sugahara, Takashi Trang 2Onomatopoeia in Spoken and Written English:Corpus- and Usage-based Analysis (英語の話し言葉・書き言葉におけるオノマトペ:コーパスと用法に基づく分析) ... list of the 14 mostfrequent and highly onomatopoeic words in the spoken corpus examined and a list of 13words in the written corpus Trang 16Both Most Frequent and Most Onomatopoeic Words in LLC

Ngày tải lên: 24/02/2014, 18:20

219 704 1
The Proposition Bank: An Annotated Corpus of Semantic Roles pdf

The Proposition Bank: An Annotated Corpus of Semantic Roles pdf

... Trang 1Corpus of Semantic Rolesin that it covers every instance of every verb in the corpus and allows representative statistics to be calculated ... the frequency of syntactic/semantic alternations in the corpus We describe anautomatic system for semantic role tagging trained on the corpus and discuss the effect on itsperformance of various ... sentences from the corpus areincluded in the frames file, in the same format as the examples above In many cases aparticular realization will not be attested within the Penn Treebank corpus; in thesecases,

Ngày tải lên: 06/03/2014, 10:20

36 272 0
Báo cáo khoa học: "The Human Language Project: Building a Universal Corpus of the World’s Languages" pptx

Báo cáo khoa học: "The Human Language Project: Building a Universal Corpus of the World’s Languages" pptx

... we would like a complete digitization of every human language: a Universal Corpus If we are ever to construct such a corpus, it must be now With the current rate of language loss, we have only ... as possible on community members’ ability to obtain and en-hance the corpus, and redistribute derivative data Utility The corpus aims to be maximally use-ful, and minimally parochial Annotation ... resources be integrated with—if not de-rived from—primary data in the corpus 2.3 What to include What should be included in the corpus? To some extent, data collection will be opportunistic, but

Ngày tải lên: 16/03/2014, 23:20

10 579 0
Báo cáo khoa học: "Morphological Analysis of a Large Spontaneous Speech Corpus in Japanese" pptx

Báo cáo khoa học: "Morphological Analysis of a Large Spontaneous Speech Corpus in Japanese" pptx

... category depend on a particular corpus, and the defi-nitions from corpus to corpus differ word by word Therefore, we need to put only words extracted from the same corpus into a dictio-nary We are ... words that comprise the corpus We also show that better accuracy is achieved by using both methods than by using only the first 1 Introduction The “Spontaneous Speech: Corpus and Process-ing Technology” ... Technology” project is sponsorProcess-ing the construc-tion of a large spontaneous Japanese speech corpus, Corpus of Spontaneous Japanese (CSJ) (Maekawa logues and dialogues, the majority being mono-logues

Ngày tải lên: 17/03/2014, 06:20

10 402 0
The Economic Impact of Rail Improvements to the Port of Corpus Christi, Texas pptx

The Economic Impact of Rail Improvements to the Port of Corpus Christi, Texas pptx

... exists a need to expand portions of the Port of Corpus Christi both in terms of capacity and efficiency In order to adequately expand the Port of Corpus Christi to handle anticipated near to mid-term ... international trade continues to increase, port capacity naturally becomes constrained The Port of Corpus Christi is no exception to this general trend To capture a portion of the need to move additional ... Trang 1The Economic Impact of Rail Improvements to the Port of Corpus Christi, Texas Prepared For: Prepared By: October 17, 2011 Trang 2Table of Contents

Ngày tải lên: 23/03/2014, 21:20

15 389 0
Information Structure in written English - a corpus study - docx

Information Structure in written English - a corpus study - docx

... Topic & Focus for Czech n Use a parallel corpus to transfer Topic & Focus to English, through word alignment (in order to create an English corpus) p Trial 2: Investigation of English ... Trang 1Information Structure in written English a corpus study -Oana Postolache oana@coli.uni-saarland.de Trang 2p Division of the sentence in two ... the sentence: p preposing, left-dislocation, postposing, right-dislocation and inversion n Their corpus consists in several thousands naturally occurring sentences collected over approx 10 years

Ngày tải lên: 24/03/2014, 19:20

44 419 0
Corpus Use and Translating doc

Corpus Use and Translating doc

... corpora and corpus analysis have to offer The emphasis has been on corpus use and learning to translate (as opposed to learning corpus use to translate) Furthermore, corpus resources ... learning corpus use to translate,... between corpora as documentation tools and corpora as a source of materials for the translation classroom, and that between corpus- based and corpus- ... framework, four kinds of corpus- related tasks are presented and illustrated: cloze... based on a bilingual corpus, multiple choice exercises based on a learner corpus, translation of short

Ngày tải lên: 27/06/2014, 07:20

165 365 0
a corpus-based analysis of the collocates of the word  homeland  in the 1990s, 2000s and 2010s = nghiên cứu đồng định vị của từ  homeland  qua các thập niên 1990, 2000 và 2010 trên cơ sở ngôn ngữ học khối liệu

a corpus-based analysis of the collocates of the word homeland in the 1990s, 2000s and 2010s = nghiên cứu đồng định vị của từ homeland qua các thập niên 1990, 2000 và 2010 trên cơ sở ngôn ngữ học khối liệu

... corpora, namely COCA (Corpus of Contemporary American English at: americancorpus.org) and Time Magazine Corpus at: corpus.byu.edu/time) The followings are the descriptions of each corpus: The COCA ... collection”: the design of the corpus must be principled The texts in the corpus need to represent the type of language that the corpus is intending to capture For example, if a corpus is to be representative ... of corpus linguistics Corpus-based techniques have been employed in many studies which have attempted to investigate the differences in language use Pearce (2008) carried out a study using corpus-based

Ngày tải lên: 02/03/2015, 14:17

40 435 0
a corpus-based study on collocations of keywords in english business articles about the european debt crisis = nghiên cứu tập hợp cụm từ của các từ khóa trong các bài báo kinh tế tiếng anh

a corpus-based study on collocations of keywords in english business articles about the european debt crisis = nghiên cứu tập hợp cụm từ của các từ khóa trong các bài báo kinh tế tiếng anh

... of CRISIS from the corpus Figure 3: String matching of DEBT from the corpus Figure 4: String matching of ECONOMIC from the corpus Figure 5: String matching of MARKETS from the corpus Trang 8 CHAPTER ... patterns of CRISIS in the corpus Table 17: Other patterns of DEBT in the corpus Table 18: ECONOMIC Concordance (Noun collocations) Table 19: Nouns collocating with ECONOMIC in the corpus Table 20: Composite ... for the goals of their research Conducting a corpus analysis is the very fundamental technique used by CL Corpus analysis is a means of accessing a corpus of text to show how any given word or

Ngày tải lên: 02/03/2015, 14:18

153 759 0
Towards a framework for building an annotated named entities corpus

Towards a framework for building an annotated named entities corpus

... NER corpus will useful fordeveloping automatically NER researches.2.3 Researches about building corpus Process Many building corpus research are published, and many corpus is created: POScorpus, ... many corpus is created: POScorpus, TreeBank corpus, event newer corpus: Parallel language corpus, Opinioncorpus, etc For example: • Towards the national corpus of Polish research (Adam Przepiorkowski ... world to builtNLP corpus in general and NER corpus in particular So that we localize mydirectly study • Chapter three: Building corpus process: Describe a process build ageneral corpus Then, we

Ngày tải lên: 25/03/2015, 10:23

78 640 0
A Corpus-based Study on Collocations of Keywords in English Business Articles on the European Debt Crisis

A Corpus-based Study on Collocations of Keywords in English Business Articles on the European Debt Crisis

... Construction of Corpus Since the study is primarily a corpus-based analysis of collocations, its findings come from a linguistic analysis of a substantial number of written articles The corpus of ... from the high-frequency word list of the corpus Table 3: First 25keywords from the corpus Trang 13N Word Freq % N Word Freq % The top 25 keywords from the corpus, as shown in Table 3, are perhaps ... from a corpus of a certain number of business articles written about the European debt crisis Trang 3To be specific, it identifies words with high frequency of occurrence within the chosen corpus

Ngày tải lên: 10/08/2015, 19:46

30 472 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 1

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 1

... newspaper corpus via the use of a new proposed framework for metaphor identification and analysis Trang 41.2 Research Objectives The methodology adopted in this thesis amalgamates the corpus linguistic ... broadsheet, the corpus sample is of sufficient size, breadth and synchronic range to identify metaphors and conceptual metaphors used throughout the duration of the disaster Hence, the corpus collection ... linguistic and cognitive phenomenon” (Hardie et al., 2007, p.2) This thesis aims to extend existing corpus methodology in the examination of conceptual metaphors in large datasets through the use

Ngày tải lên: 10/09/2015, 09:22

20 243 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 2

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 2

... metaphor finding must be gauged empirically against a reference corpus This is because it is important to make use of a reference corpus to make a point on the significance of the findings in the ... extrapolation of conceptual metaphors from authentic linguistic data This thesis aims to extend existing corpus methodology in the identification and analysis of conceptual metaphors in large datasets ... attempts to Trang 4simply means that the representation of the conceptual metaphors mined from the corpus will include a socio-cultural dimension in addition to the usual linguistic and conceptual

Ngày tải lên: 10/09/2015, 09:22

48 395 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 3 1

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 3 1

... domains pertaining to the BP Oil Spill for the New York Times Corpus (henceforth NYT-Corpus) Fig.3.3. USAS Semantic Tagset for NYT-Corpus (Top-30 semantic domains) These semantic domains will ... bottom-up corpus-based study of conceptual metaphors from empirical linguistic evidence The key departure from Kovecses’ method is that the data used for analysis in this research is based on corpus ... existence of a large, representative and “monothematic” corpus dealing with target domains (e.g Trang 9ECONOMICS, SPORTS, POLITICS) In this thesis, the corpus essentially focuses on the target domains

Ngày tải lên: 10/09/2015, 09:22

49 289 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 4

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 4

... significant metaphors in the NYT-Corpus and the WP-Corpus Table 4.1 provides an aggregate view of the issue metaphorically embodied by the NYT-corpus and the WP-corpus All the conceptual metaphors ... NYT-corpus metaphor tokens This contrasts with the 22.8% of the range of metaphor tokens found in the corresponding WP-Corpus A good example of the associated evaluative prosody in the NYT-corpus ... foregrounded by the NYT-corpus in the framing of this issue The seemingly overtly partisan stance adopted by the NYT-corpus is effectively encapsulated in Text 4.5: Text 4.5 NYT-Corpus – Screenshot

Ngày tải lên: 10/09/2015, 09:22

60 226 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 5

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 5

... metaphor categories in the G-Corpus and the TT-Corpus Table 5.1 provides an aggregate view of the BP oil spill metaphorically embodied by the G-corpus and the TT-corpus All the conceptual metaphors ... Type Distribution in the G-Corpus G-Corpus (202 Metaphorical TYPES out of 188,788 words) Table 5.3 – An overview of the Metaphor Type Distribution in the TT-Corpus TT-Corpus (147 Metaphorical ... “WAR/CRIME/CONFLICT” conceptual key in the TT-corpus when compared to the G-corpus A closer examination of this conceptual key shows that the TT-corpus seems to adopt a more nationalistic and

Ngày tải lên: 10/09/2015, 09:22

63 393 0
The gulf of mexico oil spill   a corpus based study of metaphors in british and american media discourse 6 1

The gulf of mexico oil spill a corpus based study of metaphors in british and american media discourse 6 1

... mined from the IICM, it is clear that the corpora from the conservative broadsheets (WP-Corpus and the TT-corpus) generate a significant proportion of conceptual metaphors that focus on the mitigation ... specific nature of the embodiment within the WAR/CRIME/THREAT conceptual key in the WP-corpus (53.6%) and TT-corpus (68.1%), conceptualising BUSINESS as a WAR/ GAME OF SURVIVAL in an effort to justify ... survival On the other hand, the metaphors mined from the more liberal broadsheets (NYT-Corpus and the G-Corpus) focus on the scale of the disaster and emphasise the range of negative emotions

Ngày tải lên: 10/09/2015, 09:22

35 270 0
w