Get A Resource-Light Approach to Morpho-Syntactic Tagging PDF

By Anna Feldman

ISBN-10: 9042027681

ISBN-13: 9789042027688

Whereas supervised corpus-based methods are highly accurate for various NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can in principle be applied to any inflected language, as long as an annotated corpus of a related language is available. The time needed to adjust the system to a new language is a fraction of the time needed for systems with extensive, manually created resources: days rather than years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas, as well as in cross-lingual studies and applications, will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.

Best study & teaching books

Teaching and learning mathematical problem solving: multiple by Edward A. Silver PDF

A provocative collection of papers containing comprehensive reviews of previous research, teaching techniques, and pointers for the direction of future study. It provides both a comprehensive assessment of the latest research on mathematical problem solving, with special emphasis on its teaching, and an attempt to increase communication across the active disciplines in this area.

New PDF release: Modelling and Applications in Mathematics Education: The

The book aims to show the state of the art in the field of modeling and applications in mathematics education. This is the first volume to do so. The book deals with the question of how key competencies of applications and modeling, at the heart of mathematical literacy, may be developed, and with the roles that applications and modeling may play in mathematics teaching, making mathematics more relevant for students.

Download e-book for kindle: International Dialogues about Visual Culture, Education and by Teresa Eça, Rachel Mason

Although art is taught around the world, art education policies and practices vary widely, and the opportunities for teachers to exchange information are few. International Dialogues about Visual Culture, Education and Art brings together diverse perspectives on teaching art to forge a comprehensive understanding of the challenges facing art educators in every country.

Help with Idioms (Heinemann English Language Practice) by Anton Rush, Jane Applebee PDF

Aimed at upper-intermediate and advanced-level learners, this is one of a series which offers ELT students extra help and practice in areas of the language which they find particularly difficult. The book is suitable for both classroom and self-study use, and contains 150 idiom entries, divided into sections: historical, new, international and humorous, proverbs, metaphors and similes, and slang.

Additional resources for A Resource-Light Approach to Morpho-Syntactic Tagging

Example text

So, the determining context for deciding on a tag is the space of the previous n tags (n=2, in the case of a second order Markov model). The methods differ, however, in the way the transition probability $p(t_n \mid t_{n-2}\, t_{n-1})$ is estimated. N-gram taggers often estimate the probability using the maximum likelihood principle, as mentioned above. Unlike those approaches, TreeTagger constructs a binary-branching decision tree. The binary tree is built recursively from a training set of trigrams. The nodes of the tree correspond to questions (or tests) about the previous one or two tags.
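To make the maximum likelihood estimate concrete, here is a minimal sketch of how a trigram tagger might estimate the probability of a tag given the two preceding tags by relative frequency. The function name `trigram_mle`, the `<s>` padding symbol, and the toy tag sequences are assumptions made for illustration; they are not taken from the book, and the sketch does not implement the TreeTagger decision tree.

```python
from collections import Counter

def trigram_mle(tag_sequences):
    """Build a relative-frequency estimate of p(tag | prev2, prev1).

    Hypothetical helper for illustration only.
    """
    trigram_counts = Counter()
    context_counts = Counter()
    for tags in tag_sequences:
        # Pad with two start symbols so the first tags also have a context.
        padded = ["<s>", "<s>"] + list(tags)
        for i in range(2, len(padded)):
            context = (padded[i - 2], padded[i - 1])
            trigram_counts[context + (padded[i],)] += 1
            context_counts[context] += 1

    def prob(tag, prev2, prev1):
        context = (prev2, prev1)
        if context_counts[context] == 0:
            # Unseen context: plain MLE has no estimate, so return 0.0 here.
            return 0.0
        return trigram_counts[(prev2, prev1, tag)] / context_counts[context]

    return prob

# Toy training data (invented tag sequences, not from any real corpus).
corpus = [
    ["DET", "NOUN", "VERB"],
    ["DET", "NOUN", "NOUN"],
    ["DET", "ADJ", "NOUN"],
]
p = trigram_mle(corpus)
print(p("VERB", "DET", "NOUN"))  # 0.5: the context DET NOUN occurs twice, once followed by VERB
```

The zero returned for unseen contexts shows the sparse-data weakness of plain relative-frequency estimates, which is one reason taggers look for other ways to estimate the same probability.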

Can’ can be an auxiliary, a noun, and a verb). Still, many of these ambiguous tokens are easy to disambiguate, since the various tags associated with a word are not equally likely. In contrast, languages with rich morphologies are more challenging. Most Russian nouns, for instance, have singular and plural forms in all six cases (nominative, accusative, genitive, dative, locative, and instrumental). Most adjectives (at least potentially) form all three genders (masculine, feminine and neuter), both numbers (singular and plural), all six cases, and all three degrees of comparison, and can have either positive or negative polarity.
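A quick way to see how fast these categories multiply is to enumerate their combinations. The sketch below assumes every category value can combine freely with every other, which gives an upper bound rather than the size of any real tagset; the degree labels and the variable names are illustrative assumptions, not the book's tag inventory.

```python
from itertools import product

# Category values as listed in the passage; degree labels are assumed.
CASES = ["nominative", "accusative", "genitive", "dative", "locative", "instrumental"]
NUMBERS = ["singular", "plural"]
GENDERS = ["masculine", "feminine", "neuter"]
DEGREES = ["positive", "comparative", "superlative"]
POLARITIES = ["positive", "negative"]

# Nouns: number x case.
noun_forms = list(product(NUMBERS, CASES))

# Adjectives: gender x number x case x degree x polarity (potential combinations).
adjective_forms = list(product(GENDERS, NUMBERS, CASES, DEGREES, POLARITIES))

print(len(noun_forms))       # 12
print(len(adjective_forms))  # 216
```

An actual tagset may merge or exclude some of these combinations, so the counts should be read as potential feature bundles, not attested tags.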

3). ). The analyzer is based on a lexicon containing about 228K lemmata and it can analyze about 20M word forms. 25% on the full tag. 2 Other experiments Finally, some experiments combine the exponential model described above with various other learning algorithms to improve tagging results. Hajič et al. (2001) describe a hybrid system (applied to Czech) which combines the strength of manual rule-writing and statistical learning, obtaining results superior to both methods if applied separately.
