ISBN:
9783642453588
Language:
English
Pages:
Online-Ressource (XXIV, 459 p. 61 illus., 23 illus. in color, online resource)
Series Statement:
Theory and Applications of Natural Language Processing
Series Statement:
SpringerLink
Series Statement:
Bücher
Parallel Title:
Druckausg. Natural language processing of semitic languages
Keywords:
Computer science
;
Computer Science
;
Translators (Computer programs)
;
Computational linguistics
;
Computer science
;
Translators (Computer programs)
;
Computational linguistics
;
Aufsatzsammlung
;
Semitische Sprachen
;
Natürliche Sprache
Abstract:
Part I Natural Language Processing Core-Technologies -- 1.Linguistic Introduction: The Orthography, Morphology and Syntax of Semitic Languages. R.Fabri, M.Gasser, N. Habash, G. Kiraz and S.Wintner -- 2.Morphological Processing of Semitic Languages. S.Wintner -- 3.Syntax and Parsing of Semitic Languages. R. Tsarfaty -- 4.Semantic Processing of Semitic Languages. M. Diab and Y.Marton -- 5.Language Modeling. I. Heintz -- Part II Natural Language Processing Applications -- 6.Statistical Machine Translation. H. Hassan and K.Darwish -- 7.Named Entity Recognition. B.Mohit -- 8.Anaphora Resolution. K.M. Seddik and A. Farghaly -- 9.Relation Extraction. V. Castelli and I. Zitouni -- 10.Information Retrieval. K. Darwish -- 11.Question Answering. Y. Benajiba, P. Rosso, L. Abouenour, O. Trigui, K.Bouzoubaa and L.H. Belguith -- 12.Automatic Summarization -- L.H. Belguith, M. Ellouze, M.H. Maaloul, M. Jaoua, F. Kallel Jaoua and P. Blache -- 13.Automatic Speech Recognition. H. Soltau, G. Saon, L. Mangu, H-K.Kuo, B.Kingsbury, S. Chu and F. Biadsy
Abstract:
Research in Natural Language Processing (NLP) has rapidly advanced in recent years, resulting in exciting algorithms for sophisticated processing of text and speech in various languages. Much of this work focuses on English; in this book we address another group of interesting and challenging languages for NLP research: the Semitic languages. The Semitic group of languages includes Arabic (206 million native speakers), Amharic (27 million), Hebrew (7 million), Tigrinya (6.7 million), Syriac (1 million) and Maltese (419 thousand). Semitic languages exhibit unique morphological processes, challenging syntactic constructions, and various other phenomena that are less prevalent in other natural languages. These challenges call for unique solutions, many of which are described in this book. The 13 chapters presented in this book bring together leading scientists from several universities and research institutes worldwide. While this book devotes some attention to cutting-edge algorithms and techniques, its primary purpose is a thorough explication of best practices in the field. Furthermore, every chapter describes how the techniques discussed apply to Semitic languages. The book covers both statistical approaches to NLP, which are dominant across various applications nowadays, and the more traditional, rule-based approaches, that were proven useful for several other application domains. We hope that this book will provide a "one-stop-shop'' for all the requisite background and practical advice when building NLP applications for Semitic languages
Description / Table of Contents:
Preface; Acknowledgments; Technical Review Committee; Contents; About the Editor; Part I Natural Language Processing Core-Technologies; Chapter1 Linguistic Introduction: The Orthography, Morphology and Syntax of Semitic Languages; 1.1 Introduction; 1.2 Amharic; 1.2.1 Orthography; 1.2.2 Derivational Morphology; Lexicon; Root and Pattern Processes; Other Derivational Processes; 1.2.3 Inflectional Morphology; Verbs; Nominals; 1.2.4 Basic Syntactic Structure; Noun Phrases; Clauses; 1.3 Arabic; 1.3.1 Orthography; Arabic Script; Arabic Spelling; 1.3.2 Morphology; Templatic Morphology
Description / Table of Contents:
Concatenative MorphologyDerivational Morphology; Inflectional Morphology; Form-Function Independence; Dialectal Arabic Morphology; Morphological Ambiguity; 1.3.3 Basic Syntactic Structure; Morphology and Syntax; Sentence Structure; Nominal Phrase Structure; Relative Clauses; Arabic Dialect Syntax; 1.4 Hebrew; 1.4.1 Orthography; 1.4.2 Derivational Morphology; Root and Pattern Processes; Other Derivational Processes; 1.4.3 Inflectional Morphology; Verbs; Nominals; Other Closed-Class Items; 1.4.4 Morphological Ambiguity; 1.4.5 Basic Syntactic Structure; 1.5 Maltese; 1.5.1 Orthography
Description / Table of Contents:
1.5.2 Derivational MorphologyMixed Root-Based and Stem-Based Morphology; 1.5.3 Inflectional Morphology; Verbs; Nominals; Other Closed Class Items; 1.5.4 Basic Syntactic Structure; 1.6 Syriac; 1.6.1 Orthography; 1.6.2 Derivational Morphology; 1.6.3 Inflectional Morphology; 1.6.4 Syntax; 1.7 Contrastive Analysis; 1.7.1 Orthography; 1.7.2 Phonology; 1.7.3 Morphology; 1.7.4 Syntax; 1.7.5 Lexicon; 1.8 Conclusion; References; Chapter2 Morphological Processing of Semitic Languages; 2.1 Introduction; 2.2 Basic Notions; 2.3 The Challenges of Morphological Processing
Description / Table of Contents:
2.4 Computational Approaches to Morphology2.4.1 Two-Level Morphology; 2.4.2 Multi-tape Automata; 2.4.3 The Xerox Approach; 2.4.4 Registered Automata; 2.4.5 Analysis by Generation; 2.4.6 Functional Morphology; 2.5 Morphological Analysis and Generation of Semitic Languages; 2.5.1 Amharic; 2.5.2 Arabic; 2.5.3 Hebrew; 2.5.4 Other Languages; 2.5.5 Related Applications; 2.6 Morphological Disambiguation of Semitic Languages; 2.7 Future Directions; References; Chapter3 Syntax and Parsing of Semitic Languages; 3.1 Introduction; 3.1.1 Parsing Systems; Syntactic Analysis; Models and Algorithms
Description / Table of Contents:
3.1.2 Semitic LanguagesScript and Orthography; Morphology; Syntax; 3.1.3 The Main Challenges; The Architectural Challenge; The Modeling Challenge; The Lexical Challenge; 3.1.4 Summary and Conclusion; 3.2 Case Study: Generative Probabilistic Parsing; 3.2.1 Formal Preliminaries; Probabilistic Grammars; Training; Decoding; Evaluation; 3.2.2 An Architecture for Parsing Semitic Languages; Preliminaries; Joint Probabilistic Modeling; Lattice-Based Decoding; Evaluation; Summary and Conclusion; 3.2.3 The Syntactic Model; PCFG Refinements; Constrained Parsing; Discriminative Approaches
Description / Table of Contents:
3.2.4 The Lexical Model
Note:
Description based upon print version of record
DOI:
10.1007/978-3-642-45358-8
Permalink