Online Public Access Catalogue (OPAC)
Library,Documentation and Information Science Division

“A research journal serves that narrow

borderland which separates the known from the unknown”

-P.C.Mahalanobis


Normal view MARC view ISBD view

Corpus linguistics and statistics with R : introduction to quantitative methods in linguistics / Guillaume Desagulier.

By: Desagulier, Guillaume [author].
Material type: TextTextSeries: Quantitative methods in the humanities and social sciences.Publisher: Cham : Springer, 2017Description: xiii, 353 pages : illustrations (some color) ; 27 cm.ISBN: 9783319645704.Subject(s): Corpora (Linguistics) | R (Computer program language)DDC classification: 410.188
Contents:
1. Introduction.- 2. R Fundamentals.- 3. Digital Corpora.- 4. Processing and Manipulating Character Strings.- 5. Applied Character String Processing.- 6. Summary Graphics for Frequency Data.- 7. Descriptive Statistics.- 8. Notions of Statistical Testing.- 9. Association and Productivity.- 10. Clustering Methods.
Summary: This book examines empirical linguistics from a theoretical linguist?s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.
Tags from this library: No tags from this library for this title. Log in to add tags.
Item type Current location Call number Status Date due Barcode Item holds
Books Books ISI Library, Kolkata
 
410.188 D441 (Browse shelf) Available 138414
Total holds: 0

Includes bibliographical references and index.

1. Introduction.-
2. R Fundamentals.-
3. Digital Corpora.-
4. Processing and Manipulating Character Strings.-
5. Applied Character String Processing.-
6. Summary Graphics for Frequency Data.-
7. Descriptive Statistics.-
8. Notions of Statistical Testing.-
9. Association and Productivity.-
10. Clustering Methods.

This book examines empirical linguistics from a theoretical linguist?s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.

There are no comments for this item.

Log in to your account to post a comment.
Library, Documentation and Information Science Division, Indian Statistical Institute, 203 B T Road, Kolkata 700108, INDIA
Phone no. 91-33-2575 2100, Fax no. 91-33-2578 1412, ksatpathy@isical.ac.in


Visitor Counter