Online Public Access Catalogue (OPAC)
Library, Documentation and Information Science Division




Automated data collection with R : a practical guide to Web scraping and text mining / Simon Munzert...[et al.].

Material type: Text
Publication details: Chichester : John Wiley, c2015.
Description: xxii, 452 p. : illustrations ; 25 cm
ISBN:
  • 9781118834817 (hardback)
DDC classification:
  • 006.312 23 M971
Contents:
Machine generated contents note: Dedication -- Table of Contents -- List of Figures -- List of Tables -- Preface -- 1 Introduction -- 2 HTML -- 3 XML and JSON -- 4 XPath -- 5 HTTP -- 6 AJAX -- 7 SQL and Relational Databases -- 8 Regular Expressions and String Functions -- 9 Scraping the Web -- 10 Statistical Text Processing -- 11 Managing Data Projects -- 12 Collaboration Networks in the U.S. Senate -- 13 Parsing Information from Semi-Structured Documents -- 14 Predicting the 2014 Academy Awards using Twitter -- 15 Mapping the Geographic Distribution of Names -- 16 Gathering Data on Mobile Phones -- 17 Analyzing Sentiments of Product Reviews -- References -- General Index -- Package Index -- Function Index.
Summary: This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences.

Includes bibliographical references and indexes.



Library, Documentation and Information Science Division, Indian Statistical Institute, 203 B T Road, Kolkata 700108, INDIA
Phone no. 91-33-2575 2100, Fax no. 91-33-2578 1412, ksatpathy@isical.ac.in