+612 9045 4394
Survey of Text Mining II : Clustering, Classification, and Retrieval - Michael W. Berry

Survey of Text Mining II

Clustering, Classification, and Retrieval

By: Michael W. Berry (Editor), Malu Castellanos (Editor)


Published: 1st November 2007
Ships: 7 to 10 business days
7 to 10 business days
RRP $341.99
or 4 easy payments of $59.19 with Learn more
if ordered within

Other Available Formats (Hide)

  • Paperback View Product Published: 13th October 2010

This Second Edition brings readers thoroughly up to date with the emerging field of text mining, the application of techniques of machine learning in conjunction with natural language processing, information extraction, and algebraic/mathematical approaches to computational information retrieval. The book explores a broad range of issues, ranging from the development of new learning approaches to the parallelization of existing algorithms. Authors highlight open research questions in document categorization, clustering, and trend detection. In addition, the book describes new application problems in areas such as email surveillance and anomaly detection.

Prefacep. v
Contributorsp. ix
Cluster-Preserving Dimension Reduction Methods for Document Classificationp. 3
Automatic Discovery of Similar Wordsp. 25
Principal Direction Divisive Partitioning with Kernels and k-Means Steeringp. 45
Hybrid Clustering with Divergencesp. 65
Text Clustering with Local Semantic Kernelsp. 87
Document Retrieval and Representation
Vector Space Models for Search and Cluster Miningp. 109
Applications of Semidefinite Programming in XML Document Classificationp. 129
Email Surveillance and Filtering
Discussion Tracking in Enron Email Using PARAFACp. 147
Spam Filtering Based on Latent Semantic Indexingp. 165
Anomaly Detection
A Probabilistic Model for Fast and Confident Categorization of Textual Documentsp. 187
Anomaly Detection Using Nonnegative Matrix Factorizationp. 203
Document Representation and Quality of Text: An Analysisp. 219
SIAM Text Mining Competition 2007p. 233
Indexp. 237
Table of Contents provided by Ingram. All Rights Reserved.

ISBN: 9781848000452
ISBN-10: 1848000456
Audience: Tertiary; University or College
Format: Hardcover
Language: English
Number Of Pages: 240
Published: 1st November 2007
Publisher: Springer London Ltd
Country of Publication: GB
Dimensions (cm): 23.5 x 15.5  x 1.91
Weight (kg): 1.2
Edition Type: Revised