Category: Classification

Fundamentals of Predictive Text Mining

By Volkan TUNALI, September 4, 2010 11:09 pm

Fundamentals of Predictive Text Mining is a new book on text mining that I found recently. It explains the essentials of text mining very well, with good examples, so I strongly recommend it to newcomers to the field. Although the book's focus is predictive text mining, its content is broad enough to cover topics such as text clustering, information retrieval, and information extraction.

The book also contains several case studies that work through solutions to real-life problems.

Authors: Sholom M. Weiss, Nitin Indurkhya, Tong Zhang
ISBN: 9781849962254
Publisher: Springer

Introduction to Information Retrieval

By Volkan TUNALI, September 1, 2010 11:36 pm

I would like to introduce a book I find very useful: Introduction to Information Retrieval by Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, published by Cambridge University Press (ISBN: 0521865719).

The book provides a modern approach to information retrieval from a computer science perspective. It is based on a course the authors have been teaching in various forms at Stanford University and at the University of Stuttgart.

It contains the following chapters:

  1. Boolean retrieval
  2. The term vocabulary & postings lists
  3. Dictionaries and tolerant retrieval
  4. Index construction
  5. Index compression
  6. Scoring, term weighting & the vector space model
  7. Computing scores in a complete search system
  8. Evaluation in information retrieval
  9. Relevance feedback & query expansion
  10. XML retrieval
  11. Probabilistic information retrieval
  12. Language models for information retrieval
  13. Text classification & Naive Bayes
  14. Vector space classification
  15. Support vector machines & machine learning on documents
  16. Flat clustering
  17. Hierarchical clustering
  18. Matrix decompositions & latent semantic indexing
  19. Web search basics
  20. Web crawling and indexes
  21. Link analysis

The book is freely available for download in PDF format at http://nlp.stanford.edu/IR-book/information-retrieval-book.html (as a whole or as individual chapters).
