Jump to ratings and reviews
Rate this book

Foundations of Statistical Natural Language Processing

Rate this book
Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.

679 pages, Hardcover

First published June 18, 1999

Loading interface...
Loading interface...

About the author

Christopher D. Manning

6 books9 followers
Professor of Linguistics and Computer Science, Natural Language Processing Group, Stanford University

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
93 (37%)
4 stars
108 (43%)
3 stars
42 (16%)
2 stars
6 (2%)
1 star
1 (<1%)
Displaying 1 - 17 of 17 reviews
3 reviews2 followers
December 7, 2011
A must read for anyone looking to get into NLP. Teaches from first principles, including briefly touching on information theory/entropy. I felt it was well grounded, and proceded at a good pace. No prior knowledge is required.

I picked this up at the same time as "Speech and Language Processing" (Jurafsky & Martin) and while Foundations is a much narrower book (making up with depth), I think it's for the better, as I found SLP far too broad and thin.
Profile Image for Ushan.
801 reviews65 followers
December 29, 2010
As the great American anthropologist-linguist Edward Sapir put it, all grammars leak. Some sentences are obviously grammatical, some are obviously ungrammatical, but there are gray areas; native speakers of English disagree on whether sentences such as "Who did Jo think said John saw him?" and "The boys read Mary's stories about each other" are grammatical. A way of resolving this difficulty is to look at a large corpus of texts; sentence structures that occur there often are grammatical, sentence structures that never occur are ungrammatical, and those that occur rarely are in a gray area. We will also need to assign a nonzero probability to sentence structures that we have never seen before, higher if they resembe ones that we've seen before than if they don't. Before Noam Chomsky invented them in 1957, neither "Colorless green ideas sleep furiously" nor "Furiously sleep ideas green colorless" ever occurred in an English text, but sentences like the former occurred much more frequently than sentences like the latter. This book discusses various algorithms used in corpus-based linguistics: parsing text, aligning text in two languages, deciding on the meaning of ambiguous words such as "plant" (a living organism from the kingdom Plantae, or a factory) and "interest" (curiosity, or share in a company). These algorithms do not always work correctly, but they work well enough to be used in the real world.
Profile Image for Emmi.
120 reviews
December 14, 2017
Explanation on basic idea on NLP is very good, but only this book is not enough to get entire idea on NLP. Better to read "Speech and Language Processing" as well (By Dan Jurafsky, James H. Martin ).
Profile Image for Terran M.
78 reviews88 followers
May 19, 2018
A classic on natural language processing. If you know nothing about natural language processing, or have a piecemeal understanding, this book will give you an overview of the field in a rigorous and yet comprehensible way.

Note that this book was written in 1999, so it far predates the current practice to use recursive neural networks for natural language. This book will give you exactly what it says in the title, Foundations, not “modern best practices.”

You may also be interested in Introduction to Information Retrieval by the same authors
October 21, 2020
This book is an exceptional introduction into the world of statistical methods for NLP tasks. The math is fairly accessible and it continues to be my main resource for reference in this field.
Profile Image for Farzam.
12 reviews6 followers
May 26, 2021
A little bit outdated and also tough to read, but I liked it. I only needed to cover a few chapters, however I think some more practical examples and coding snippets would be extremely helpful
Profile Image for Douglas Summers-Stay.
Author 2 books37 followers
June 14, 2015
This 1999 book does a good job of explaining the different areas of statistical NLP. It was easy to read and very clear, even the formula-heavy sections. The sections on collocations (multi-word phrases) and verb subcategorization were largely new to me.
The problems that natural-language research has faced are similar to the ones computer vision faces, but easier. What that means is that the researchers have made a lot more progress in the higher-level organization of concepts instead of getting stuck at the level of simple features and recognizing objects like computer vision has been.
Profile Image for David.
Author 17 books333 followers
December 4, 2011
This and Speech and Language Processing by Jurafsky and Martin are the two big introductory texts in natural language processing. I prefer the Jurafsky book; it goes into more detail, has more examples, and is written more for use as a class text. The Manning and Schutze book is much more mathematically oriented and goes into more detail on algorithms, so if you're focusing on the statistical aspect more than the language aspect, refer to this book. Ideally, you probably want both.
May 10, 2012
Needs more walk-through integrated examples, not just simple illustrations for specific paragraphs.

It could also benefit from a discussion of NLP software and possible architectures for the domain.
Profile Image for Brian.
11 reviews
Currently reading
April 19, 2011
Currently it's a bit tough for me to read because I haven't formally studied logic or statistics... but a well written book
Profile Image for Shubhajoy Das.
1 review1 follower
April 13, 2013
this was a good book loved the exercisesvery few nlp books have such good exercises
Profile Image for Vít Baisa.
59 reviews1 follower
January 1, 2015
Sometimes it felt a bit out-dated but the explanations of various algorithms and principles was very good and understanable.
Profile Image for Reuben.
20 reviews1 follower
June 23, 2009
a little too cynical for my taste...but well written
Displaying 1 - 17 of 17 reviews

Can't find what you're looking for?

Get help and learn more about the design.