Jump to ratings and reviews
Rate this book

Taming Text: How to Find, Organize, and Manipulate It

Rate this book
It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and for software engineers who want to make their text-based applications more useful and user-friendly. Whether building a search engine for a corporate website, automatically organizing email, or extracting important nuggets of information from the news, dealing with unstructured text can be daunting.

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. It explores how to automatically organize text, using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. This book gives examples illustrating each of these topics, as well as the foundations upon which they are built.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.

322 pages, Paperback

First published July 1, 2011

27 people are currently reading
259 people want to read

About the author

Grant S. Ingersoll

1 book1 follower

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
19 (16%)
4 stars
56 (50%)
3 stars
32 (28%)
2 stars
5 (4%)
1 star
0 (0%)
Displaying 1 - 7 of 7 reviews
Profile Image for Alex Ott.
Author 3 books207 followers
December 25, 2013
Good overview of different systems (Solr, OpenNLP & Mahout), approaches & algorithms for working with (unstructured) text - search, analyze, cluster, etc.
The book itself is completely practical with references to articles & books for people interested in more detailed/theoretical information.
111 reviews1 follower
July 28, 2024
Great overview of the state of the art about ten years ago. Still an interesting overview. _Very_ Java based.
Profile Image for Ashish.
31 reviews1 follower
June 10, 2014
This book is like a hands on guide to Text Analytics and Processing as it talks about the Open Source projects related to this topic. Other than that it is a very good introduction to basics of Text Analysis and how one can use Open Source Solutions like Lucene, Solr, OpenNLP etc to do the same.


The last chapter about Untamed text: the next frontier is very good. For someone interested in Text Processing and Analytics that last chapter is a very good read with a lots of ideas regarding what could be done next in this field.
35 reviews
October 10, 2016
This book was alright. Many of the theoretical bits are things I've already encountered. The practical bits weren't so relevant because they were too tightly coupled to specific Java libraries and other pieces of software. Nothing too earth shattering here, but might be just right for some readers.
Profile Image for Dgg32.
146 reviews6 followers
April 25, 2013
A nice introduction to the field natural language programming. May come in handy for some of my projects.
Profile Image for Akshay Ratan.
36 reviews2 followers
January 31, 2015
Good for understanding content analysis using efficient techniques. Apache Solr and Lucene explained with concepts and codes.

Displaying 1 - 7 of 7 reviews

Can't find what you're looking for?

Get help and learn more about the design.