Taming Text

3.81  ·  Rating details ·  99 ratings  ·  7 reviews
It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and for software engineers who want to make their text-based applications more useful and user-friendly. Whether building a search engine for a corporate website, automatically organizing email, or extracting imp ...
Paperback, 322 pages
Published January 21st 2013 by Manning Publications (first published July 2011)

Community Reviews

Alex Ott
Feb 16, 2011 rated it really liked it
Good overview of different systems (Solr, OpenNLP & Mahout), approaches & algorithms for working with (unstructured) text - search, analyze, cluster, etc.
The book itself is completely practical with references to articles & books for people interested in more detailed/theoretical information.
Deane Barker
Oct 01, 2014 rated it liked it
The book alternates between a great overview of the subject, and getting down-and-dirty with the code. I don't know a good solution to this, but I was less concerned with the code and more concerned with the overhead view.

It's a good discussion of how to manage text: how to tokenize it, search it, cluster it, and classify it. Towards the later chapters, it bogs way, way down -- Chapter 7, on classification, is probably a quarter of the entire book in length. In many cases they "dropped to code"
Aug 16, 2013 rated it really liked it
This book is like a hands-on guide to Text Analytics and Processing, as it talks about the Open Source projects related to this topic. Other than that, it is a very good introduction to the basics of Text Analysis and how one can use Open Source solutions like Lucene, Solr, OpenNLP, etc. to do the same.

The last chapter, "Untamed text: the next frontier", is very good. For someone interested in Text Processing and Analytics, that last chapter is a very good read with lots of ideas regarding what coul
Jan 06, 2016 rated it it was ok
This book was alright. Many of the theoretical bits are things I've already encountered. The practical bits weren't so relevant because they were too tightly coupled to specific Java libraries and other pieces of software. Nothing too earth shattering here, but might be just right for some readers.
Vuk Trifkovic
Feb 09, 2013 rated it liked it
Shelves: tech
A bit scattergun, a bit basic, but a sound introduction to text handling.
Apr 06, 2013 rated it really liked it
Shelves: programming
A nice introduction to the field of natural language programming. May come in handy for some of my projects.
Akshay Ratan
Jan 08, 2015 rated it really liked it
Good for understanding content analysis using efficient techniques. Apache Solr and Lucene are explained with concepts and code.

Julio Sueiras
rated it it was amazing
Jan 20, 2015
Michael S. Wakkinen
rated it liked it
Jul 16, 2015
Rizki Kurniawan
rated it it was amazing
Dec 25, 2018
Trung Ngoc
rated it liked it
Feb 02, 2015
rated it really liked it
Feb 10, 2013
rated it it was ok
Oct 30, 2015
Aby James
rated it it was amazing
Apr 20, 2017
André Hagenbruch
rated it really liked it
Jul 31, 2011
Michael Taylor
rated it really liked it
Apr 04, 2017
Russell Jurney
rated it it was amazing
Oct 11, 2016
rated it really liked it
Jun 19, 2013
rated it liked it
Dec 31, 2016
Preslav Rachev
rated it really liked it
Oct 10, 2013
Jaromir Savelka
rated it really liked it
Jul 25, 2015
Johan Pretorius
rated it really liked it
May 17, 2015
Danilo Mutti
rated it really liked it
Feb 18, 2016
Antonio J
rated it liked it
Dec 04, 2015
rated it liked it
Mar 22, 2013
Michael Lee
rated it really liked it
Nov 08, 2018
Haozhe Xu
rated it it was amazing
May 07, 2016
Vitor Oliveira
rated it really liked it
Jan 08, 2015
rated it liked it
Apr 30, 2018
rated it really liked it
Oct 30, 2014
  • Introduction to Information Retrieval
  • Single Page Web Applications
  • Interactive Data Visualization for the Web
  • Algorithms of the Intelligent Web
  • Mining of Massive Datasets
  • Data Mining: Practical Machine Learning Tools and Techniques
  • Hadoop: The Definitive Guide
  • Machine Learning in Action
  • Python for Data Analysis
  • Beautiful Visualization: Looking at Data through the Eyes of Experts
  • Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists
  • ERLANG Programming
  • Bad Data Handbook: Cleaning Up The Data So You Can Get Back To Work
  • R in Action
  • VI Editor Pocket Reference (Pocket Reference (O'Reilly))
  • Beautiful Data: The Stories Behind Elegant Data Solutions (Theory In Practice, #31)
  • Mining the Social Web: Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites
  • Machine Learning: A Probabilistic Perspective
