Goodreads helps you keep track of books you want to read.
Start by marking “Taming Text: How to Find, Organize, and Manipulate It” as Want to Read:
Taming Text: How to Find, Organize, and Manipulate It
Enlarge cover
Rate this book
Clear rating
Open Preview

Taming Text: How to Find, Organize, and Manipulate It

3.81  ·  Rating details ·  106 ratings  ·  7 reviews
It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and for software engineers who want to make their text-based applications more useful and user-friendly. Whether building a search engine for a corporate website, automatically organizing email, or extracting imp ...more
Paperback, 322 pages
Published January 24th 2013 by Manning Publications (first published July 2011)
More Details... Edit Details

Friend Reviews

To see what your friends thought of this book, please sign up.

Reader Q&A

To ask other readers questions about Taming Text, please sign up.

Be the first to ask a question about Taming Text

Community Reviews

Showing 1-30
Average rating 3.81  · 
Rating details
 ·  106 ratings  ·  7 reviews


More filters
 | 
Sort order
Start your review of Taming Text: How to Find, Organize, and Manipulate It
Alex Ott
Feb 16, 2011 rated it really liked it
Good overview of different systems (Solr, OpenNLP & Mahout), approaches & algorithms for working with (unstructured) text - search, analyze, cluster, etc.
The book itself is completely practical with references to articles & books for people interested in more detailed/theoretical information.
...more
Deane Barker
Oct 01, 2014 rated it liked it
The book alternates between a great overview of the subject, and getting down-and-dirty with the code. I don't know a good solution to this, but I was less concerned with the code and more concerned with the overhead view.

It's a good discussion of how to manage text: how to tokenize it, search it, cluster it, and classify it. Towards the later chapters, it bogs way, way down -- Chapter 7, on classification, is probably a quarter of the entire book in length. In many cases they "dropped to code"
...more
Ashish
Aug 16, 2013 rated it really liked it
This book is like a hands on guide to Text Analytics and Processing as it talks about the Open Source projects related to this topic. Other than that it is a very good introduction to basics of Text Analysis and how one can use Open Source Solutions like Lucene, Solr, OpenNLP etc to do the same.


The last chapter about Untamed text: the next frontier is very good. For someone interested in Text Processing and Analytics that last chapter is a very good read with a lots of ideas regarding what coul
...more
Zac
Jan 06, 2016 rated it it was ok
This book was alright. Many of the theoretical bits are things I've already encountered. The practical bits weren't so relevant because they were too tightly coupled to specific Java libraries and other pieces of software. Nothing too earth shattering here, but might be just right for some readers.
Vuk Trifkovic
Feb 09, 2013 rated it liked it
Shelves: tech
Bit scattergun, a bit basic, but sound introduction to text handling.
Dgg32
Apr 06, 2013 rated it really liked it
Shelves: programming
A nice introduction to the field natural language programming. May come in handy for some of my projects.
Akshay Ratan
Jan 08, 2015 rated it really liked it
Good for understanding content analysis using efficient techniques. Apache Solr and Lucene explained with concepts and codes.

Julio Sueiras
rated it it was amazing
Jan 20, 2015
Michael S. Wakkinen
rated it liked it
Jul 16, 2015
Rizki Kurniawan
rated it it was amazing
Dec 25, 2018
Trung Ngoc
rated it liked it
Feb 02, 2015
Vijayakumar
rated it really liked it
Feb 10, 2013
Geert-Jan
rated it it was ok
Oct 30, 2015
Aby James
rated it it was amazing
Apr 20, 2017
André Hagenbruch
rated it really liked it
Jul 31, 2011
Michael Taylor
rated it really liked it
Apr 04, 2017
Russell Jurney
rated it it was amazing
Oct 11, 2016
Carl Lim
rated it really liked it
Apr 19, 2019
Tom
rated it liked it
Dec 31, 2016
Ismail Mayat
rated it really liked it
Jun 26, 2019
Preslav Rachev
rated it really liked it
Oct 10, 2013
Jaromir Savelka
rated it really liked it
Jul 25, 2015
Johan Pretorius
rated it really liked it
May 17, 2015
Danilo Mutti
rated it really liked it
Feb 18, 2016
Antonio J
rated it liked it
Dec 04, 2015
Ivo
rated it liked it
Mar 22, 2013
Michael Lee
rated it really liked it
Nov 08, 2018
Haozhe Xu
rated it it was amazing
May 07, 2016
Vitor Oliveira
rated it really liked it
Jan 08, 2015
Marco Carnini
rated it really liked it
Mar 22, 2019
« previous 1 3 4 next »
There are no discussion topics on this book yet. Be the first to start one »

Readers also enjoyed

  • Solr in Action
  • Programming Collective Intelligence: Building Smart Web 2.0 Applications
  • Fluent Python: Clear, Concise, and Effective Programming
  • Back to Work: Why We Need Smart Government for a Strong Economy
  • Give People Money: The Simple Idea to Solve Inequality and Revolutionise Our Lives
  • How We Got to Now: Six Innovations That Made the Modern World
  • Drift: The Unmooring of American Military Power
  • The Deepest Well: Healing the Long-Term Effects of Childhood Adversity
  • Before the Dawn: Recovering the Lost History of Our Ancestors
  • Celephaïs
  • Conscious Loving: The Journey to Co-Committment
  • Decision Trees and Random Forests: A Visual Introduction For Beginners: A Simple Guide to Machine Learning with Decision Trees
  • Not a Penny More, Not a Penny Less
  • The Japanese Today: Change and Continuity, Enlarged Edition
  • The Old Way: A Story of the First People
  • The Road to Little Dribbling: Adventures of an American in Britain
  • The Road Ahead
  • Code Complete
See similar books…

Goodreads is hiring!

If you like books and love to build cool products, we may be looking for you.
Learn more »

News & Interviews

While books about anti-racism are trending on Goodreads and dominating the bestseller lists right now, some of our favorite Black authors are a...
164 likes · 32 comments