Jump to ratings and reviews
Rate this book

Text Processing Basics

Rate this book
This book covers basic concepts of text processing with an emphasis on methods used in information retrieval, document matching and clustering, and natural language processing (NLP). It starts by defining the concepts of tokenization, n-grams, shingles, and text similarity measures. It then introduces core problems in NLP and in text matching and clustering, followed by algorithms for them. Covered problems and algorithms include those for text clustering, text classification, topic modeling, sequence modeling, and information extraction.

35 pages, Kindle Edition

First published December 13, 2013

Loading...
Loading...

About the author

Arun Jagota

21 books2 followers

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
0 (0%)
4 stars
0 (0%)
3 stars
0 (0%)
2 stars
1 (100%)
1 star
0 (0%)
No one has reviewed this book yet.