Survey of Text Mining II: Cluster-Preserving Dimension Reduction Methods...
Some time ago, I promised a colleague a review of an excellent book, Survey of Text Mining II: Clustering, Classification, and Retrieval, edited by Michael W. Berry and Malu Castellanos. Overall, this...
Follow-on Thoughts: Clustering Algorithm Improvements for Text-based Data Mining
A good night’s sleep is excellent for clearing away mental cobwebs, and has given me more perspective on Chapter 1, “Cluster-Preserving Dimension Reduction Methods,” by Howland and Park in Survey of...
"Automatic Discovery of Similar Words" – Chapter 2 in Survey of Text Mining II
This post begins a review of “Automatic Discovery of Similar Words,” by Pierre Senellart and Vincent D. Blondel, published as Chapter 2 in Berry and Castellanos’ Survey of Text Mining II. This is an...
Chapter 2 Review, Continued, Part 2 – "Automatic Discovery of Similar Words"
(Direct continuation of yesterday’s post, w/r/t Senellart & Blondel on “Automatic Discovery of Similar Words” in Survey of Text Mining II. I give the references that they cite, which I discuss in this...
Chapter 2 (Part 3), Senellart & Blondel – Automatic Discovery of Similar Words
In Section 2.3, we get to the meat of Senellart & Blondel’s work, which is a graph-based method for determining similar words, using a dictionary as source. Their method uses a v × v matrix, where...
The 1-D Cluster Variation Method (CVM) – Simple Application
The 1-D Cluster Variation Method – Application to Text Mining and Data Mining. There are three particularly good reasons for us to look at the Cluster Variation Method (CVM) as an alternative means of...
Ground-Truthing – The First Step in Predictive Analytics
Ground Truthing – Your First Day in Class. You may be joining me for the Summer 2015 class in Text Analytics (PREDICT 453) that I’ll be teaching through Northwestern University’s Master of Science in...
Novelty Detection in Text Corpora
Detecting Novelty Using Text Analytics. Detecting novel events – new words, meaning new events – is one of the most important text analytics tasks, and is an important step towards predictive analytics...
Making Sense: Extracting Meaning from Text
Making Sense: Extracting Meaning from Text by Matching Terms and Entities to Ontologies and Concepts. Text analytics is the means by which computer algorithms can extract meaning and useful insights...