Sebastian Raschka, 2015

Python Machine Learning - Code Examples

Chapter 8 - Applying Machine Learning to Sentiment Analysis

  • Obtaining the IMDb movie review dataset
  • Introducing the bag-of-words model
    • Transforming words into feature vectors
    • Assessing word relevancy via term frequency-inverse document frequency
    • Cleaning text data
    • Processing documents into tokens
  • Training a logistic regression model for document classification
  • Working with bigger data – online algorithms and out-of-core learning
  • Summary
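
To give a feel for the bag-of-words and tf-idf topics listed above, here is a minimal pure-Python sketch. It is not the chapter's own code, which is not reproduced on this page. The three sample sentences, the helper names `tokenize`, `bag_of_words`, and `tfidf`, and the smoothed idf formula `ln((1 + n_docs) / (1 + df)) + 1` are all assumptions chosen for illustration; no normalization of the resulting vectors is applied.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(docs):
    """Return (vocabulary, count_vectors) for a list of raw documents.

    The vocabulary is sorted alphabetically so that column order is stable.
    """
    tokenized = [tokenize(doc) for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {tok: i for i, tok in enumerate(vocab)}
    vectors = []
    for toks in tokenized:
        vec = [0] * len(vocab)
        for tok, count in Counter(toks).items():
            vec[index[tok]] = count
        vectors.append(vec)
    return vocab, vectors


def tfidf(vectors):
    """Weight raw term counts by a smoothed inverse document frequency.

    Assumed formula: idf(t) = ln((1 + n_docs) / (1 + df(t))) + 1.
    """
    n_docs = len(vectors)
    n_terms = len(vectors[0]) if vectors else 0
    # Document frequency: number of documents containing each term.
    df = [sum(1 for vec in vectors if vec[j] > 0) for j in range(n_terms)]
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    return [[tf * w for tf, w in zip(vec, idf)] for vec in vectors]


# Hypothetical example documents (not taken from the IMDb dataset).
docs = [
    "The sun is shining",
    "The weather is sweet",
    "The sun is shining and the weather is sweet",
]
vocab, counts = bag_of_words(docs)
weights = tfidf(counts)

for doc, vec in zip(docs, weights):
    print(doc, [round(w, 3) for w in vec])
```

Words that occur in every document, such as "the", keep a weight equal to their raw count under this formula, while rarer words such as "and" are weighted up.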