Sentiment Analysis on Reddit Headlines: Intro to Python’s NLTK – LearnDataSciPublished 2017-04-21
shares Facebook Twitter Google+ Pinterest LinkedIn Digg Del StumbleUpon Tumblr VKontakte Print Email Flattr Reddit Bu...
Home » Sentiment Analysis on Reddit Headlines: Intro to Python’s NLTK Grab the code to this tutorial on GitHub. In my last post, K-Means Clustering with Python, we just grabbed some precompiled data, but for this post, I wanted to get deeper into actually getting some live data. Using the Reddit API we can get thousands of headlines from various news subreddits and start to have some fun with Sentiment Analysis Sentiment analysis is the process of computationally identifying and categorizing opinions expressed in a piece of text. The opinions regarding a particular topic are usually positive, negative, or neutral. In this post, instead of providing you with the dataset itself, I'll will show you how to gather your own data. Technically you can download the text files right now, but I suggest doing this manually. This tutorial will be based off of the latest political news headlines using Reddit’s API. Before I get started, you will need to install the Natural Language Toolkit (NLTK) python package. To see how to install NLTK, you can go here: http://www.nltk.org/install.html... Read more (2 min reading time!)
Found in hashtags
Found in tweets
Sentiment Analysis on @Reddit Headlines: Intro to #Python NLTK #NLP https://t.co/iUL0IUo40H https://t.co/WkcYOVhdd8
Sentiment Analysis on Reddit Headlines: Intro to #Python NLTK #NLP https://t.co/yEA0V8j8r8 https://t.co/lCgxrpYucY