Week Ending May 19

Hurricane Analysis Using Tweets Geolocation

Completed Tasks

  1. plot all the counties evacuated (mandatory one color, voluntary another color) for Hurricane Irma in Georgia/Florida
  2. plot all the counties evacuated (mandatory one color, voluntary another color) for Hurricane Maria in Puerto Rico.
  3. For all the tweets which are geolocated on the tweets database, we need to resolve their geocoordinates to counties.
  4. Find all people that tweeted one full month he hurricane start date and tweeted more than once from any given county. This will give us a list of people that 'live' in the hurricane area.
  5. Find all people that lived in the area (see before) that tweeted during the hurricane dates (here the filter is not the county, but the dates and the people found in the previous step).
  6. Find all tweets from people that lived in the area, which tweeted outside (and inside) of the hurricane area. For 1, 2 and 4 weeks after the hurricane time.

Code stored here https://github.com/thepanacealab/Hurricane-Analysis

Next Steps

  1. Chase all 'residents' and get all their tweets from the hurricane period that we don't already have.

ex: searchTwitter('charlie sheen', since='2011-03-01', until='2011-03-02') or tweets ← userTimeline(“realDonaldTrump”, n=200)

ex: https://gist.github.com/dsparks/4329876

  1. Try the location imputation algorithms to enhance the dataset - this is one of the novel things to try
  2. Use all tweets during hurricane from within the evacuation area and do sentiment analysis to get a plot of 'sentiment' over day for the hurricane times (pre, during, post).