update_02_oct_2018


02_Oct_2018

After last week's meeting, I found the HTML tags that contain the content of an entire tweet, all the way down to the username and timestamp.

Each tweet is contained within a div element whose class attribute is content. I have been using Beautiful Soup to isolate these chunks, and I got that working over the weekend. I hope that once the chunks are isolated, extracting everything else can follow in an orderly fashion when I output the information to the screen. I did not want to print all of the user IDs, or any other single aspect of the tweets, all at once without the rest of the information following it, since a user ID on its own would have no context and no tweet to tie it to.
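As a rough sketch of the chunk isolation described above, something like the following could work. The sample markup is a made-up, heavily simplified stand-in for the real page, and the tweet-text class, the sidebar div, and the user names are assumptions for illustration only; just the content and username classes come from what I found.

```python
from bs4 import BeautifulSoup

# Hypothetical, simplified stand-in for the saved page markup.
# Only the "content" and "username ..." classes come from the real page.
SAMPLE_HTML = """
<div class="content">
  <span class="username u-dir u-textTruncate">@<b>example_user</b></span>
  <p class="tweet-text">First example tweet</p>
</div>
<div class="sidebar">Not a tweet</div>
<div class="content">
  <span class="username u-dir u-textTruncate">@<b>another_user</b></span>
  <p class="tweet-text">Second example tweet</p>
</div>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# class_ matches any div whose class list includes "content".
tweet_chunks = soup.find_all("div", class_="content")

for chunk in tweet_chunks:
    # Print each chunk as a unit so its parts stay together.
    print(chunk.get_text(" ", strip=True))
    print("-" * 20)
```

Looping over whole chunks, rather than collecting every username in one pass, keeps each piece of information next to the tweet it belongs to.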

Right now I am in the process of isolating the user ID within each tweet. I found that this information is contained within a span element whose class attribute includes username. Within that <span class="username u-dir u-textTruncate"…> is the bolded user ID.
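A possible way to pull the user ID out of one chunk is sketched below. It assumes the bolded user ID sits in a b element inside the username span; the helper name extract_user_id and the sample markup are hypothetical, not taken from the real page.

```python
from bs4 import BeautifulSoup

# Hypothetical single tweet chunk; the <b> wrapper is an assumption
# about how the "bolded" user id is marked up.
SAMPLE_CHUNK = """
<div class="content">
  <span class="username u-dir u-textTruncate">@<b>example_user</b></span>
  <p>Example tweet text</p>
</div>
"""


def extract_user_id(chunk):
    """Return the bolded user id inside the chunk's username span, or None."""
    # class_="username" matches even though the span has several classes.
    span = chunk.find("span", class_="username")
    if span is None or span.b is None:
        return None
    return span.b.get_text(strip=True)


soup = BeautifulSoup(SAMPLE_CHUNK, "html.parser")
chunk = soup.find("div", class_="content")
user_id = extract_user_id(chunk)
print(user_id)
```

Returning None when the span or the bold element is missing lets the caller skip a malformed chunk instead of crashing partway through the page.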

  • Last modified: 2018/10/02 09:57
  • by kimberlylg