add README

This commit is contained in:
lufo 2014-12-28 20:28:40 +08:00
parent c96831714a
commit 568864bb43
2 changed files with 4 additions and 4 deletions

View File

@ -73,8 +73,8 @@
</option>
</component>
<component name="ProjectFrameBounds">
<option name="x" value="-1861" />
<option name="y" value="-4" />
<option name="x" value="59" />
<option name="y" value="1076" />
<option name="width" value="1861" />
<option name="height" value="1084" />
</component>
@ -358,7 +358,7 @@
<servers />
</component>
<component name="ToolWindowManager">
<frame x="-1861" y="-4" width="1861" height="1084" extended-state="6" />
<frame x="59" y="1076" width="1861" height="1084" extended-state="6" />
<editor active="false" />
<layout>
<window_info id="Changes" active="false" anchor="bottom" auto_hide="false" internal_type="DOCKED" type="DOCKED" visible="false" weight="0.33" sideWeight="0.5" order="7" side_tool="false" content_ui="tabs" />

View File

@ -4,7 +4,7 @@ This program is a realization of [TwitterRank Algorithm](http://ink.library.smu.
#How to get Data
First we need to pick some users.The dataset doesn't need to be very big but users in this database should have close relationship,because in this case we can ensure most of users have influence on others.So i pick top 100 twitter users based on their followers.Because many of them follow other top 100 twitter users.This easy to get their user id from [twitaholic.com](http://twitaholic.com/),I wrote a script to get their user id.The code is in GetTopTwitters.py.
First we need to pick some users.The dataset doesn't need to be very big but users in this database should have close relationship,because in this case we can ensure most of users have influence on others.So I pick top 100 twitter users based on their followers.Because many of them follow other top 100 twitter users.This easy to get their user id from [twitaholic.com](http://twitaholic.com/),I wrote a script to get their user id.The code is in GetTopTwitters.py.
TwitterRank Algorithm use users' tweets' content,the number of their tweets,and their relationship to get the rank,so I wrote a spider to get them.The code is in spider.py,I use the [Tweepy library](https://github.com/tweepy/tweepy) to use Twitter API easier.