Update README.md

This commit is contained in:
Giuseppe Attardi 2015-12-03 13:00:12 +01:00
parent a412c7e3ab
commit 0bb3061e79

View File

@ -1,7 +1,7 @@
# wikiextractor
[WikiExtractor.py](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) is a Python script that extracts and cleans text from a [Wikipedia database dump](http://download.wikimedia.org/).
The tool is written in Python and requires no additional library.
The tool is written in Python and requires Python 2.7 but no additional library.
**Warning**: problems have been reported on Windows due to poor support for `StringIO` in the Python implementation on Windows.
For further information, see the [project Home Page](http://medialab.di.unipi.it/wiki/Wikipedia_Extractor) or the [Wiki](https://github.com/attardi/wikiextractor/wiki).