A tool for extracting plain text from Wikipedia dumps
275dcc9ac5
minor regex improvement |
||
---|---|---|
.gitignore | ||
extract.sh | ||
WikiExtractor.py |
275dcc9ac5
minor regex improvement |
||
---|---|---|
.gitignore | ||
extract.sh | ||
WikiExtractor.py |