From 3cebfdd4c04df4d94c008c5636ee2ed30948b939 Mon Sep 17 00:00:00 2001 From: attardi Date: Thu, 4 Feb 2016 11:08:37 +0100 Subject: [PATCH] Updated Copyright. --- ChangeLog | 68 +++++++++--------------------------------------- WikiExtractor.py | 6 ++--- 2 files changed, 16 insertions(+), 58 deletions(-) diff --git a/ChangeLog b/ChangeLog index 2c11e4b..60f8dd0 100644 --- a/ChangeLog +++ b/ChangeLog @@ -294,39 +294,39 @@ 2. Use full parser in order to better deal with nested and unbalanced expressions. -2011-02-10 Giuseppe Attardi - - * WikiExtractor.py: added Copyright. - 2012-02-15 Stefano Dei Rossi * WikiExtractor.py (WikiExtractor):   replaced with a simple space instead of u'\u00A0'. -2009-11-03 Antonio Fuschetto +2012-02-10 Giuseppe Attardi + + * WikiExtractor.py: added Copyright. + +2011-11-03 Antonio Fuschetto * WikiExtractor.py: updated version to 1.6 (Oct 17). -2009-10-17 Giuseppe Attardi +2011-10-17 Giuseppe Attardi * WikiExtractor.py: turned prefix into a parameter. -2009-07-29 Antonio Fuschetto +2011-07-29 Antonio Fuschetto * WikiExtractor.py (init): fixed bugs in apostrophe_bold_pattern and apostrophe_italic_pattern. -2009-07-28 Antonio Fuschetto +2011-07-28 Antonio Fuschetto * WikiExtractor.py (__garbage_namespaces): added "file" namespace to remove list. -2009-07-10 Antonio Fuschetto +2011-07-10 Antonio Fuschetto * WikiExtractor.py (get_wiki_document_url): changed the handling of URL prefix (anchors don't use prefix but a relative URLs). -2009-06-26 Antonio Fuschetto +2011-06-26 Antonio Fuschetto * WikiExtractor.py (extract_document): changed the handling of wikilinks, adding an anchor tag for each link with a reference to the @@ -335,7 +335,7 @@ * WikiExtractor.py (WikiExtractor): changed the handling of placeholders: from "[Formula 12]" to "formula_12". -2009-04-06 Antonio Fuschetto +2011-04-06 Antonio Fuschetto * WikiExtractor.py (init): fixed bugs in apostrophe_bold_pattern and apostrophe_italic_pattern. @@ -345,56 +345,14 @@ * WikiExtractor.py: released version 1.1. -2009-03-12 Antonio Fuschetto +2011-03-12 Antonio Fuschetto * wiki-extractor.py (main): removed the sentence splitting option. * wiki-extractor.py: fixed some bugs; released version 1.0; changed filename to "Wiki-Extractor.py" according to Tanl module names. -2009-03-01 Antonio Fuschetto +2011-03-01 Antonio Fuschetto * wiki-extractor.py (main): added cross platform path management. -2008-12-12 Antonio Fuschetto - - * wiki-extractor.py: fixed a wrong cleaning of apostrophes prior - italic and bold text. - -2008-10-27 Antonio Fuschetto - - * wiki-extractor.py: script complete rewriting (ver 0.8). - -2008-10-27 Giuseppe Attardi - - * wiki-extractor.py: added CopyLeft. - -2008-07-20 Giuseppe Attardi - - * wiki-extractor.py (main): renamed option gzip to bzip. - - * wiki-extractor.py (Document.__str__): removed global variables. - - * wiki-extractor.py (Document): turned split_sentences, clean_document, - print_document into methods. - -2008-07-15 Antonio Fuschetto - - * wiki-extractor.py: changed object serialization using standard - pickle module. - -2008-06-28 Antonio Fuschetto - - * wiki-extractor.py: added the management of italics with a bad format. - -2008-06-27 Antonio Fuschetto - - * wiki-extractor.py: fixed a wrong use of conversion between conding; - added the management of wikilink with a bad format; added the - menagement of unicode character (numeric entity); added the management - of italics like quoted text. - -2008-06-26 Giuseppe Attardi - - * wiki-extractor.py (main): turned global variables _infile and - _outfile into locals. diff --git a/WikiExtractor.py b/WikiExtractor.py index 1b5e20f..5b15d20 100755 --- a/WikiExtractor.py +++ b/WikiExtractor.py @@ -1,8 +1,8 @@ -#!/usr/bin/python +#!/usr/bin/env python # -*- coding: utf-8 -*- # ============================================================================= -# Version: 2.41 (November 19, 2015) +# Version: 2.42 (November 19, 2015) # Author: Giuseppe Attardi (attardi@di.unipi.it), University of Pisa # # Contributors: @@ -16,7 +16,7 @@ # Radics Geza (radicsge@gmail.com) # # ============================================================================= -# Copyright (c) 2009-2015. Giuseppe Attardi (attardi@di.unipi.it). +# Copyright (c) 2011-2016. Giuseppe Attardi (attardi@di.unipi.it). # ============================================================================= # This file is part of Tanl. #