debian/0000775000000000000000000000000012137271515007174 5ustar debian/rules0000775000000000000000000000005512137271406010253 0ustar #!/usr/bin/make -f %: dh $@ --with python2 debian/pyversions0000664000000000000000000000000512137271406011332 0ustar 2.4- debian/watch0000664000000000000000000000022312137271406010221 0ustar version=3 opts="uversionmangle=s/\(b.*$\)/~\1/" \ http://code.google.com/p/nltk/downloads/list http://nltk.googlecode.com/files/nltk-(.*)\.tar\.gz debian/changelog0000664000000000000000000000170412137271505011047 0ustar nltk (2.0~b9-0ubuntu4) saucy; urgency=low * debian/control: Bump Standards-Version to 3.9.4. * debian/copyright: - Update Format URI. - Update obsolete field names. -- Logan Rosen Sun, 28 Apr 2013 15:06:23 -0400 nltk (2.0~b9-0ubuntu3) precise; urgency=low * Rebuild to drop python2.6 dependencies. -- Matthias Klose Sat, 31 Dec 2011 02:05:46 +0000 nltk (2.0~b9-0ubuntu2) oneiric; urgency=low * Switch from pysupport to dh_python2 (LP: #835186) - Build --with python2 - Drop B-D-I on python-support -- Scott Kitterman Fri, 26 Aug 2011 21:38:03 -0400 nltk (2.0~b9-0ubuntu1) natty; urgency=low * New upstream release * Bump standards to 3.9.1 -- Bhavani Shankar Fri, 26 Nov 2010 13:21:15 +0530 nltk (2.0~b8-0ubuntu1) lucid; urgency=low * Initial release (LP: #514936) -- Robin Munn Mon, 15 Feb 2010 16:22:10 -0600 debian/compat0000664000000000000000000000000212137271406010371 0ustar 7 debian/README.source0000664000000000000000000000433712137271406011361 0ustar python-nltk for Debian ---------------------- The upstream source package contains a binary file, nltk.jar. Binary files cannot be checked for backdoors, Trojan horses, etc. the way source files can, so the binary nltk.jar is removed in debian/clean. The presence of this binary in .orig.tar.gz, and thus its redistribution via the Debian archives, is not a problem from a licensing standpoint, because source for nltk.jar can be found in javasrc/org/nltk/mallet/*.java. All these *.java files are licensed under the Apache-2.0 license, which allows redistribution of both source and binaries as long as a copy of the license accompanies the software. Since all source files for nltk.jar are present, in principle it should be straightforward to compile these files to *.class files and recreate nltk.jar during the package build process. The problem is that these Java classes are an interface to Mallet, an open-source library found at http://mallet.cs.umass.edu/, and compiling the files in javasrc/org/nltk/mallet/*.java requires Mallet to be installed and working because the source files do an "import edu.umass.cs.mallet.base.util.*" (and other imports from Mallet). But Mallet has never been packaged for Debian, so adding a build dependency on Mallet is problematic. Furthermore, the NLTK Mallet interface was built for Mallet 0.4, which is about four years old as of this writing; if Mallet were to be packaged for Debian, the version packaged would probably be the current version of Mallet, which is 2.0 and has a different API (Mallet 2.0 puts classes in the package "cc.mallet" instead of "edu.umass.cs.mallet.base"). Therefore, until such time as Mallet has been packaged for Debian and the NLTK Mallet interface has been updated to work with Mallet 2.0, the Mallet interface has been removed from the Debian package of NLTK. Calling the nltk.config_mallet() or nltk.call_mallet() functions, as well as any of the functions or classes found in nltk/tag/crf.py (which rely on Mallet for their functionality) will fail with a NotImplementedError. Once a Mallet package is available for Debian, a new python-nltk package will be built with the Mallet interface functionality restored. -- Robin Munn Mon, 08 Feb 2010 16:08:56 -0600 debian/copyright0000664000000000000000000005751012137271455011142 0ustar Format: http://www.debian.org/doc/packaging-manuals/copyright-format/1.0/ Upstream-Name: NLTK Upstream-Contact: Robin Munn Source: http://www.nltk.org/ Files: * Copyright: 2001-2010, NLTK Project License: Apache-2.0 Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at . http://www.apache.org/licenses/LICENSE-2.0 . Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. . On Debian-based systems the full text of the Apache version 2.0 license can be found in `/usr/share/common-licenses/Apache-2.0'. Files: debian/* Copyright: 2010, Robin Munn License: Apache-2.0 On Debian-based systems the full text of the Apache version 2.0 license can be found in `/usr/share/common-licenses/Apache-2.0'. Files: nltk/etree/* Copyright: 1999-2005, Fredrik Lundh License: other By obtaining, using, and/or copying this software and/or its associated documentation, you agree that you have read, understood, and will comply with the following terms and conditions: . Permission to use, copy, modify, and distribute this software and its associated documentation for any purpose and without fee is hereby granted, provided that the above copyright notice appears in all copies, and that both that copyright notice and this permission notice appear in supporting documentation, and that the name of Secret Labs AB or the author not be used in advertising or publicity pertaining to distribution of the software without specific, written prior permission. . SECRET LABS AB AND THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL SECRET LABS AB OR THE AUTHOR BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. Files: nltk/stem/porter.py Copyright: 2002, Vivake Gupta (vivakeATomniscia.org) License: GPL-2+ with GNU Linking Exception This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. . This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. . On Debian-based systems the full text of the GPL version 2 can be found at /usr/share/common-licenses/GPL-2. . GNU Linking Exception: Using this module statically or dynamically with other modules is making a combined work based on this module. Thus, the terms and conditions of the GNU General Public License cover the whole combination. As a special exception, the copyright holders of this module give you permission to combine this module with independent modules to produce an executable program, regardless of the license terms of these independent modules, and to copy and distribute the resulting program under terms of your choice, provided that you also meet, for each linked independent module, the terms and conditions of the license of that module. An independent module is a module which is not derived from or based on this module. If you modify this module, you may extend this exception to your version of the module, but you are not obliged to do so. If you do not wish to do so, delete this exception statement from your version. Files: nltk/treetransforms.py Copyright: 2005-2007, Oregon Graduate Institute License: Apache-2.0 On Debian-based systems the full text of the Apache version 2.0 license can be found in `/usr/share/common-licenses/Apache-2.0'. Files: nltk/lazyimport.py Copyright: 1999-2005, Marc-Andre Lemburg License: eGenix.com Public License EGENIX.COM PUBLIC LICENSE AGREEMENT Version 1.1.0 . This license agreement is based on the Python CNRI License Agreement, a widely accepted open- source license. 1. Introduction This "License Agreement" is between eGenix.com Software, Skills and Services GmbH ("eGenix.com"), having an office at Pastor-Loeh-Str. 48, D-40764 Langenfeld, Germany, and the Individual or Organization ("Licensee") accessing and otherwise using this software in source or binary form and its associated documentation ("the Software"). 2. License Subject to the terms and conditions of this eGenix.com Public License Agreement, eGenix.com hereby grants Licensee a non-exclusive, royalty-free, world-wide license to reproduce, analyze, test, perform and/or display publicly, prepare derivative works, distribute, and otherwise use the Software alone or in any derivative version, provided, however, that the eGenix.com Public License Agreement is retained in the Software, or in any derivative version of the Software prepared by Licensee. 3. NO WARRANTY eGenix.com is making the Software available to Licensee on an "AS IS" basis. SUBJECT TO ANY STATUTORY WARRANTIES WHICH CAN NOT BE EXCLUDED, EGENIX.COM MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, EGENIX.COM MAKES NO AND DISCLAIMS ANY REPRESENTATION OR WARRANTY OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE SOFTWARE WILL NOT INFRINGE ANY THIRD PARTY RIGHTS. 4. LIMITATION OF LIABILITY EGENIX.COM SHALL NOT BE LIABLE TO LICENSEE OR ANY OTHER USERS OF THE SOFTWARE FOR ANY INCIDENTAL, SPECIAL, OR CONSEQUENTIAL DAMAGES OR LOSS (INCLUDING, WITHOUT LIMITATION, DAMAGES FOR LOSS OF BUSINESS PROFITS, BUSINESS INTERRUPTION, LOSS OF BUSINESS INFORMATION, OR OTHER PECUNIARY LOSS) AS A RESULT OF USING, MODIFYING OR DISTRIBUTING THE SOFTWARE, OR ANY DERIVATIVE THEREOF, EVEN IF ADVISED OF THE POSSIBILITY THEREOF. SOME JURISDICTIONS DO NOT ALLOW THE EXCLUSION OR LIMITATION OF INCIDENTAL OR CONSEQUENTIAL DAMAGES, SO THE ABOVE EXCLUSION OR LIMITATION MAY NOT APPLY TO LICENSEE. 5. Termination This License Agreement will automatically terminate upon a material breach of its terms and conditions. 6. Third Party Rights Any software or documentation in source or binary form provided along with the Software that is associated with a separate license agreement is licensed to Licensee under the terms of that license agreement. This License Agreement does not apply to those portions of the Software. Copies of the third party licenses are included in the Software Distribution. 7. General Nothing in this License Agreement affects any statutory rights of consumers that cannot be waived or limited by contract. Nothing in this License Agreement shall be deemed to create any relationship of agency, partnership, or joint venture between eGenix.com and Licensee. If any provision of this License Agreement shall be unlawful, void, or for any reason unenforceable, such provision shall be modified to the extent necessary to render it enforceable without losing its intent, or, if no such modification is possible, be severed from this License Agreement and shall not affect the validity and enforceability of the remaining provisions of this License Agreement. This License Agreement shall be governed by and interpreted in all respects by the law of Germany, excluding conflict of law provisions. It shall not be governed by the United Nations Convention on Contracts for International Sale of Goods. This License Agreement does not grant permission to use eGenix.com trademarks or trade names in a trademark sense to endorse or promote products or services of Licensee, or any third party. The controlling language of this License Agreement is English. If Licensee has received a translation into another language, it has been provided for Licensee's convenience only. 8. Agreement By downloading, copying, installing or otherwise using the Software, Licensee agrees to be bound by the terms and conditions of this License Agreement. For question regarding this License Agreement, please write to: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str. 48 D-40764 Langenfeld Germany Files: nltk/decorators.py Copyright: Michele Simionato License: BSD without no-endorsement clause Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. Redistributions in bytecode form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. . THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. Files: nltk/app/nemo_app.py Copyright: 2006, Aristide Grange License: PSF-2 Files: nltk/misc/babelfish.py Copyright: 2001, Jonathan Feinberg 2009, Steven Bird License: PSF-2 Files: nltk/misc/chomsky.py Copyright: 2005, Raymond Hettinger License: PSF-2 License: PSF-2 A. HISTORY OF THE SOFTWARE ========================== . Python was created in the early 1990s by Guido van Rossum at Stichting Mathematisch Centrum (CWI, see http://www.cwi.nl) in the Netherlands as a successor of a language called ABC. Guido remains Python's principal author, although it includes many contributions from others. . In 1995, Guido continued his work on Python at the Corporation for National Research Initiatives (CNRI, see http://www.cnri.reston.va.us) in Reston, Virginia where he released several versions of the software. . In May 2000, Guido and the Python core development team moved to BeOpen.com to form the BeOpen PythonLabs team. In October of the same year, the PythonLabs team moved to Digital Creations (now Zope Corporation, see http://www.zope.com). In 2001, the Python Software Foundation (PSF, see http://www.python.org/psf/) was formed, a non-profit organization created specifically to own Python-related Intellectual Property. Zope Corporation is a sponsoring member of the PSF. . All Python releases are Open Source (see http://www.opensource.org for the Open Source Definition). Historically, most, but not all, Python releases have also been GPL-compatible; the table below summarizes the various releases. . Release Derived Year Owner GPL- from compatible? (1) . 0.9.0 thru 1.2 1991-1995 CWI yes 1.3 thru 1.5.2 1.2 1995-1999 CNRI yes 1.6 1.5.2 2000 CNRI no 2.0 1.6 2000 BeOpen.com no 1.6.1 1.6 2001 CNRI yes (2) 2.1 2.0+1.6.1 2001 PSF no 2.0.1 2.0+1.6.1 2001 PSF yes 2.1.1 2.1+2.0.1 2001 PSF yes 2.2 2.1.1 2001 PSF yes 2.1.2 2.1.1 2002 PSF yes 2.1.3 2.1.2 2002 PSF yes 2.2.1 2.2 2002 PSF yes 2.2.2 2.2.1 2002 PSF yes 2.2.3 2.2.2 2003 PSF yes 2.3 2.2.2 2002-2003 PSF yes 2.3.1 2.3 2002-2003 PSF yes 2.3.2 2.3.1 2002-2003 PSF yes 2.3.3 2.3.2 2002-2003 PSF yes 2.3.4 2.3.3 2004 PSF yes 2.3.5 2.3.4 2005 PSF yes 2.4 2.3 2004 PSF yes 2.4.1 2.4 2005 PSF yes 2.4.2 2.4.1 2005 PSF yes 2.4.3 2.4.2 2006 PSF yes 2.5 2.4 2006 PSF yes 2.5.1 2.5 2007 PSF yes . Footnotes: . (1) GPL-compatible doesn't mean that we're distributing Python under the GPL. All Python licenses, unlike the GPL, let you distribute a modified version without making your changes open source. The GPL-compatible licenses make it possible to combine Python with other software that is released under the GPL; the others don't. . (2) According to Richard Stallman, 1.6.1 is not GPL-compatible, because its license has a choice of law clause. According to CNRI, however, Stallman's lawyer has told CNRI's lawyer that 1.6.1 is "not incompatible" with the GPL. . Thanks to the many outside volunteers who have worked under Guido's direction to make these releases possible. . . B. TERMS AND CONDITIONS FOR ACCESSING OR OTHERWISE USING PYTHON =============================================================== . PYTHON SOFTWARE FOUNDATION LICENSE VERSION 2 -------------------------------------------- . 1. This LICENSE AGREEMENT is between the Python Software Foundation ("PSF"), and the Individual or Organization ("Licensee") accessing and otherwise using this software ("Python") in source or binary form and its associated documentation. . 2. Subject to the terms and conditions of this License Agreement, PSF hereby grants Licensee a nonexclusive, royalty-free, world-wide license to reproduce, analyze, test, perform and/or display publicly, prepare derivative works, distribute, and otherwise use Python alone or in any derivative version, provided, however, that PSF's License Agreement and PSF's notice of copyright, i.e., "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006, 2007 Python Software Foundation; All Rights Reserved" are retained in Python alone or in any derivative version prepared by Licensee. . 3. In the event Licensee prepares a derivative work that is based on or incorporates Python or any part thereof, and wants to make the derivative work available to others as provided herein, then Licensee hereby agrees to include in any such work a brief summary of the changes made to Python. . 4. PSF is making Python available to Licensee on an "AS IS" basis. PSF MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PSF MAKES NO AND DISCLAIMS ANY REPRESENTATION OR WARRANTY OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF PYTHON WILL NOT INFRINGE ANY THIRD PARTY RIGHTS. . 5. PSF SHALL NOT BE LIABLE TO LICENSEE OR ANY OTHER USERS OF PYTHON FOR ANY INCIDENTAL, SPECIAL, OR CONSEQUENTIAL DAMAGES OR LOSS AS A RESULT OF MODIFYING, DISTRIBUTING, OR OTHERWISE USING PYTHON, OR ANY DERIVATIVE THEREOF, EVEN IF ADVISED OF THE POSSIBILITY THEREOF. . 6. This License Agreement will automatically terminate upon a material breach of its terms and conditions. . 7. Nothing in this License Agreement shall be deemed to create any relationship of agency, partnership, or joint venture between PSF and Licensee. This License Agreement does not grant permission to use PSF trademarks or trade name in a trademark sense to endorse or promote products or services of Licensee, or any third party. . 8. By copying, installing or otherwise using Python, Licensee agrees to be bound by the terms and conditions of this License Agreement. . . BEOPEN.COM LICENSE AGREEMENT FOR PYTHON 2.0 ------------------------------------------- . BEOPEN PYTHON OPEN SOURCE LICENSE AGREEMENT VERSION 1 . 1. This LICENSE AGREEMENT is between BeOpen.com ("BeOpen"), having an office at 160 Saratoga Avenue, Santa Clara, CA 95051, and the Individual or Organization ("Licensee") accessing and otherwise using this software in source or binary form and its associated documentation ("the Software"). . 2. Subject to the terms and conditions of this BeOpen Python License Agreement, BeOpen hereby grants Licensee a non-exclusive, royalty-free, world-wide license to reproduce, analyze, test, perform and/or display publicly, prepare derivative works, distribute, and otherwise use the Software alone or in any derivative version, provided, however, that the BeOpen Python License is retained in the Software, alone or in any derivative version prepared by Licensee. . 3. BeOpen is making the Software available to Licensee on an "AS IS" basis. BEOPEN MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, BEOPEN MAKES NO AND DISCLAIMS ANY REPRESENTATION OR WARRANTY OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE SOFTWARE WILL NOT INFRINGE ANY THIRD PARTY RIGHTS. . 4. BEOPEN SHALL NOT BE LIABLE TO LICENSEE OR ANY OTHER USERS OF THE SOFTWARE FOR ANY INCIDENTAL, SPECIAL, OR CONSEQUENTIAL DAMAGES OR LOSS AS A RESULT OF USING, MODIFYING OR DISTRIBUTING THE SOFTWARE, OR ANY DERIVATIVE THEREOF, EVEN IF ADVISED OF THE POSSIBILITY THEREOF. . 5. This License Agreement will automatically terminate upon a material breach of its terms and conditions. . 6. This License Agreement shall be governed by and interpreted in all respects by the law of the State of California, excluding conflict of law provisions. Nothing in this License Agreement shall be deemed to create any relationship of agency, partnership, or joint venture between BeOpen and Licensee. This License Agreement does not grant permission to use BeOpen trademarks or trade names in a trademark sense to endorse or promote products or services of Licensee, or any third party. As an exception, the "BeOpen Python" logos available at http://www.pythonlabs.com/logos.html may be used according to the permissions granted on that web page. . 7. By copying, installing or otherwise using the software, Licensee agrees to be bound by the terms and conditions of this License Agreement. . . CNRI LICENSE AGREEMENT FOR PYTHON 1.6.1 --------------------------------------- . 1. This LICENSE AGREEMENT is between the Corporation for National Research Initiatives, having an office at 1895 Preston White Drive, Reston, VA 20191 ("CNRI"), and the Individual or Organization ("Licensee") accessing and otherwise using Python 1.6.1 software in source or binary form and its associated documentation. . 2. Subject to the terms and conditions of this License Agreement, CNRI hereby grants Licensee a nonexclusive, royalty-free, world-wide license to reproduce, analyze, test, perform and/or display publicly, prepare derivative works, distribute, and otherwise use Python 1.6.1 alone or in any derivative version, provided, however, that CNRI's License Agreement and CNRI's notice of copyright, i.e., "Copyright (c) 1995-2001 Corporation for National Research Initiatives; All Rights Reserved" are retained in Python 1.6.1 alone or in any derivative version prepared by Licensee. Alternately, in lieu of CNRI's License Agreement, Licensee may substitute the following text (omitting the quotes): "Python 1.6.1 is made available subject to the terms and conditions in CNRI's License Agreement. This Agreement together with Python 1.6.1 may be located on the Internet using the following unique, persistent identifier (known as a handle): 1895.22/1013. This Agreement may also be obtained from a proxy server on the Internet using the following URL: http://hdl.handle.net/1895.22/1013". . 3. In the event Licensee prepares a derivative work that is based on or incorporates Python 1.6.1 or any part thereof, and wants to make the derivative work available to others as provided herein, then Licensee hereby agrees to include in any such work a brief summary of the changes made to Python 1.6.1. . 4. CNRI is making Python 1.6.1 available to Licensee on an "AS IS" basis. CNRI MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, CNRI MAKES NO AND DISCLAIMS ANY REPRESENTATION OR WARRANTY OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF PYTHON 1.6.1 WILL NOT INFRINGE ANY THIRD PARTY RIGHTS. . 5. CNRI SHALL NOT BE LIABLE TO LICENSEE OR ANY OTHER USERS OF PYTHON 1.6.1 FOR ANY INCIDENTAL, SPECIAL, OR CONSEQUENTIAL DAMAGES OR LOSS AS A RESULT OF MODIFYING, DISTRIBUTING, OR OTHERWISE USING PYTHON 1.6.1, OR ANY DERIVATIVE THEREOF, EVEN IF ADVISED OF THE POSSIBILITY THEREOF. . 6. This License Agreement will automatically terminate upon a material breach of its terms and conditions. . 7. This License Agreement shall be governed by the federal intellectual property law of the United States, including without limitation the federal copyright law, and, to the extent such U.S. federal law does not apply, by the law of the Commonwealth of Virginia, excluding Virginia's conflict of law provisions. Notwithstanding the foregoing, with regard to derivative works based on Python 1.6.1 that incorporate non-separable material that was previously distributed under the GNU General Public License (GPL), the law of the Commonwealth of Virginia shall govern this License Agreement only as to issues arising under or with respect to Paragraphs 4, 5, and 7 of this License Agreement. Nothing in this License Agreement shall be deemed to create any relationship of agency, partnership, or joint venture between CNRI and Licensee. This License Agreement does not grant permission to use CNRI trademarks or trade name in a trademark sense to endorse or promote products or services of Licensee, or any third party. . 8. By clicking on the "ACCEPT" button where indicated, or by copying, installing or otherwise using Python 1.6.1, Licensee agrees to be bound by the terms and conditions of this License Agreement. . ACCEPT . . CWI LICENSE AGREEMENT FOR PYTHON 0.9.0 THROUGH 1.2 -------------------------------------------------- . Copyright (c) 1991 - 1995, Stichting Mathematisch Centrum Amsterdam, The Netherlands. All rights reserved. . Permission to use, copy, modify, and distribute this software and its documentation for any purpose and without fee is hereby granted, provided that the above copyright notice appear in all copies and that both that copyright notice and this permission notice appear in supporting documentation, and that the name of Stichting Mathematisch Centrum or CWI not be used in advertising or publicity pertaining to distribution of the software without specific, written prior permission. . STICHTING MATHEMATISCH CENTRUM DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL STICHTING MATHEMATISCH CENTRUM BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. debian/source/0000775000000000000000000000000012137271515010474 5ustar debian/source/format0000664000000000000000000000001412137271406011701 0ustar 3.0 (quilt) debian/clean0000664000000000000000000000001612137271406010175 0ustar nltk/nltk.jar debian/control0000664000000000000000000000145712137271417010607 0ustar Source: nltk Section: python Priority: optional Maintainer: Ubuntu Developers XSBC-Original-Maintainer: Robin Munn Build-Depends: debhelper (>= 7), python (>= 2.4), python-yaml Standards-Version: 3.9.4 Homepage: http://www.nltk.org/ Package: python-nltk Architecture: all Depends: ${python:Depends}, ${misc:Depends}, python-yaml Suggests: python-numpy, python-matplotlib, prover9 Description: Python libraries for natural language processing The Natural Language Toolkit (NLTK) is a suite of open source Python modules, data and documentation for research and development in natural language processing. NLTK contains code supporting dozens of NLP tasks, along with 40 popular corpora and extensive documentation including a 375-page online book. debian/docs0000664000000000000000000000001312137271406010040 0ustar README.txt debian/patches/0000775000000000000000000000000012137271515010623 5ustar debian/patches/10-remove-mallet-interface.patch0000664000000000000000000003127712137271406016602 0ustar Author: Robin Munn Description: Remove Mallet interface functionality The Mallet interface in NLTK requires Mallet to be installed and working on the target system; however, Mallet has never been packaged for Debian. Until such time as Mallet has been packaged for Debian, the NLTK-Mallet interface will be removed and attempting to call its functions will raise a NotImplementedError exception. diff -Nur -x '*.orig' -x '*~' python-nltk-2.0~b8/nltk/classify/mallet.py python-nltk-2.0~b8.new/nltk/classify/mallet.py --- python-nltk-2.0~b8/nltk/classify/mallet.py 2010-02-09 11:41:49.825746067 -0600 +++ python-nltk-2.0~b8.new/nltk/classify/mallet.py 2010-02-09 11:45:07.616940569 -0600 @@ -41,6 +41,8 @@ C{LookupError} exception. @type mallet_home: C{string} """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." global _mallet_home, _mallet_classpath # We don't actually care about this binary -- we just use it to @@ -73,6 +75,8 @@ See L{nltk.internals.java()} for parameter and return value descriptions. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." if _mallet_classpath is None: config_mallet() diff -Nur -x '*.orig' -x '*~' python-nltk-2.0~b8/nltk/tag/crf.py python-nltk-2.0~b8.new/nltk/tag/crf.py --- python-nltk-2.0~b8/nltk/tag/crf.py 2010-02-09 11:41:49.825746067 -0600 +++ python-nltk-2.0~b8.new/nltk/tag/crf.py 2010-02-09 11:45:07.616940569 -0600 @@ -62,6 +62,8 @@ a pickled pointer to the feature detector (e.g., if the feature detector was a lambda function). """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." # Read the CRFInfo from the model file. zf = zipfile.ZipFile(filename) crf_info = CRFInfo.fromstring(zf.read('crf-info.xml')) @@ -90,6 +92,8 @@ #///////////////////////////////////////////////////////////////// def _get_filename(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return self.crf_info.model_filename filename = property(_get_filename , doc=""" The filename of the crf model file that backs this @@ -99,6 +103,8 @@ structure of the CRF (C{crf-info.xml}).""") def _get_feature_detector(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return self.crf_info.model_feature_detector feature_detector = property(_get_feature_detector , doc=""" The feature detector function that is used to convert tokens @@ -114,6 +120,8 @@ _RUN_CRF = "org.nltk.mallet.RunCRF" def batch_tag(self, sentences): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." # Write the test corpus to a temporary file (fd, test_file) = mkstemp('.txt', 'test') self.write_test_corpus(sentences, os.fdopen(fd, 'w')) @@ -215,6 +223,8 @@ while training the CRF. Higher numbers generate more verbose output. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." t0 = time.time() # Record starting time. # If they did not supply a model filename, then choose one. @@ -288,6 +298,8 @@ set of configuration parameters, and based on the contents of a given corpus. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." state_info_list = [] labels = set() @@ -372,6 +384,8 @@ selected for display are determined by L{_FILTER_TRAINING_OUTPUT}. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." out = [] while p.poll() is None: while True: @@ -398,6 +412,8 @@ Write a given training corpus to a given stream, in a format that can be read by the java script C{org.nltk.mallet.TrainCRF}. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." feature_detector = self.crf_info.feature_detector for sentence in corpus: if self.crf_info.add_start_state: @@ -418,6 +434,8 @@ Write a given test corpus to a given stream, in a format that can be read by the java script C{org.nltk.mallet.TestCRF}. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." feature_detector = self.crf_info.feature_detector for sentence in corpus: if self.crf_info.add_start_state: @@ -438,6 +456,8 @@ C{org.nltk.mallet.TestCRF}, and convert it to a labeled corpus. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." if re.match(r'\s*<>', s): assert 0, 'its a lattice' corpus = [[]] @@ -455,6 +475,8 @@ _ESCAPE_RE = re.compile('[^a-zA-Z0-9]') @staticmethod def _escape_sub(m): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return '%' + ('%02x' % ord(m.group())) @staticmethod @@ -465,6 +487,8 @@ character in fname or fval that's not a letter or a number, just to be conservative. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." fname = MalletCRF._ESCAPE_RE.sub(MalletCRF._escape_sub, fname) if isinstance(fval, basestring): fval = "'%s'" % MalletCRF._ESCAPE_RE.sub( @@ -478,6 +502,8 @@ #///////////////////////////////////////////////////////////////// def __repr__(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return 'MalletCRF(%r)' % self.crf_info.model_filename ########################################################################### @@ -503,6 +529,8 @@ max_iterations, transduction_type, weight_groups, add_start_state, add_end_state, model_filename, feature_detector): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." self.gaussian_variance = float(gaussian_variance) self.default_label = default_label self.states = states @@ -540,6 +568,8 @@ '\n') def toxml(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." info = self.__dict__.copy() info['states'] = '\n'.join(state.toxml() for state in self.states) info['w_groups'] = '\n'.join(wg.toxml() for wg in self.weight_groups) @@ -557,10 +587,14 @@ @staticmethod def fromstring(s): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return CRFInfo._read(ElementTree.fromstring(s)) @staticmethod def _read(etree): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." states = [CRFInfo.State._read(et) for et in etree.findall('states/state')] weight_groups = [CRFInfo.WeightGroup._read(et) for et in @@ -583,6 +617,8 @@ feature_detector) def write(self, filename): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." out = open(filename, 'w') out.write(self.toxml()) out.write('\n') @@ -593,6 +629,8 @@ A description of a single CRF state. """ def __init__(self, name, initial_cost, final_cost, transitions): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." if initial_cost != '+inf': initial_cost = float(initial_cost) if final_cost != '+inf': final_cost = float(final_cost) self.name = name @@ -608,6 +646,8 @@ ' \n' ' \n') def toxml(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." info = self.__dict__.copy() info['transitions'] = '\n'.join(transition.toxml() for transition in self.transitions) @@ -615,6 +655,8 @@ @staticmethod def _read(etree): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." transitions = [CRFInfo.Transition._read(et) for et in etree.findall('transitions/transition')] return CRFInfo.State(etree.get('name'), @@ -636,6 +678,8 @@ which weight groups should be used to calculate the cost of traversing this transition. """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." self.destination = destination self.label = label self.weightgroups = weightgroups @@ -644,12 +688,16 @@ 'destination="%(destination)s" ' 'weightGroups="%(w_groups)s"/>') def toxml(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." info = self.__dict__ info['w_groups'] = ' '.join(wg for wg in self.weightgroups) return self._XML_TEMPLATE % info @staticmethod def _read(etree): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return CRFInfo.Transition(etree.get('destination'), etree.get('label'), etree.get('weightGroups').split()) @@ -680,6 +728,8 @@ this regexp is passed streight to java -- i.e., it must be a java-style regexp! """ + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." if re.search('\s', name): raise ValueError('weight group name may not ' 'contain whitespace.') @@ -695,10 +745,14 @@ _XML_TEMPLATE = (' ') def toxml(self): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return self._XML_TEMPLATE % self.__dict__ @staticmethod def _read(etree): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." return CRFInfo.WeightGroup(etree.get('name'), etree.get('src'), etree.get('dst'), @@ -706,6 +760,8 @@ # [xx] feature name???? def match(self, src, dst): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." # Check if the source matches src_match = self._src_match_cache.get(src) if src_match is None: @@ -734,6 +790,8 @@ def demo(train_size=100, test_size=100, java_home='/usr/local/jdk1.5.0/', mallet_home='/usr/local/mallet-0.4'): + raise NotImplementedError, \ + "This version of NLTK was built without support for Mallet." from nltk.corpus import brown import textwrap diff -Nur -x '*.orig' -x '*~' python-nltk-2.0~b8/setup.py python-nltk-2.0~b8.new/setup.py --- python-nltk-2.0~b8/setup.py 2010-02-05 02:13:10.000000000 -0600 +++ python-nltk-2.0~b8.new/setup.py 2010-02-09 11:45:23.864727719 -0600 @@ -32,7 +32,7 @@ ############################################# ## Package Data - package_data = {'nltk': ['nltk.jar', 'test/*.doctest']}, + package_data = {'nltk': ['test/*.doctest']}, ############################################# ## Package List debian/patches/series0000664000000000000000000000004112137271406012032 0ustar 10-remove-mallet-interface.patch