1. Clone the nltk_data repository to your local machine
git clone https://gitee.com/opennlp/nltk_data.git
2. Enter the directory and check out the NLTK Data branch (gh-pages)
cd nltk_data
git checkout gh-pages
3. Edit index.xml in that directory so the package URLs point to the local server
sed -i 's;s://raw.githubusercontent.com/nltk/nltk_data/gh-pages;://localhost:8000;g' index.xml
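Where `sed -i` is unavailable or behaves differently (for example on macOS), the same substitution could be sketched in Python. This is only an illustration under stated assumptions: the function names `rewrite_index` and `rewrite_index_file` are hypothetical, and the two URL prefixes are taken from the sed command above, assuming every package URL in index.xml starts with https.

```python
from pathlib import Path

# Assumed values, taken from the sed command in step 3.
OLD_PREFIX = "https://raw.githubusercontent.com/nltk/nltk_data/gh-pages"
NEW_PREFIX = "http://localhost:8000"


def rewrite_index(text: str, old: str = OLD_PREFIX, new: str = NEW_PREFIX) -> str:
    """Return the index.xml content with package URLs pointed at the local server."""
    return text.replace(old, new)


def rewrite_index_file(path: str = "index.xml") -> int:
    """Rewrite the file in place and return how many URLs were changed."""
    index_path = Path(path)
    original = index_path.read_text(encoding="utf-8")
    changed = original.count(OLD_PREFIX)
    index_path.write_text(rewrite_index(original), encoding="utf-8")
    return changed


if __name__ == "__main__":
    # Run from inside the nltk_data directory, after checking out gh-pages.
    print(rewrite_index_file("index.xml"), "URLs rewritten")
```

Returning the count makes it easy to notice when nothing matched, for instance if the file had already been rewritten.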
4. Start a Python HTTP server from the same directory
python -m http.server 8000
5. In another terminal, start Python
import nltk
nltk.download()
6. In the downloader window, change the Server Index to http://localhost:8000/index.xml
The download proceeds as follows:
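On a machine without a GUI, steps 5 and 6 could also be done from a script. The sketch below assumes the local server from step 4 is running on port 8000; the helper names `local_index_url` and `download_from_local` are hypothetical, and `punkt` is only an example package id. It relies on `nltk.downloader.Downloader`, which accepts a `server_index_url` argument.

```python
# Assumed address: the local server started in step 4.
LOCAL_SERVER = "http://localhost:8000"


def local_index_url(server: str = LOCAL_SERVER) -> str:
    """Build the Server Index URL for the locally served index.xml."""
    return server.rstrip("/") + "/index.xml"


def download_from_local(package: str = "punkt") -> bool:
    """Download one package through the local index; True means success."""
    # Imported here so the URL helper above works without nltk installed.
    from nltk.downloader import Downloader

    downloader = Downloader(server_index_url=local_index_url())
    return downloader.download(package)


if __name__ == "__main__":
    print(local_index_url())
    download_from_local("punkt")
```

Passing the index URL to the downloader directly avoids changing the Server Index by hand each time.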