1. Hugging Face transformers error: Couldn't instantiate the backend tokenizer
ValueError: Couldn't instantiate the backend tokenizer from one of:
(1) a `tokenizers` library serialization file,
(2) a slow tokenizer instance to convert or
(3) an equivalent slow tokenizer class to instantiate and convert.
You need to have sentencepiece installed to convert a slow tokenizer to a fast one.
This looks like sentencepiece is either missing or does not match the installed transformers.
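Remove transformers from the environment so the install starts clean, then run pip install transformers sentencepiece. Installing both in one command lets pip pick a suitable sentencepiece automatically.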
Installing transformers the same way in Colab also avoids the error when loading the tokenizer.
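
Before reinstalling, it can help to confirm which of the two packages is actually missing from the environment. The snippet below is a minimal sketch of that check using only the standard library. The helper names `missing_packages` and `install_hint` are made up for this note and are not part of transformers.

```python
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be found in this environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


def install_hint(names):
    """Build a pip command for whatever is missing, or None if nothing is."""
    missing = missing_packages(names)
    if not missing:
        return None
    return "pip install " + " ".join(missing)


if __name__ == "__main__":
    hint = install_hint(["transformers", "sentencepiece"])
    print(hint or "transformers and sentencepiece are both importable")
```

This only tells you whether the packages can be found, not whether their versions work together, so the clean reinstall above is still the actual fix.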