ludwig 无法使用auto_transformer的分词器,

s1ag04yj 于 2个月前发布在其他

关注(0)|答案(4)|浏览(28)

我想使用 this model 作为编码器。从描述中可以看出，模型可以像这样上传：

model = AutoModel.from_pretrained("ibm/MoLFormer-XL-both-10pct", deterministic_eval=True, trust_remote_code=True) 
tokenizer = AutoTokenizer.from_pretrained("ibm/MoLFormer-XL-both-10pct", trust_remote_code=True)

我尝试使用

encoder: auto_transformer
   pretrained_model_name_or_path: ibm/MoLFormer-XL-both-10pct

加载它，结果是 RuntimeError: Caught exception during model preprocessing: Tokenizer class MolformerTokenizer does not exist or is not currently imported. 。这并不令人惊讶，因为这个模型没有使用特定的 MolformerTokenizer ,而是使用了 AutoTokenizer 。
然而，文档中说的是 "If a text feature's encoder specifies a huggingface model, then the tokenizer for that model will be used automatically." 。我该如何为这个模型加载分词器？

来源：https://github.com/ludwig-ai/ludwig/issues/3894

4条答案

按热度按时间

我发现问题出在trust_remote_code上，它也是加载分词器的强制性要求。
另请参阅#3632。

赞(0）回复(0）举报 2个月前

你好，@sergsb,
感谢你分享你的经验。
Ludwig团队专注于为HF上原生支持的模型提供一流的支持。据我了解，支持需要trust_remote_code=True的模型是可行的，但也存在其他风险需要仔细考虑。
CC:@arnavgarg1

赞(0）回复(0）举报 2个月前

你好，@justinxzhao,
感谢你的回答。或许可以考虑引入一个全局配置参数trust_remote_code,并将其设置为HF模型和分词器？

赞(0）回复(0）举报 2个月前

@sergsb,这对我来说似乎是合理的。我认为这是@arnavgarg1在#3632中，特别是在这里所要表达的意思。

赞(0）回复(0）举报 2个月前

相关问题

热门标签

Java query python Node 开发语言 request Util 数据库 Table 后端算法 Logger Message Element Parser

最新问答

xxl-job 安全组扫描到执行器端口服务存在信息泄露漏洞
回答(1) 发布于 22天前
xxl-job 不能和nacos兼容？
回答(3) 发布于 22天前
xxl-job 任务执行完后无法结束，日志一直转圈
回答(3) 发布于 22天前
xxl-job-admin页面上查看调度日志样式问题
回答(1) 发布于 22天前
xxl-job 参数512字符限制能否去掉
回答(1) 发布于 22天前