Name : R-wordpiece.data
| |
Version : 2.0.0
| Vendor : obs://build_opensuse_org/devel:languages:R
|
Release : lp156.1.1
| Date : 2024-06-24 13:56:19
|
Group : Development/Libraries/Other
| Source RPM : R-wordpiece.data-2.0.0-lp156.1.1.src.rpm
|
Size : 0.29 MB
| |
Packager : https://www_suse_com/
| |
Summary : Data for Wordpiece-Style Tokenization
|
Description :
Provides data to be used by the wordpiece algorithm in order to tokenize text into somewhat meaningful chunks. Included vocabularies were retrieved from < https://huggingface.co/bert-base-cased/resolve/main/vocab.txt> and < https://huggingface.co/bert-base-uncased/resolve/main/vocab.txt> and parsed into an R-friendly format.
|
RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/devel:/languages:/R:/autoCRAN/15.6/x86_64 |