python312-tokenizers RPM build for: openSUSE Tumbleweed.

Name : python312-tokenizers
Version : 0.20.0
Vendor : obs://build_opensuse_org/science
Release : 1.1
Date : 2024-08-20 09:27:42
Group : Unspecified
Source RPM : python-tokenizers-0.20.0-1.1.src.rpm
Size : 6.70 MB
Packager : (none)
Summary : Provides an implementation of today's most used tokenizers
Description :
Provides an implementation of today's most used tokenizers, with a focus on
performance and versatility.
* Train new vocabularies and tokenize, using today's most used tokenizers.
* Extremely fast (both training and tokenization), thanks to the Rust
implementation. Takes less than 20 seconds to tokenize a GB of text on a
server's CPU.
* Easy to use, but also extremely versatile.
* Designed for research and production.
* Normalization comes with alignments tracking. It's always possible to get the
part of the original sentence that corresponds to a given token.
* Does all the pre-processing: Truncate, Pad, add the special tokens your model
needs.
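
The alignment tracking and truncate/pad steps listed above can be illustrated with a toy, standard-library-only sketch. It does not use the tokenizers package or its API. The function names, the whitespace splitting rule, and the "[PAD]" token are all assumptions made for illustration.

```python
import re


def tokenize_with_offsets(text):
    """Lowercase each whitespace-separated word (the "normalization" here)
    and keep its (start, end) character offsets in the original text."""
    return [
        (match.group().lower(), (match.start(), match.end()))
        for match in re.finditer(r"\S+", text)
    ]


def truncate_and_pad(tokens, length, pad_token="[PAD]"):
    """Cut the sequence to `length` tokens, then right-pad it to exactly
    `length` with `pad_token` (a hypothetical padding token)."""
    clipped = tokens[:length]
    return clipped + [pad_token] * (length - len(clipped))


text = "Hello  Tokenizers World"
pairs = tokenize_with_offsets(text)
tokens = [token for token, _ in pairs]

# The offsets recover the original, un-normalized slice for every token.
originals = [text[start:end] for _, (start, end) in pairs]

short = truncate_and_pad(tokens, 2)
padded = truncate_and_pad(tokens, 5)
```

Because each token carries its offsets, the original casing and spacing remain recoverable even though the token itself was normalized.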

RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/science:/machinelearning/openSUSE_Tumbleweed/i586


Download
ftp.icm.edu.pl  python312-tokenizers-0.20.0-1.1.i586.rpm

Provides :
python3.12dist(tokenizers)
python312-tokenizers
python312-tokenizers(x86-32)
python3dist(tokenizers)

Requires :
ld-linux.so.2
ld-linux.so.2(GLIBC_2.3)
libc.so.6
libc.so.6(GLIBC_2.0)
libc.so.6(GLIBC_2.1)
libc.so.6(GLIBC_2.1.3)
libc.so.6(GLIBC_2.16)
libc.so.6(GLIBC_2.17)
libc.so.6(GLIBC_2.18)
libc.so.6(GLIBC_2.2)
libc.so.6(GLIBC_2.2.4)
libc.so.6(GLIBC_2.25)
libc.so.6(GLIBC_2.28)
libc.so.6(GLIBC_2.3)
libc.so.6(GLIBC_2.3.2)
libc.so.6(GLIBC_2.3.4)
libc.so.6(GLIBC_2.32)
libc.so.6(GLIBC_2.33)
libc.so.6(GLIBC_2.34)
libgcc_s.so.1
libgcc_s.so.1(GCC_3.0)
libgcc_s.so.1(GCC_3.3)
libgcc_s.so.1(GCC_4.2.0)
libm.so.6
libm.so.6(GLIBC_2.0)
libm.so.6(GLIBC_2.1)
libm.so.6(GLIBC_2.29)
libstdc++.so.6
libstdc++.so.6(CXXABI_1.3)
libstdc++.so.6(CXXABI_1.3.8)
libstdc++.so.6(GLIBCXX_3.4)
python(abi) = 3.12
python310-huggingface-hub
python311-huggingface-hub
python312-huggingface-hub
rpmlib(CompressedFileNames) <= 3.0.4-1
rpmlib(FileDigests) <= 4.6.0-1
rpmlib(PartialHardlinkSets) <= 4.0.4-1
rpmlib(PayloadFilesHavePrefix) <= 4.0-1
rpmlib(PayloadIsZstd) <= 5.4.18-1
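
The versioned entries in the Requires list above can be reduced to the highest symbol version needed per library, which indicates the minimum library version a target system would need. The following is a minimal sketch, assuming the `library(TAG_version)` shape seen in the list; the sample entries are copied from it, and the function name and regular expression are illustrative, not part of any RPM tooling.

```python
import re

# A few entries copied from the Requires list above.
REQUIRES = [
    "libc.so.6(GLIBC_2.3.4)",
    "libc.so.6(GLIBC_2.34)",
    "libc.so.6(GLIBC_2.28)",
    "libm.so.6(GLIBC_2.29)",
    "libgcc_s.so.1(GCC_4.2.0)",
    "libstdc++.so.6(GLIBCXX_3.4)",
    "libstdc++.so.6(CXXABI_1.3.8)",
    "python(abi) = 3.12",
]

# Matches entries such as "libc.so.6(GLIBC_2.34)"; other shapes are skipped.
PATTERN = re.compile(r"^(?P<lib>\S+)\((?P<tag>[A-Z]+)_(?P<ver>\d+(?:\.\d+)*)\)$")


def highest_symbol_versions(requires):
    """Return {(library, tag): highest version string} for versioned entries."""
    best = {}
    for entry in requires:
        match = PATTERN.match(entry)
        if match is None:
            continue  # unversioned or differently shaped requirement
        key = (match["lib"], match["tag"])
        # Compare numerically per component, so 2.34 ranks above 2.3.4.
        version = tuple(int(part) for part in match["ver"].split("."))
        if key not in best or version > best[key]:
            best[key] = version
    return {key: ".".join(map(str, ver)) for key, ver in best.items()}


result = highest_symbol_versions(REQUIRES)
```

Versions are compared as integer tuples rather than strings, since a plain string comparison would not reliably order values such as 2.9 and 2.34.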


Content of RPM :
/usr/lib/python3.12/site-packages/tokenizers
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info/INSTALLER
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info/METADATA
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info/RECORD
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info/REQUESTED
/usr/lib/python3.12/site-packages/tokenizers-0.20.0.dist-info/WHEEL
/usr/lib/python3.12/site-packages/tokenizers/__init__.py
/usr/lib/python3.12/site-packages/tokenizers/__init__.pyi
/usr/lib/python3.12/site-packages/tokenizers/__pycache__
/usr/lib/python3.12/site-packages/tokenizers/__pycache__/__init__.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/__pycache__/__init__.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/decoders
/usr/lib/python3.12/site-packages/tokenizers/decoders/__init__.py
/usr/lib/python3.12/site-packages/tokenizers/decoders/__init__.pyi
/usr/lib/python3.12/site-packages/tokenizers/decoders/__pycache__
/usr/lib/python3.12/site-packages/tokenizers/decoders/__pycache__/__init__.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/decoders/__pycache__/__init__.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations
/usr/lib/python3.12/site-packages/tokenizers/implementations/__init__.py
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/__init__.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/__init__.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/base_tokenizer.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/base_tokenizer.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/bert_wordpiece.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/bert_wordpiece.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/byte_level_bpe.cpython-312.opt-1.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/byte_level_bpe.cpython-312.pyc
/usr/lib/python3.12/site-packages/tokenizers/implementations/__pycache__/char_level_bpe.cpython-312.opt-1.pyc
There are 55 more files in this RPM.

 