SEARCH
NEW RPMS
DIRECTORIES
ABOUT
FAQ
VARIOUS
BLOG

 
 

libtextcat rpm build for : OpenSuSE. For other distributions click libtextcat.

Name : libtextcat
Version : 2.2 Vendor : openSUSE Build Service
Release : 4.1 Date : 2007-10-27 22:23:01
Group : Development/Languages/C and C++ Source RPM : libtextcat-2.2-4.1.src.rpm
Size : 0.67 MB
Packager : (none)
Summary : Library for text classification
Description :
Libtextcat is a library with functions that implement the classification
technique described in Cavnar & Trenkle, \"N-Gram-Based Text Categorization\"
[1]. It was primarily developed for language guessing, a task on which it is
known to perform with near-perfect accuracy.

The central idea of the Cavnar & Trenkle technique is to calculate a
\"fingerprint\" of a document with an unknown category, and compare this with the
fingerprints of a number of documents of which the categories are known. The
categories of the closest matches are output as the classification. A
fingerprint is a list of the most frequent n-grams occurring in a document,
ordered by frequency. Fingerprints are compared with a simple out-of-place
metric. See the article for more details.

Considerable effort went into making this implementation fast and efficient.
The language guesser processes over 100 documents/second on a simple PC, which
makes it practical for many uses. It was developed for use in our webcrawler
and search engine software, in which it it handles millions of documents a day.

Authors:
--------
Frank Scheelen

RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/server:/search/SLE_10/x86_64

Content of RPM  Changelog  Provides Requires

Hmm ... It's impossible ;-) This RPM doesn't exist on any FTP server

Provides :
libtextcat.so.0()(64bit)
libtextcat

Requires :
rpmlib(PayloadIsBzip2) <= 3.0.5-1
libc.so.6()(64bit)
rpmlib(CompressedFileNames) <= 3.0.4-1
libtextcat.so.0()(64bit)
libc.so.6(GLIBC_2.3.4)(64bit)
libc.so.6(GLIBC_2.2.5)(64bit)
libc.so.6(GLIBC_2.3)(64bit)
rpmlib(PayloadFilesHavePrefix) <= 4.0-1


Content of RPM :
/usr/bin/createfp
/usr/lib64/libtextcat.so.0
/usr/lib64/libtextcat.so.0.0.0
/usr/share/doc/packages/libtextcat
/usr/share/doc/packages/libtextcat/ChangeLog
/usr/share/doc/packages/libtextcat/LICENSE
/usr/share/doc/packages/libtextcat/README
/usr/share/doc/packages/libtextcat/TODO
/usr/share/libtextcat
/usr/share/libtextcat/LM
/usr/share/libtextcat/LM/afrikaans.lm
/usr/share/libtextcat/LM/albanian.lm
/usr/share/libtextcat/LM/amharic-utf.lm
/usr/share/libtextcat/LM/arabic-iso8859_6.lm
/usr/share/libtextcat/LM/arabic-windows1256.lm
/usr/share/libtextcat/LM/armenian.lm
/usr/share/libtextcat/LM/basque.lm
/usr/share/libtextcat/LM/belarus-windows1251.lm
/usr/share/libtextcat/LM/bosnian.lm
/usr/share/libtextcat/LM/breton.lm
/usr/share/libtextcat/LM/bulgarian-iso8859_5.lm
/usr/share/libtextcat/LM/catalan.lm
/usr/share/libtextcat/LM/chinese-big5.lm
/usr/share/libtextcat/LM/chinese-gb2312.lm
/usr/share/libtextcat/LM/croatian-ascii.lm
/usr/share/libtextcat/LM/czech-iso8859_2.lm
/usr/share/libtextcat/LM/danish.lm
/usr/share/libtextcat/LM/drents.lm
/usr/share/libtextcat/LM/dutch.lm
/usr/share/libtextcat/LM/english.lm
There is 135 files more in these RPM.

 
ICM