SEARCH
NEW RPMS
DIRECTORIES
ABOUT
FAQ
VARIOUS
BLOG

 
 

perl-Text-Similarity rpm build for : OpenSuSE. For other distributions click perl-Text-Similarity.

Name : perl-Text-Similarity
Version : 0.13 Vendor : obs://build_opensuse_org/devel:languages:perl
Release : lp155.1.1 Date : 2023-07-20 18:39:17
Group : Development/Libraries/Perl Source RPM : perl-Text-Similarity-0.13-lp155.1.1.src.rpm
Size : 0.12 MB
Packager : https://www_suse_com/
Summary : Measure the pair-wise Similarity of Files or Strings
Description :
This module is a superclass for other modules and provides generic services
such as stop word removal, compound identification, and text cleaning or
sanitizing.

It\'s important to realize that additional methods of measuring similarity
can be added to this package. Text::Similarity::Overlaps is just one
possible way of measuring similarity, others can be added.

Subroutine sanitizeString carries out text cleaning. Briefly, it removes
nearly all punctuation except for underscores and embedded apostrophes,
converts all text to lower case, and collapes multiple white spaces to a
single space.

This module is where compounds are identified (although currently
disabled). When implemented it will check a list of compounds provided by
the user, and then when a compound is found in the text it will be
desigated via an underscore (e.g., white house might be converted to
white_house).

Stop words are removed here. The length of the documents reported does not
include the stop words. Overlaps are found after stopword removal. By
including a word in the stoplist, you are saying that the word never
existed in your input (in effect).

RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/devel:/languages:/perl:/CPAN-T/15.5/noarch

Content of RPM  Provides Requires

Download
ftp.icm.edu.pl  perl-Text-Similarity-0.13-lp155.1.1.noarch.rpm
     

Provides :
perl(Text::OverlapFinder)
perl(Text::Similarity)
perl(Text::Similarity::Overlaps)
perl-Text-Similarity

Requires :
/usr/bin/perl
perl(:MODULE_COMPAT_5.26.1)
rpmlib(CompressedFileNames) <= 3.0.4-1
rpmlib(FileDigests) <= 4.6.0-1
rpmlib(PayloadFilesHavePrefix) <= 4.0-1
rpmlib(PayloadIsXz) <= 5.2-1


Content of RPM :
/usr/bin/text_similarity.pl
/usr/lib/perl5/vendor_perl/5.26.1/Text
/usr/lib/perl5/vendor_perl/5.26.1/Text/OverlapFinder.pm
/usr/lib/perl5/vendor_perl/5.26.1/Text/Similarity
/usr/lib/perl5/vendor_perl/5.26.1/Text/Similarity.pm
/usr/lib/perl5/vendor_perl/5.26.1/Text/Similarity/Overlaps.pm
/usr/lib/perl5/vendor_perl/5.26.1/x86_64-linux-thread-multi
/usr/share/doc/packages/perl-Text-Similarity
/usr/share/doc/packages/perl-Text-Similarity/CHANGES
/usr/share/doc/packages/perl-Text-Similarity/FDL.txt
/usr/share/doc/packages/perl-Text-Similarity/GPL.txt
/usr/share/doc/packages/perl-Text-Similarity/README
/usr/share/doc/packages/perl-Text-Similarity/doc
/usr/share/doc/packages/perl-Text-Similarity/doc/CHANGES.pod
/usr/share/doc/packages/perl-Text-Similarity/doc/INSTALL.pod
/usr/share/doc/packages/perl-Text-Similarity/doc/README.pod
/usr/share/doc/packages/perl-Text-Similarity/doc/update-pod.sh
/usr/share/doc/packages/perl-Text-Similarity/samples
/usr/share/doc/packages/perl-Text-Similarity/samples/stoplist-nsp.regex
/usr/share/doc/packages/perl-Text-Similarity/samples/stoplist.txt
/usr/share/man/man1/text_similarity.pl.1.gz
/usr/share/man/man3/Text::OverlapFinder.3pm.gz
/usr/share/man/man3/Text::Similarity.3pm.gz
/usr/share/man/man3/Text::Similarity::Overlaps.3pm.gz

 
ICM