Name : perl-WWW-Robot
Version : 0.026
Vendor : obs://build_opensuse_org/devel:languages:perl
Release : lp155.7.1
Date : 2023-07-20 16:34:26
Group : Development/Libraries/Perl
Source RPM : perl-WWW-Robot-0.026-lp155.7.1.src.rpm
Size : 0.07 MB
Packager : https://www_suse_com/
Summary : configurable web traversal engine (for web robots & agents)
Description :
This module implements a configurable web traversal engine, for a _robot_ or other web agent. Given an initial web page (_URL_), the Robot will get the contents of that page, and extract all links on the page, adding them to a list of URLs to visit.
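The link-extraction step described above can be illustrated with a short Python sketch. This is not the module's Perl implementation or API; the class name `LinkExtractor` and the helper `extract_links` are hypothetical names chosen for illustration, and only Python standard-library calls are used.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # Anchors without an href (e.g. <a name="x">) are ignored.
            if name == "href" and value:
                self.links.append(urljoin(self.base_url, value))


def extract_links(base_url, html):
    """Return absolute URLs for all links found in an HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links
```

A traversal engine would append the returned URLs to its list of pages still to visit.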
Features of the Robot module include:
* Follows the _Robot Exclusion Protocol_.
* Supports the META element proposed extensions to the Protocol.
* Implements many of the _Guidelines for Robot Writers_.
* Configurable.
* Builds on standard Perl 5 modules for WWW, HTTP, HTML, etc.
A particular application (robot instance) has to configure the engine using _hooks_, which are Perl functions invoked by the Robot engine at specific points in the control loop.
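As a rough illustration of the hook idea, here is a minimal Python sketch of a traversal loop that calls application-supplied functions at fixed points. It is an assumption-laden toy, not the WWW::Robot interface: the class `TinyRobot`, the hook names `should_follow` and `on_page`, and the in-memory `SITE` table are all hypothetical, and no network access takes place.

```python
from collections import deque


class TinyRobot:
    """Toy traversal engine; class and hook names are hypothetical."""

    def __init__(self, fetch):
        self.fetch = fetch  # callable: url -> list of URLs linked from it
        self.hooks = {}

    def add_hook(self, name, func):
        self.hooks.setdefault(name, []).append(func)

    def _call(self, name, *args):
        # Invoke every function registered under this hook name.
        return [func(self, *args) for func in self.hooks.get(name, [])]

    def run(self, start_url):
        queue = deque([start_url])
        seen = {start_url}
        while queue:
            url = queue.popleft()
            # Visit the URL only if every "should_follow" hook approves.
            if not all(self._call("should_follow", url)):
                continue
            links = self.fetch(url)
            self._call("on_page", url, links)
            for link in links:
                if link not in seen:
                    seen.add(link)
                    queue.append(link)


# Hypothetical in-memory "web": each URL maps to the links on that page.
SITE = {
    "http://example.com/": ["http://example.com/a", "http://other.example/x"],
    "http://example.com/a": ["http://example.com/"],
}

visited = []
robot = TinyRobot(fetch=lambda url: SITE.get(url, []))
robot.add_hook("should_follow",
               lambda robot, url: url.startswith("http://example.com/"))
robot.add_hook("on_page", lambda robot, url, links: visited.append(url))
robot.run("http://example.com/")
```

In this sketch the engine owns the loop and the queue, while the application decides through its hooks which URLs are followed and what happens to each page.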
The robot engine obeys the Robot Exclusion Protocol, as well as a proposed addition. See the SEE ALSO section of the module's documentation for references to documents describing the Robot Exclusion Protocol and web robots.
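To show what obeying the Robot Exclusion Protocol means in practice, the following Python sketch checks URLs against a `robots.txt` body using the standard-library `urllib.robotparser`. It does not show how this module performs the check; the rules in `ROBOTS_TXT`, the helper `allowed`, and the agent name `ExampleBot` are assumptions made for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; a real robot would fetch it from the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if the given rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

With these rules, `allowed("ExampleBot", "http://example.com/private/data.html")` is refused, while a URL outside `/private/` is permitted.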
RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/devel:/languages:/perl:/CPAN-W/15.5/noarch