Name : perl-HTML-TableContentParser
| |
Version : 0.305
| Vendor : obs://build_opensuse_org/devel:languages:perl
|
Release : lp154.1.1
| Date : 2023-01-27 16:37:48
|
Group : Unspecified
| Source RPM : perl-HTML-TableContentParser-0.305-lp154.1.1.src.rpm
|
Size : 0.02 MB
| |
Packager : https://www_suse_com/
| |
Summary : Do interesting things with the contents of tables
|
Description :
This package parses tables out of HTML. The return from the parse is a reference to an array containing the tables found.
Tables appear in the output in the order in which they are encountered. If a table is nested inside a cell of another table, it will appear after the containing table in the output, and any connection between the two will be lost. As of version 0.200_01, the appearance of a nested table should not cause any truncation of the containing table.
The following tags are processed by this module: \'< table>\', \'< caption>\', \'< tr>\', \'< th>\', and \'< td>\'. In the return from the parse method, each tag is represented by a hash reference, having the tag\'s attributes as keys, and the attribute values as values. In addition, the following keys will be provided:
* \'< table>\'
* caption
the \'< caption>\' tag, if any
* headers
a reference to an array containing all the \'< th>\' tags, in the order encountered
* rows
a reference to an array containing all the \'< tr>\' tags, in the order encountered
* \'< caption>\'
* data
the content of the \'< caption>\' tag
* \'< tr>\'
* cells
a reference to an array containing all the \'< td>\' tags, in the order encountered, with \'undef\' representing any \'< th>\' tags encountered. Trailing \'undef\' values will be dropped, and the entire key will be absent unless actual \'< td>\' tags are found in the row.
Note that prior to version 0.299_01, \'< th>\' tags were not represented at all.
* headers
new with version 0.299_01, this is a reference to an array containing all the \'< th>\' tags in the row, in the order encountered, with \'undef\' representing any \'< td>\' tags. Trailing \'undef\' values will be dropped, and the entire key will be absent unless actual \'< th>\' tags are found in the row.
It is the understanding of the current author (TRW) that in valid HTML \'< th>\' tags must occur inside a \'< tr>\' element, so they need to be recognized there, rather than (or in addition to) in isolation.
* \'< th>\'
* data
the content of the \'< th>\' tag
* \'< td>\'
* data
the content of the \'< td>\' tag
|
RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/devel:/languages:/perl:/CPAN-H/15.4/noarch |