20.19.3. Discussion
The HTML::TableContentParser module converts all tables in the HTML
document into a Perl data structure. As with HTML tables, there are
three layers of nesting in the data structure: the table, the row,
and the data in that row.
Each table, row, and data tag is represented as a hash reference. The
hash keys correspond to attributes of the tag that defined that
table, row, or cell. In addition, the value for a special key gives
the contents of the table, row, or cell. In a table, the value for
the rows key is a reference to an array of rows.
In a row, the cols key points to an array of
cells. In a cell, the data key holds the HTML
contents of the data tag.
For example, take the following table:
<table width="100%" bgcolor="#ffffff">
<tr>
<td>Larry & Gloria</td>
<td>Mountain View</td>
<td>California</td>
</tr>
<tr>
<td><b>Tom</b></td>
<td>Boulder</td>
<td>Colorado</td>
</tr>
<tr>
<td>Nathan & Jenine</td>
<td>Fort Collins</td>
<td>Colorado</td>
</tr>
</table>
[
{
'width' => '100%',
'bgcolor' => '#ffffff',
'rows' => [
{
'cells' => [
{ 'data' => 'Larry & Gloria' },
{ 'data' => 'Mountain View' },
{ 'data' => 'California' },
],
'data' => "\n "
},
{
'cells' => [
{ 'data' => '<b>Tom</b>' },
{ 'data' => 'Boulder' },
{ 'data' => 'Colorado' },
],
'data' => "\n "
},
{
'cells' => [
{ 'data' => 'Nathan & Jenine' },
{ 'data' => 'Fort Collins' },
{ 'data' => 'Colorado' },
],
'data' => "\n "
}
]
}
]