Publisher review:gen_tree script scans the tree of HTML pages of a web site. This script scans the tree of HTML pages of a web site. (It's not always a tree because circles and loops are possible!)
It starts at the home page of that site (called the "root page" ) and follows all hyperlinks in a recursive descent (width first, in order to produce a representation in the expected way).
Since it scans files in the file system of the host bearing the web site, it is confined to pages lying physically on one host . The web server (HTTP daemon) of the web site is NOT used at all.
Circles and loops are recognized through unique identification of each page by the device and inode numbers of its corresponding file.
gen_tree is a Perl script for Perl Modules scripts design by Steffen Beyer.
It runs on following operating system: Linux / BSD.
gen_tree script scans the tree of HTML pages of a web site.
Operating system:Linux / BSD