Download
Public Forums
Bug Tracking
|
InSite
InSite is a free site management tool written in Perl.
View a sample report.
Features:
- fast local link verification: 76,000 documents containing
1.5 million links were verified in 2 hours
on a PII 300 with 512 MB RAM
running RedHat Linux 6.2 with all web files mounted
via NFS over 100 Mbps Ethernet (the 1.3 million local links
took 95 minutes to verify -- 230 links per second)
- remote link verification and classification by error type
- lists all mailto links' email addresses and frequency of use
- provides email alerts for any errors found on user-
defined "critical pages"
- calculates download sizes for critical pages
- scans pages for the presence or absence of red flags,
user-defined text strings (defined using powerful Perl
regular expressions)
- runs CGI programs and validates output
- full report of all broken links, including the document
in which the link was found and the document to which
the link was pointing.
- provides summary statistics:
- number of links per document by link type
(A, IMG, APPLET, etc.) and number and percentage
of each that are broken
- number of documents by file extension
- page size breakdown (how many documents are between
1K and 5K? how many are between 50K and 100K?)
- builds a simple text database of all parsed documents
and the links they contain so you can quickly find all
links to a particular document
- can be configured to ignore subsets of the site and to
ignore specified remote links
- can validate user pages (translates URLs of the form
"/~username")
- understands Apache-style SSIs
new in version 2.0
- parallel remote link
checking. Using 25 parallel clients, we can validate
78,000 remote links (6600 unique URLs) in 22 minutes!
- simple installation script
- better handling of servers with poor HEAD implementations
- @error_monitors array,a list of e-mail addresses to receive a list of
local errors (and remote errors designated in @serious_remote_errors);
the list is divided evenly among the recipients so that each can work
to correct broken links independently
- find_orphans, a program to scour your Web documents tree looking
for unused files
Requires libwww and MIME::Lite (available at any CPAN mirror).
|