SourceForge
Logo

Download

Public Forums

Bug Tracking

InSite

InSite is a free site management tool written in Perl.

View a sample report.

Features:

  • fast local link verification: 76,000 documents containing 1.5 million links were verified in 2 hours on a PII 300 with 512 MB RAM running RedHat Linux 6.2 with all web files mounted via NFS over 100 Mbps Ethernet (the 1.3 million local links took 95 minutes to verify -- 230 links per second)
  • remote link verification and classification by error type
  • lists all mailto links' email addresses and frequency of use
  • provides email alerts for any errors found on user- defined "critical pages"
  • calculates download sizes for critical pages
  • scans pages for the presence or absence of red flags, user-defined text strings (defined using powerful Perl regular expressions)
  • runs CGI programs and validates output
  • full report of all broken links, including the document in which the link was found and the document to which the link was pointing.
  • provides summary statistics:
    • number of links per document by link type (A, IMG, APPLET, etc.) and number and percentage of each that are broken
    • number of documents by file extension
    • page size breakdown (how many documents are between 1K and 5K? how many are between 50K and 100K?)
  • builds a simple text database of all parsed documents and the links they contain so you can quickly find all links to a particular document
  • can be configured to ignore subsets of the site and to ignore specified remote links
  • can validate user pages (translates URLs of the form "/~username")
  • understands Apache-style SSIs

new in version 2.0

  • parallel remote link checking. Using 25 parallel clients, we can validate 78,000 remote links (6600 unique URLs) in 22 minutes!
  • simple installation script
  • better handling of servers with poor HEAD implementations
  • @error_monitors array,a list of e-mail addresses to receive a list of local errors (and remote errors designated in @serious_remote_errors); the list is divided evenly among the recipients so that each can work to correct broken links independently
  • find_orphans, a program to scour your Web documents tree looking for unused files

Requires libwww and MIME::Lite (available at any CPAN mirror).