Checks whether cross references (links to targets within the same page) exist

Tags
Iteration
Filled
Level
1
Addresses requirement

Description

Cross references are document-internal links: the href="link-target" attribute of the HTML anchor tag has no protocol prefix such as http, https, ftp, telnet, mailto or file.

Only links with prefix # shall be taken into account, e.g. <a href="#internalLink">.
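
The rule above can be sketched in plain Java. This is only an illustration, not the project's actual checker: the class and method names (CrossReferenceCheck, isCrossReference, findBrokenCrossReferences) are hypothetical, and the sketch assumes that the href values and the id attributes of the page have already been extracted by the HTML parser. Treating a bare "#" as "not a cross reference" is also an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of the documented code base.
final class CrossReferenceCheck {

    // A cross reference is an href that starts with '#' and names a target.
    // Assumption: a bare "#" carries no target and is therefore skipped.
    static boolean isCrossReference(String href) {
        return href != null && href.startsWith("#") && href.length() > 1;
    }

    // Returns every cross reference whose target id is missing from the page.
    static List<String> findBrokenCrossReferences(List<String> hrefs, Set<String> ids) {
        List<String> broken = new ArrayList<>();
        for (String href : hrefs) {
            if (!isCrossReference(href)) {
                continue; // external links (http, mailto, ...) are not checked here
            }
            String target = href.substring(1); // drop the leading '#'
            if (!ids.contains(target)) {
                broken.add(href);
            }
        }
        return broken;
    }
}
```

Given the hrefs "#intro", "#missing" and "https://example.org" on a page whose only id is "intro", the sketch reports "#missing" as broken and ignores the external link.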

Architecture Decisions

Name
Short Description
HTML Parsing with jsoup
To check HTML we parse it into an internal (DOM-like) representation. For this task we use the jsoup HTML parser, an open-source parser without external dependencies.
In the current {revision} we won’t check external links. These checks have been postponed to later versions.
String Similarity Checking with Jaro-Winkler Distance
The small java-string-similarity library (by Ralph Allen Rice) contains implementations of several similarity-calculation algorithms.
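
To illustrate what a Jaro-Winkler comparison computes, here is a minimal, self-contained sketch. It does not use the java-string-similarity library and does not reproduce its API. The class name JaroWinklerSketch is hypothetical, and the prefix scale of 0.1 with a maximum prefix length of 4 is the commonly used default, assumed here.

```java
// Hypothetical stand-alone sketch of the Jaro-Winkler similarity (0.0 to 1.0).
final class JaroWinklerSketch {

    // Plain Jaro similarity: counts matching characters within a sliding
    // window and penalizes matches that appear in a different order.
    static double jaro(String s1, String s2) {
        if (s1.equals(s2)) return 1.0;
        int len1 = s1.length(), len2 = s2.length();
        if (len1 == 0 || len2 == 0) return 0.0;

        int window = Math.max(0, Math.max(len1, len2) / 2 - 1);
        boolean[] matched1 = new boolean[len1];
        boolean[] matched2 = new boolean[len2];
        int matches = 0;
        for (int i = 0; i < len1; i++) {
            int start = Math.max(0, i - window);
            int end = Math.min(len2 - 1, i + window);
            for (int j = start; j <= end; j++) {
                if (!matched2[j] && s1.charAt(i) == s2.charAt(j)) {
                    matched1[i] = true;
                    matched2[j] = true;
                    matches++;
                    break;
                }
            }
        }
        if (matches == 0) return 0.0;

        // Count matched characters that are out of order (half-transpositions).
        int outOfOrder = 0;
        int k = 0;
        for (int i = 0; i < len1; i++) {
            if (!matched1[i]) continue;
            while (!matched2[k]) k++;
            if (s1.charAt(i) != s2.charAt(k)) outOfOrder++;
            k++;
        }
        double m = matches;
        return (m / len1 + m / len2 + (m - outOfOrder / 2.0) / m) / 3.0;
    }

    // Jaro-Winkler: boosts the Jaro score for strings sharing a common prefix.
    // Assumption: scale 0.1 and at most 4 prefix characters.
    static double jaroWinkler(String s1, String s2) {
        double j = jaro(s1, s2);
        int maxPrefix = Math.min(4, Math.min(s1.length(), s2.length()));
        int prefix = 0;
        while (prefix < maxPrefix && s1.charAt(prefix) == s2.charAt(prefix)) {
            prefix++;
        }
        return j + prefix * 0.1 * (1.0 - j);
    }
}
```

A checker could, for example, compare a broken target such as "#internallink" against all ids on the page and suggest the id with the highest score. That usage is an assumption about how the similarity values might be applied, not a description of the existing implementation.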