
Checks whether the targets of cross references (links to targets within the same page) exist.

Addresses requirement


Cross references are document-internal links: the href="link-target" value of the HTML anchor tag has no protocol prefix such as http, https, ftp, telnet, mailto or file.

Only links with prefix # shall be taken into account, e.g. <a href="#internalLink">.
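The rule above can be illustrated with a minimal sketch that uses only the Java standard library. The class name CrossReferenceSketch and both method names are hypothetical and are not taken from the checker itself; the sketch assumes that the href values and the ids present in the page have already been collected.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration, not the checker's actual implementation.
public class CrossReferenceSketch {

    /** True only for document-internal links such as "#internalLink". */
    public static boolean isCrossReference(String href) {
        return href != null && href.startsWith("#");
    }

    /**
     * Returns every cross reference whose target id does not occur in the page.
     * Links with other prefixes (http, mailto, ...) are ignored.
     * Note: a bare "#" has an empty target and is reported as missing here.
     */
    public static List<String> findMissingTargets(List<String> hrefs, Set<String> idsInPage) {
        List<String> missing = new ArrayList<>();
        for (String href : hrefs) {
            if (!isCrossReference(href)) {
                continue; // not a cross reference, out of scope
            }
            String target = href.substring(1); // strip the leading '#'
            if (!idsInPage.contains(target)) {
                missing.add(href);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        List<String> hrefs = List.of(
                "#intro", "https://example.org", "#missing", "mailto:a@example.org");
        Set<String> ids = Set.of("intro", "summary");
        System.out.println(findMissingTargets(hrefs, ids)); // prints [#missing]
    }
}
```

Only "#missing" is reported: "#intro" has a matching id, and the two prefixed links are not cross references.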

Architecture Decisions

Name Short Description
To check HTML we parse it into an internal (DOM-like) representation. For this task we use the jsoup HTML parser, an open-source parser without external dependencies.
In the current {revision} we won’t check external links. These checks have been postponed to later versions.
The small java-string-similarity library (by Ralph Allen Rice) contains implementations of several similarity-calculation algorithms.
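As a rough illustration of the first decision (parsing with jsoup), the following sketch shows how a cross-reference check might look on top of the parsed document. It assumes jsoup is on the classpath; the class and method names are hypothetical, and the sketch only treats id attributes as link targets, which is a simplifying assumption rather than the checker's confirmed behaviour.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration, not the checker's actual implementation.
public class JsoupCrossReferenceSketch {

    /** Returns the href of every internal link whose target id is absent. */
    public static List<String> brokenCrossReferences(String html) {
        Document doc = Jsoup.parse(html); // DOM-like representation
        List<String> broken = new ArrayList<>();
        for (Element link : doc.select("a[href]")) {
            String href = link.attr("href");
            if (!href.startsWith("#")) {
                continue; // external or prefixed link: not checked here
            }
            String target = href.substring(1);
            // Assumption: only id attributes count as targets in this sketch.
            if (target.isEmpty() || doc.getElementById(target) == null) {
                broken.add(href);
            }
        }
        return broken;
    }

    public static void main(String[] args) {
        String html = "<h1 id=\"intro\">Intro</h1>"
                + "<a href=\"#intro\">ok</a>"
                + "<a href=\"#nope\">broken</a>"
                + "<a href=\"https://example.org\">external</a>";
        System.out.println(brokenCrossReferences(html)); // prints [#nope]
    }
}
```

Filtering on the "#" prefix in Java, rather than in the selector, keeps the sketch close to the wording of the requirement.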