You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Short Description
checks if cross references (links referenced within the page) exist
Iteration
Facade
Level
1

Description

Cross references are document-internal links where the href="link-target" from the html anchor tag has no prefix like +http, https, ftp, telnet, mailto, file and such.

Only links with prefix # shall be taken into account, e.g. <a href="#internalLink">.

Architecture Decisions

NameShort Description
HTML Parsing with jsoup
To check HTML we parse it into an internal (DOM-like) representation. For this task we use jsoup HTML parser, an open-source parser without external dependencies.
In the current {revision} we won’t check external links. These checks have been postponed to later versions.
String Similarity Checking with Jaro-Winkler-Distance
The small java-string-similarity library (by Ralph Allen Rice) contains implementations of several similarity-calculation algorithms.
  • No labels