URLs (Uniform Resource Locators) are wonderful to work with because they are structured, human-meaningful, globally unique identifiers. But they do have a few thorny edge cases.
There is some subjectivity in this analysis. The exploration in this article goes all the way back to 1987 in an effort to automatically determine the unique identity of a domain’s owner. The work relies primarily on two libraries: the Apache Commons Validator API and Java’s Uniform Resource Identifier (URI) class. The latter is our parser, while the former is, unsurprisingly, our validator.
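To give a feel for the parsing half, here is a minimal sketch in plain Java rather than the Clojure used in the full post. It is not the post's code: the class name `UrlHostSketch` and the method `hostOf` are made up for illustration, and the Apache Commons Validator step is left out so the example has no dependencies. It only shows `java.net.URI` pulling a host out of a URL string and returning nothing for inputs that have none.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Locale;

// Hypothetical helper for illustration only; not taken from the original post.
final class UrlHostSketch {

    /**
     * Returns the lower-cased host of a URL string, or null when the input
     * cannot be parsed or has no host component (for example a mailto: URI).
     */
    static String hostOf(String url) {
        try {
            URI uri = new URI(url.trim());
            String host = uri.getHost();
            // java.net.URI preserves the original case, so normalize it here.
            return host == null ? null : host.toLowerCase(Locale.ROOT);
        } catch (URISyntaxException e) {
            // Input such as "not a url" (contains spaces) is rejected by the parser.
            return null;
        }
    }

    public static void main(String[] args) {
        System.out.println(hostOf("https://www.Example.com/a/b?q=1")); // www.example.com
        System.out.println(hostOf("mailto:someone@example.com"));      // null
        System.out.println(hostOf("not a url"));                       // null
    }
}
```

A host that parses is not necessarily a real, registrable domain, which is where a validator would come in after this step.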
Read the complete post, Turning URLs Into Meaningful Names Using Clojure →