Skip to main content


Showing posts from 2018

Similarity Of Authors

This visualization depicts the similarity of the works of authors in various fields. The proximity of two authors represent the similarity of their works. The closer they are- the more similar their works (with some approximations). A directed red arrow from author A to author B means that - for author A the most similar author was found to be author B. More technically, the works of each author is represented as an n dimensional vector where each dimension represents a word. The co-ordinate for each dimension is determined as the TF(Term Frequency) * IDF (Relative rarity) of each word. The distance used to measure is the CosineDistance ( 1 - CosineSimilarity). For a further exploration of Author Similarity here is a live demo