This page represents the corpus in terms of inter-document distances. Distance in this case refers to the Helsinger distance between documents, based on their topic mixtures. Documents with similar topic mixtures—similar topics in similar concentrations—are closer than those with different mixtures. The graph on the right shows the distribution of distances, the mode being 0.76.

01,8003,6005,4007,2009,00010,80012,60014,40016,20018,00019,80021,60023,40025,20027,00028,80030,60032,40034,20036,0000.00.10.20.30.40.50.60.70.80.91.0Helinger distancenumber of document dyads

Most Connected Documents

Top 100 documents with the lowest average Helsinger distance to other documents. These tend to have low topic entropy.

Most Lonely (Least Connected) Documents

Top 100 documents with the highest average Helsinger distance to other documents. These tend to have high topic entropy.