Topic Modelling with Latent Dirichlet Allocation (LDA) on Wikipedia Articles

Here is an interactive dashboard showing the learned 45 topics from 33k Wikipedia articles. You can slightly scroll to the right to view the full picture. The left hand side illustrates the topic clusters projected in 2-dimensional space, while the right hand side lists the top terms associated with each topic. You can hover over each topic cluster (left) and view its corresponding most frequent words.

Avatar
Ningyuan (Teresa) Huang
PhD candidate

I am a PhD candidate at Johns Hopkins University. I enjoy telling stories with visualizations and using data science for social good.

Related