The folks over at SEOmoz knew they were on to something…
There is a growing body of evidence in SEO to support the idea that “Topic Modeling“, Latent Semantic Indexing (aka LSI) and its hip new cousin, Latent Dirichlet Allocation, represent the missing link in determining how some pages consistently rank higher than other pages when all other things are equal (number and quality of backlinks, for example).
In preparation for release of my latest software product, ClickBump SEO! I’ve been intently studying software applications of LSI in general and more specifically, the science of topic and vector space modeling.
So, it was with great interest, that I approached a post on SEOmoz regarding a new and exciting offshoot of LSI called Latent Dirichlet Allocation. LDA, a really geeky sounding term if there ever was one, can legitimately be described as the single hottest development in search engine optimization that you’ve never heard of.
However, the folks at SEOmoz are about to change that. And in a major way.
They have developed a context-based algorithm for determining search relevance called Latent Dirichlet Allocation or LDA for short (thank Goodness for abbreviations!). What’s absolutely ground breaking though is that their research has shown a strong and unmistakable correlation between LDA and search rankings:
I was particularly struck with this point:
It was particularly gratifying to see these results because this is the exact model I’ve implemented in the topic matching algorithm for ClickBump SEO! And in particular, it confirms what I’ve found to be true in my own experience with topic modeling and content authority rank predictability with the top results on Google for a given search query.
Without boring you too much with the details of what the ClickBump SEO! keyword matching algorithm is doing at a mathematical level, its primary function is to facilitate the process of content rank maximization by assembling the best collection of topic modeled, LSI matching keyphrases from the ordered aggregate top content/page results for a given keyphrase in real time. Essentially, the secret soup collection of terms that are found most often across the top 50 ranked pages for a given query and weighted by result rank and frequency of occurrence.
To see this new LSI matching algorithm in action, lets take a look at ClickBump SEO! for WordPress…