Here are some highlights.
Lots of papers on LSA-LDA. I am fascinating by the elegance and non-supervision(?) of this technique.
•Link-PLSA-LDA: A new unsupervised model for topics and influence of blogs
•The Psychology of Word Use in Depression Forums in English and in Spanish
This is a conference where all papers are about solving a fixed set of problems on a given set of data. So, everybody understands 100% of all papers and fully engaged.
•Document Representation and Query Expansion Models for Blog Recommendation basically, the winner of Blog'07
•On TREC Blog Track. The dataset is available for a nominal fee of 400 pounds.
With over 200 world languages and the most widely used language being oddly the simplest one, I thought that good language coverage can be achieved only by teams of zillion people. However, there are some tricks with machine translation:

0 comments:
Post a Comment