1. Klimt, B., Yang, Y.: The Enron Corpus: A New Dataset for Email Classification Research. 15th
European Conference on Machine Learning (2004)
2. Yang, Y.: An Evaluation of Statistical Approaches to Text Categorization. Journal
of Information Retrieval, Vol. 1 (1999) 67–88
3. Chakrabarti, S.: Mining the Web: Discovering Knowledge from Hypertext Data.
Morgan Kaufmann Publishers (2002)
4. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.:
Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science,
41(6) (1990) 391–407
5. Berry, M.W., Dumais, S.T., O’Brien, G.W.: Using Linear Algebra for Intelligent
Information Retrieval. SIAM Review 37 (1995) 573–595
6. Hofmann, T.: Probabilistic Latent Semantic Indexing. 22nd Int’l. Conference on
Research and Development in Information Retrieval (1999)
7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. Journal of Machine
Learning Research, 3 (2003) 993–1022
8. Minka, T., Lafferty, J.: Expectation-Propagation for the Generative Aspect Model.
18th Conference on Uncertainty in Artificial Intelligence (2002)
9. Griffiths, T.L., Steyvers, M.: Finding Scientific Topics. Proceedings of the National Academy of
Sciences, 101 (suppl. 1) (2004) 5228–5235
10. Pritchard, J.K., Stephens, M., Donnelly, P.: Inference of Population Structure using
Multilocus Genotype Data. Genetics 155 (2000) 945–959
11. Buntine, W., Perttu, S., Tuulos, V.: Using Discrete PCA on Web Pages.
Proceedings of the Workshop W1 on Statistical Approaches for Web Mining (SAWM).
Italy (2004) 99–110
12. McCallum, A., Corrada-Emmanuel, A., Wang, X.: Topic and Role Discovery in
Social Networks. 19th International Joint Conference on Artificial Intelligence (2005)
13. Steyvers, M., Smyth, P., Rosen-Zvi, M., Griffiths, T.: Probabilistic Author-Topic
Models for Information Discovery. 10th ACM SIGKDD (2004)