Automatic KOS based indexing – i.e. indexing based on a
restricted, controlled vocabulary, a thesaurus or a classification – can play
an important role to close the gap between the intellectually, high quality
indexed publications and the mass of unindexed publications. Especially
for unknown, heterogeneous publications, like web publications, simple
processes that do not rely on manually created training data are needed.
With this contribution, we propose a straight-forward linguistic indexer,
that can be used as a basis for own developments and for experiments
and analyses to explore own documents and KOSs; it uses state-of-the-
art information retrieval techniques and hence forms a suitable baseline
for evaluations. Finally, it is free and open source.