Bags of words, edit distance (as see in bioinformatics, hamming distances, cunning kernels and vector
spaces over documents. Vector spaces induced by document structures.
Metrics based on generation by finite state machines.
Maybe co-occurrence metrics would also be useful as musical metrics?
If I were to actually write this entry, it would be a big research project.