CONF
kurimo-icassp00b/IDIAP
Fast latent semantic indexing of spoken documents by using self-organizing maps
Kurimo, Mikko
EXTERNAL
https://publications.idiap.ch/attachments/papers/2000/kurimo_icassp00.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/kurimo-icassp00
Related documents
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000
2000
Istanbul, Turkey
June 2000
IDIAP-RR 99-20
This paper describes a new latent semantic indexing (LSI) method for spoken audio documents. The framework is indexing broadcast news from radio and TV as a combination of large vocabulary continuous speech recognition (LVCSR,',','),
natural language processing (NLP) and information retrieval (IR). For indexing, the documents are presented as vectors of word counts, whose dimensionality is rapidly reduced by random mapping (RM). The obtained vectors are projected into the latent semantic subspace determined by SVD, where the vectors are then smoothed by a self-organizing map (SOM). The smoothing by the closest document clusters is important here, because the documents are often short and have a high word error rate (WER). As the clusters in the semantic subspace reflect the news topics, the SOMs provide an easy way to visualize the index and query results and to explore the database. Test results are reported for TREC's spoken document retrieval databases.
REPORT
kurimo-icassp00/IDIAP
Fast latent semantic indexing of spoken documents by using self-organizing maps
Kurimo, Mikko
EXTERNAL
https://publications.idiap.ch/attachments/reports/1999/rr99-20.pdf
PUBLIC
Idiap-RR-20-1999
1999
IDIAP
Published in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000, Istanbul, Turkey, 2000
This paper describes a new latent semantic indexing (LSI) method for spoken audio documents. The framework is indexing broadcast news from radio and TV as a combination of large vocabulary continuous speech recognition (LVCSR,',','),
natural language processing (NLP) and information retrieval (IR). For indexing, the documents are presented as vectors of word counts, whose dimensionality is rapidly reduced by random mapping (RM). The obtained vectors are projected into the latent semantic subspace determined by SVD, where the vectors are then smoothed by a self-organizing map (SOM). The smoothing by the closest document clusters is important here, because the documents are often short and have a high word error rate (WER). As the clusters in the semantic subspace reflect the news topics, the SOMs provide an easy way to visualize the index and query results and to explore the database. Test results are reported for TREC's spoken document retrieval databases.