Multilevel integration of vision and speech understanding using Bayesian networks

Wachsmuth, S.; Brandt-Pook, Hans; Socher, G.; Kummert, F.; Sagerer, G.

S. Wachsmuth, H. Brandt-Pook, G. Socher, F. Kummert, G. Sagerer, in: H. Christensen (Ed.), International Conference on Computer Vision Systems, Springer, 1999, pp. 231–254.

Download

No full text has been uploaded. Publication record only.

DOI

10.1007/3-540-49256-9_15

Book chapter | English

Author

Wachsmuth, S.; Brandt-Pook, Hans; Socher, G.; Kummert, F.; Sagerer, G.

Editor

Christensen, H.

Abstract

The interaction of image and speech processing is a crucial property of multimedia systems. Classical systems that draw inferences on purely qualitative high-level descriptions miss a lot of information when confronted with erroneous, vague, or incomplete data. We propose a new architecture that integrates various levels of processing by using multiple representations of the visually observed scene. They are vertically connected by Bayesian networks in order to find the most plausible interpretation of the scene. The interpretation of a spoken utterance naming an object in the visually observed scene is modeled as another partial representation of the scene. Using this concept, the key problem is the identification of the verbally specified object instances in the visually observed scene. Therefore, a Bayesian network is generated dynamically from the spoken utterance and the visual scene representation. In this network, spatial knowledge as well as knowledge extracted from psycholinguistic experiments is encoded. First results show the robustness of our approach.
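The identification step described in the abstract can be illustrated with a very small sketch. It is not the authors' network: it assumes a naive-Bayes-style structure in which the intended object is a hidden variable and each spoken attribute depends only on the corresponding visual attribute of that object. The function name `posterior_over_objects`, the attribute names, and every probability are hypothetical and invented for illustration; spatial relations, which the paper also encodes, are left out.

```python
# Hypothetical sketch only: structure, names, and numbers are assumptions,
# not taken from the paper.

def posterior_over_objects(scene, utterance, confusion, prior=None):
    """Return P(intended object = i | utterance) for each object in the scene.

    scene     -- list of dicts, one per visually detected object
    utterance -- dict of attribute values recognized from speech
    confusion -- confusion[attr][visual_value][spoken_value] = P(spoken | visual)
    prior     -- optional prior over objects; uniform if omitted
    """
    n = len(scene)
    prior = prior or [1.0 / n] * n
    scores = []
    for p, obj in zip(prior, scene):
        score = p
        for attr, spoken in utterance.items():
            # Small floor so an unseen naming does not zero out a candidate.
            score *= confusion[attr][obj[attr]].get(spoken, 1e-6)
        scores.append(score)
    total = sum(scores)
    return [s / total for s in scores]


# Invented scene: three detected objects with a type and a color label.
scene = [
    {"type": "bolt", "color": "red"},
    {"type": "bolt", "color": "orange"},
    {"type": "cube", "color": "red"},
]

# Invented naming likelihoods, standing in for psycholinguistic knowledge
# (e.g. speakers sometimes call an orange object "red").
confusion = {
    "type": {
        "bolt": {"bolt": 0.9, "cube": 0.1},
        "cube": {"cube": 0.9, "bolt": 0.1},
    },
    "color": {
        "red": {"red": 0.8, "orange": 0.2},
        "orange": {"orange": 0.7, "red": 0.3},
    },
}

# "the red bolt"
utterance = {"type": "bolt", "color": "red"}
post = posterior_over_objects(scene, utterance, confusion)
best = max(range(len(post)), key=post.__getitem__)
```

Because the result is a distribution rather than a hard match, an object whose visual label disagrees with the utterance (here the orange bolt) keeps some probability mass, which is the kind of tolerance to vague or erroneous data the abstract argues for.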

Year of publication

1999

Book title

International Conference on Computer Vision Systems

Pages

231–254

FH-PUB-ID

3588

Cite

Wachsmuth, S. ; Brandt-Pook, Hans ; Socher, G. ; Kummert, F. ; Sagerer, G.: Multilevel integration of vision and speech understanding using Bayesian networks. In: Christensen, H. (Ed.): International Conference on Computer Vision Systems, Vol. 1542 of Lecture Notes in Computer Science : Springer, 1999, pp. 231–254

Wachsmuth S, Brandt-Pook H, Socher G, Kummert F, Sagerer G. Multilevel integration of vision and speech understanding using Bayesian networks. In: Christensen H, ed. International Conference on Computer Vision Systems. Vol. 1542 of Lecture Notes in Computer Science. Springer; 1999:231-254. doi:10.1007/3-540-49256-9_15

Wachsmuth, S., Brandt-Pook, H., Socher, G., Kummert, F., & Sagerer, G. (1999). Multilevel integration of vision and speech understanding using Bayesian networks. In H. Christensen (Ed.), International Conference on Computer Vision Systems (pp. 231–254). Springer. https://doi.org/10.1007/3-540-49256-9_15

@inbook{Wachsmuth_Brandt-Pook_Socher_Kummert_Sagerer_1999, series={Lecture Notes in Computer Science}, volume={1542}, title={Multilevel integration of vision and speech understanding using {B}ayesian networks}, DOI={10.1007/3-540-49256-9_15}, booktitle={International Conference on Computer Vision Systems}, publisher={Springer}, author={Wachsmuth, S. and Brandt-Pook, Hans and Socher, G. and Kummert, F. and Sagerer, G.}, editor={Christensen, H.}, year={1999}, pages={231--254} }

Wachsmuth, S., Hans Brandt-Pook, G. Socher, F. Kummert, and G. Sagerer. “Multilevel Integration of Vision and Speech Understanding Using Bayesian Networks.” In International Conference on Computer Vision Systems, edited by H. Christensen, 231–54. Vol. 1542 of Lecture Notes in Computer Science. Springer, 1999. https://doi.org/10.1007/3-540-49256-9_15.

S. Wachsmuth, H. Brandt-Pook, G. Socher, F. Kummert, and G. Sagerer, “Multilevel integration of vision and speech understanding using Bayesian networks,” in International Conference on Computer Vision Systems, H. Christensen, Ed. Springer, 1999, pp. 231–254.

Wachsmuth, S., et al. “Multilevel Integration of Vision and Speech Understanding Using Bayesian Networks.” International Conference on Computer Vision Systems, edited by H. Christensen, Springer, 1999, pp. 231–54, doi:10.1007/3-540-49256-9_15.
