dc.contributor.author Rosenberg, Aaron
Magrin-Chagnolleau, Ivan
Parthasarathy, S.
dc.creator Rosenberg, Aaron
Magrin-Chagnolleau, Ivan
Parthasarathy, S.
dc.date.accessioned 2007-10-31T01:03:05Z
dc.date.available 2007-10-31T01:03:05Z
dc.date.issued 1998-01-15
dc.date.submitted 1998-01-15
dc.identifier.uri https://hdl.handle.net/1911/20304
dc.description Conference Paper
dc.description.abstract Experiments have been carried out to assess the feasibility of detecting target speaker segments in multi-speaker broadcast databases. The experimental database consists of NBC Nightly News broadcasts. The target speaker is the news anchor, Tom Brokaw. Gaussian mixture models are constructed from labelled training data for the target speaker, along with background models for other speakers, commercials, and music. Four labelled 30-min. broadcasts are used for testing. Mel-frequency cepstral features, augmented by delta cepstral features, are calculated over 20 ms windows shifted every 10 ms through a broadcast. Likelihood ratio scores are calculated for each test frame and averaged over blocks of frames of a specified duration. The block scores are input to a detection routine which returns estimates of target segment boundaries. The best results obtained over the test broadcasts range from 82% to 100% detection of target segments, with segment frame accuracy ranging from 86% to 95%. Between zero and two false-alarm segments are detected in each 30-min. broadcast.
dc.language.iso eng
dc.subject Temporary
dc.subject.other Signal Processing Applications
dc.title Speaker Detection in Broadcast Speech Databases
dc.type Conference paper
dc.date.note 2004-01-14
dc.citation.bibtexName inproceedings
dc.date.modified 2004-11-04
dc.contributor.org Digital Signal Processing (http://dsp.rice.edu/)
dc.subject.keyword Temporary
dc.citation.conferenceName Proceedings of International Conference on Spoken Language Processing
dc.type.dcmi Text
dc.identifier.citation A. Rosenberg, I. Magrin-Chagnolleau and S. Parthasarathy, "Speaker Detection in Broadcast Speech Databases," 1998.
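
The scoring chain summarized in the abstract (per-frame likelihood ratio, averaging over blocks of frames, then a detection step) can be illustrated with a minimal sketch. This is not the authors' implementation: the single-component diagonal Gaussian models, the synthetic 2-dimensional "features", the non-overlapping 100-frame blocks, and the simple threshold used as the detection routine are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import numpy as np


def gmm_frame_loglik(frames, weights, means, variances):
    """Per-frame log-likelihood under a diagonal-covariance GMM.

    frames: (T, D); weights: (K,); means and variances: (K, D).
    """
    diff = frames[:, None, :] - means[None, :, :]                      # (T, K, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (K,)
    log_comp = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)  # (T, K)
    log_comp = log_comp + np.log(weights)
    # Log-sum-exp over mixture components, stabilized by the row maximum.
    m = log_comp.max(axis=1, keepdims=True)
    return (m + np.log(np.exp(log_comp - m).sum(axis=1, keepdims=True)))[:, 0]


def block_average(scores, block_len):
    """Average frame scores over consecutive non-overlapping blocks (assumed layout)."""
    n_blocks = len(scores) // block_len
    return scores[: n_blocks * block_len].reshape(n_blocks, block_len).mean(axis=1)


def detect_segments(block_scores, threshold):
    """Return (start_block, end_block) pairs, end exclusive, where score > threshold.

    A plain threshold stands in for the unspecified detection routine.
    """
    segments, start = [], None
    for i, above in enumerate(block_scores > threshold):
        if above and start is None:
            start = i
        elif not above and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(block_scores)))
    return segments


# Hypothetical models: one Gaussian each for target and background, 2-D features.
target = (np.array([1.0]), np.zeros((1, 2)), np.ones((1, 2)))
background = (np.array([1.0]), np.full((1, 2), 3.0), np.ones((1, 2)))

# Synthetic "broadcast": 200 background frames, 300 target frames, 200 background frames.
rng = np.random.default_rng(0)
frames = np.vstack([
    rng.normal(3.0, 1.0, size=(200, 2)),
    rng.normal(0.0, 1.0, size=(300, 2)),
    rng.normal(3.0, 1.0, size=(200, 2)),
])

# Frame-level log-likelihood ratio, then block averaging and detection.
llr = gmm_frame_loglik(frames, *target) - gmm_frame_loglik(frames, *background)
block_scores = block_average(llr, block_len=100)  # 100 frames = 1 s at a 10 ms shift
segments = detect_segments(block_scores, threshold=0.0)
print(segments)
```

Averaging over blocks trades boundary resolution for score stability: longer blocks smooth out frame-level noise but can only place a segment boundary to within one block length.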


This item appears in the following Collection(s)

  • DSP Publications [508]
    Publications by Rice Faculty and graduate students in digital signal processing.
  • ECE Publications [1473]
    Publications by Rice University Electrical and Computer Engineering faculty and graduate students
