Now showing items 1-6 of 6
Detection of Target Speakers in Audio Databases
The problem of speaker detection in audio databases is addressed in this paper. Gaussian mixture modeling is used to build target speaker and background models. A detection algorithm based on a likelihood ratio calculation ...
Empirical Mode Decomposition Based Frequency Attributes
This paper describes a new technique, called <i>Empirical Mode Decomposition</i> (EMD), which allows the decomposition of one-dimensional signals into intrinsic oscillatory modes. Each component, called <i>Intrinsic Mode ...
Speaker Detection in Broadcast Speech Databases
Experiments have been carried out to assess the feasibility of detecting target speaker segments in multi-speaker broadcast databases. The experiemental database consists of NBC Nightly News broadcasts. The target speaker ...
An Overview of the AT&T Spoken Document Retrieval System
We present an overview of a spoken document retrieval system developed at AT&T Labs-Research for the HUB4 Broadcast News corpus. This overview includes a description of the intonational phrase boundary detection, ...
SCAN - Speech Content Based Audio Navigator: A Systems Overview
SCAN (Speech Content based Audio Navigator) is a spoken document retrieval system integrating speaker-independent, large-vocabulary speech recognition with information-retrieval to support query-based retrieval of information ...
Second-Order Statistical Measures for Text-Independent Speaker Identification
This article presents an overview of several measures for speaker recognition. These measures relate to second-order statistical tests, and can be expressed under a common formalism. Alternate formulations of these measures ...