• OAI
  • SRU
  • Mapa web
  • Castellano
    • Inglés
AMERICANAE
  • Inicio
  • Presentación
  • Búsqueda
  • Directorio
  • Repositorios OAI
AMERICANAE
Gobierno de España Ministerio de Asuntos Exteriores, Unión Europea y Cooperación Agencia Española de Cooperación Internacional para el Desarrollo
AMERICANAE
  • Inicio
  • Presentación
  • Búsqueda
  • Directorio
  • Repositorios OAI
Está en:  › Datos de registro
Linked Open Data
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
Identificadores del recurso
http://hdl.handle.net/10533/197802
doi: 10.1109/TSA.2005.852994
wos: WOS:000235369100024
issn: 1063-6676
Procedencia
(LA Referencia)

Ficha

Título:
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
Descripción:
A solution to the problem of speech recognition with signals distorted by low-bit rate coders is presented in this paper. A model for the coding-decoding distortion, a HMM compensation method to include this model, and an EM-based adaptation algorithm to estimate this distortion are proposed here. Medium vocabulary continuous-speech speaker-independent recognition experiments with 8 kbps G.729(CS-CELP), 13 kbps RPE-LTP (GSM), 5.3 kbps G723.1, 4.8 kbps FS-1016 and 32 kbps G.726(ADPCM) coders show that the approach described in this paper is able to dramatically reduce the effect of the coding distortion and, in some cases, gives a word accuracy higher than the baseline system with uncoded speech. Finally, the EM estimation algorithm requires only one adapting utterance and the approach described is certainly suitable for dialogue systems where just a few adapting utterances are available.
Fuente:
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
reponame:Artículos CONICYT
instname:CONICYT Chile
instacron:CONICYT
Idioma:
English
Relación:
instname: Conicyt
reponame: Repositorio Digital RI2.0
info:eu-repo/grantAgreement/Fondef/D02I1089
info:eu-repo/semantics/dataset/hdl.handle.net/10533/93477
Autor/Productor:
Becerra-Yoma, Nestor
Silva-Sánchez, Jorge
Busso-Vyhmeister, Carlos
Derechos:
info:eu-repo/semantics/openAccess
Fecha:
2006
Tipo de recurso:
info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
About:
2020-01-27T14:04:19Zhttp://www.openarchives.org/OAI/2.0/oai_dc/Artículos CONICYT - CONICYT Chile

oai_dc

Descargar XML

    <?xml version="1.0" encoding="UTF-8" ?>

  1. <oai_dc:dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">

    1. <dc:title>Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition</dc:title>

    2. <dc:creator>Becerra-Yoma, Nestor</dc:creator>

    3. <dc:creator>Silva-Sánchez, Jorge</dc:creator>

    4. <dc:creator>Busso-Vyhmeister, Carlos</dc:creator>

    5. <dc:creator>Busso-Vyhmeister, Carlos</dc:creator>

    6. <dc:description>A solution to the problem of speech recognition with signals distorted by low-bit rate coders is presented in this paper. A model for the coding-decoding distortion, a HMM compensation method to include this model, and an EM-based adaptation algorithm to estimate this distortion are proposed here. Medium vocabulary continuous-speech speaker-independent recognition experiments with 8 kbps G.729(CS-CELP), 13 kbps RPE-LTP (GSM), 5.3 kbps G723.1, 4.8 kbps FS-1016 and 32 kbps G.726(ADPCM) coders show that the approach described in this paper is able to dramatically reduce the effect of the coding distortion and, in some cases, gives a word accuracy higher than the baseline system with uncoded speech. Finally, the EM estimation algorithm requires only one adapting utterance and the approach described is certainly suitable for dialogue systems where just a few adapting utterances are available.</dc:description>

    7. <dc:date>2006</dc:date>

    8. <dc:type>info:eu-repo/semantics/article</dc:type>

    9. <dc:type>info:eu-repo/semantics/publishedVersion</dc:type>

    10. <dc:identifier>http://hdl.handle.net/10533/197802</dc:identifier>

    11. <dc:identifier>doi: 10.1109/TSA.2005.852994</dc:identifier>

    12. <dc:identifier>wos: WOS:000235369100024</dc:identifier>

    13. <dc:identifier>issn: 1063-6676</dc:identifier>

    14. <dc:language>eng</dc:language>

    15. <dc:relation>instname: Conicyt</dc:relation>

    16. <dc:relation>reponame: Repositorio Digital RI2.0</dc:relation>

    17. <dc:relation>instname: Conicyt</dc:relation>

    18. <dc:relation>reponame: Repositorio Digital RI2.0</dc:relation>

    19. <dc:relation>info:eu-repo/grantAgreement/Fondef/D02I1089</dc:relation>

    20. <dc:relation>info:eu-repo/semantics/dataset/hdl.handle.net/10533/93477</dc:relation>

    21. <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>

    22. <dc:source>IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING</dc:source>

    23. <dc:source>reponame:Artículos CONICYT</dc:source>

    24. <dc:source>instname:CONICYT Chile</dc:source>

    25. <dc:source>instacron:CONICYT</dc:source>

    26. <about>

      1. <provenance>

        1. <originDescription altered="" harvestDate="">

          1. <baseURL />
          2. <identifier />
          3. <datestamp>2020-01-27T14:04:19Z</datestamp>

          4. <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>

          5. <repositoryID />
          6. <repositoryName>Artículos CONICYT - CONICYT Chile</repositoryName>

          </originDescription>

        </provenance>

      </about>

    </oai_dc:dc>

xoai

Descargar XML

    <?xml version="1.0" encoding="UTF-8" ?>

  1. <metadata schemaLocation="http://www.lyncode.com/xoai http://www.lyncode.com/xsd/xoai.xsd">

    1. <element name="dc">

      1. <element name="title">

        1. <element name="none">

          1. <field name="value">Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition</field>

          </element>

        </element>

      2. <element name="creator">

        1. <element name="none">

          1. <field name="value">Becerra-Yoma, Nestor</field>

          2. <field name="value">Silva-Sánchez, Jorge</field>

          3. <field name="value">Busso-Vyhmeister, Carlos</field>

          4. <field name="value">Busso-Vyhmeister, Carlos</field>

          </element>

        </element>

      3. <element name="description">

        1. <element name="none">

          1. <field name="value">A solution to the problem of speech recognition with signals distorted by low-bit rate coders is presented in this paper. A model for the coding-decoding distortion, a HMM compensation method to include this model, and an EM-based adaptation algorithm to estimate this distortion are proposed here. Medium vocabulary continuous-speech speaker-independent recognition experiments with 8 kbps G.729(CS-CELP), 13 kbps RPE-LTP (GSM), 5.3 kbps G723.1, 4.8 kbps FS-1016 and 32 kbps G.726(ADPCM) coders show that the approach described in this paper is able to dramatically reduce the effect of the coding distortion and, in some cases, gives a word accuracy higher than the baseline system with uncoded speech. Finally, the EM estimation algorithm requires only one adapting utterance and the approach described is certainly suitable for dialogue systems where just a few adapting utterances are available.</field>

          </element>

        </element>

      4. <element name="date">

        1. <element name="none">

          1. <field name="value">2006</field>

          </element>

        </element>

      5. <element name="type">

        1. <element name="none">

          1. <field name="value">info:eu-repo/semantics/article</field>

          2. <field name="value">info:eu-repo/semantics/publishedVersion</field>

          </element>

        </element>

      6. <element name="identifier">

        1. <element name="none">

          1. <field name="value">http://hdl.handle.net/10533/197802</field>

          2. <field name="value">doi: 10.1109/TSA.2005.852994</field>

          3. <field name="value">wos: WOS:000235369100024</field>

          4. <field name="value">issn: 1063-6676</field>

          </element>

        </element>

      7. <element name="source">

        1. <element name="none">

          1. <field name="value">IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING</field>

          2. <field name="value">reponame:Artículos CONICYT</field>

          3. <field name="value">instname:CONICYT Chile</field>

          4. <field name="value">instacron:CONICYT</field>

          </element>

        </element>

      8. <element name="relation">

        1. <element name="none">

          1. <field name="value">instname: Conicyt</field>

          2. <field name="value">reponame: Repositorio Digital RI2.0</field>

          3. <field name="value">instname: Conicyt</field>

          4. <field name="value">reponame: Repositorio Digital RI2.0</field>

          5. <field name="value">info:eu-repo/grantAgreement/Fondef/D02I1089</field>

          6. <field name="value">info:eu-repo/semantics/dataset/hdl.handle.net/10533/93477</field>

          </element>

        </element>

      9. <element name="rights">

        1. <element name="none">

          1. <field name="value">info:eu-repo/semantics/openAccess</field>

          </element>

        </element>

      10. <element name="language">

        1. <element name="none">

          1. <field name="value">eng</field>

          </element>

        </element>

      </element>

    2. <element name="bundles" />
    3. <element name="others">

      1. <field name="handle" />
      2. <field name="lastModifyDate">2020-01-27T14:04:19Z</field>

      </element>

    </metadata>

  • Biblioteca AECID
  • Av. Reyes Católicos, nº 4. 28040 Madrid.
  • biblio.cooperacion@aecid.es
  • (+34) 91 583 81 75 - (+34) 91 583 81 64
  • Aviso legal
  • Protección de datos
  • Accesibilidad
  • 
  • Logo Flickr
  • 
  • 
  • 