The Resource Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition, by Jian Xue, (electronic resource)

Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition, by Jian Xue, (electronic resource)

Label
Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition
Title
Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition
Statement of responsibility
by Jian Xue
Title variation
Improvement of decoding engine and phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition
Creator
Contributor
Thesis advisor
Subject
Genre
Language
eng
Summary
In this work, new approaches are proposed for online large vocabulary conversational speech recognition, including a fast confusion network algorithm, novel features and a Random Forests based classifier for word confidence annotation, new improvements in speech decoding speed and latency, novel lookahead phonetic decision tree state tying and Random Forests of phonetic decision tree state tying for acoustic modeling of speech sound units. The fast confusion network algorithm significantly improves the time complexity from O(T3) to O(T), with T equaling the number of links in a word lattice. Several novel features, as well as Random Forests based classification technique are proposed to improve word annotation accuracy for automatic captioning. In order to improve the speed of speech decoding engine, we propose to use complementary word confidence scores to prune uncompetitive search paths, and use subspace distribution clustering hidden Markov modeling to speed up computation of acoustic scores and local confidence scores. We further integrate pre-backtrace in decoding search to significantly reduce captioning latency. In this work we also investigate novel approaches to improve the performance of phonetic decision tree state tying, including two lookahead methods and a Random Forests method. Constrained lookahead method finds an optimal question among n pre-selected questions for each split node to decrease effects of outliers, and it also discounts the contributions of likelihood gains by deeper decedents. Stochastic full lookahead method uses sub-tree size instead of likelihood gain as a measure for phonetic question selection, in order to produce small trees with better generalization capability and consistent with training data. The Random Forests method uses an ensemble of phonetic decision trees to derive a single strong model for each speech unit. We investigate several methods of combining the acoustic scores from multiple models obtained from multiple phonetic decision trees in decoding search. We further propose clustering methods to compact the Random Forests generated acoustic models to speed up decoding search
Cataloging source
MUU
http://library.link/vocab/creatorDate
1975-
http://library.link/vocab/creatorName
Xue, Jian
Degree
Ph. D.
Dissertation year
2007.
Granting institution
University of Missouri-Columbia
Illustrations
illustrations
Index
no index present
Literary form
non fiction
Nature of contents
  • dictionaries
  • bibliography
  • theses
http://library.link/vocab/relatedWorkOrContributorName
Zhao, Yunxin
http://library.link/vocab/subjectName
  • Speech processing systems
  • Coding theory
  • Pattern recognition systems
Target audience
specialized
Label
Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition, by Jian Xue, (electronic resource)
Instantiates
Publication
Note
  • The entire dissertation/thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file (which also appears in the research.pdf); a non-technical general description, or public abstract, appears in the public.pdf file
  • Title from title screen of research.pdf file (viewed on March 4, 2008)
  • Vita
Bibliography note
Includes bibliographical references
Carrier category
online resource
Carrier category code
  • cr
Carrier MARC source
rdacarrier
Color
mixed
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Control code
212818230
Dimensions
unknown
Form of item
electronic
Media category
computer
Media MARC source
rdamedia
Media type code
  • c
Specific material designation
remote
System control number
(OCoLC)212818230
System details
Mode of access: World Wide Web
Label
Improvement of decoding engine & phonetic decision tree in acoustic modeling for online large vocabulary conversational speech recognition, by Jian Xue, (electronic resource)
Publication
Note
  • The entire dissertation/thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file (which also appears in the research.pdf); a non-technical general description, or public abstract, appears in the public.pdf file
  • Title from title screen of research.pdf file (viewed on March 4, 2008)
  • Vita
Bibliography note
Includes bibliographical references
Carrier category
online resource
Carrier category code
  • cr
Carrier MARC source
rdacarrier
Color
mixed
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Control code
212818230
Dimensions
unknown
Form of item
electronic
Media category
computer
Media MARC source
rdamedia
Media type code
  • c
Specific material designation
remote
System control number
(OCoLC)212818230
System details
Mode of access: World Wide Web

Library Locations

  • Health Sciences LibraryBorrow it
    2411 Holmes St, Kansas City, Kansas City, MO, 64108, US
    39.083418 -94.575323
  • LaBudde Special CollectionsBorrow it
    800 E 51st St, Kansas City, MO, 64110, US
    39.034642 -94.576835
  • Leon E. Bloch Law LibraryBorrow it
    500 E. 52nd Street, Kansas City, MO, 64110, US
    39.032488 -94.581967
  • Marr Sound ArchivesBorrow it
    800 E 51st St, Kansas City, MO, 64110, US
    39.034642 -94.576835
  • Miller Nichols LibraryBorrow it
    800 E 51st St, Kansas City, MO, 64110, US
    39.035061 -94.576518
  • UMKCBorrow it
    800 E 51st St, Kansas City, MO, 64110, US
    39.035061 -94.576518
  • UMKCBorrow it
    800 E 51st St, Kansas City, MO, 64110, US
Processing Feedback ...