Simon/Contribute Data

Revision as of 14:20, 10 August 2013 by Bedahr (talk | contribs) (Created page with "To build a speech recognition system, several types of data files are required: * A phonetic dictionary to learn how words are pronounced * Transcribed audio samples to learn ...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

To build a speech recognition system, several types of data files are required:

  • A phonetic dictionary to learn how words are pronounced
  • Transcribed audio samples to learn how a human pronounces the phonetic elements from the dictionary (phones)
  • Large corpora of written text to learn what word structures commonly co-occur (provides context for the recognizer)

Content is available under Creative Commons License SA 4.0 unless otherwise noted.