top of page

Terminology...The Learning Curve

  • Yavneeka Patel
  • Mar 11, 2015
  • 1 min read

In attempting the speech recognition portion, I'm rapidly discovering that current documentation for these programs assume you have quite a bit of background information. Of which, I have about 1% knowledge. So I've started this list to help me keep track of concepts and terminology...

Speech:

PHONES: Similar classes of sound.

DIPHONES: Parts of phones between two consecutive phones. These can be vowels or consonants placed next to each other. The Spanish language has about 800 diphones.

TRIPHONES: Phones that describe specific sounds. Unlike diphones, they are matched with the same range in waveform as just phones.

SENOMES: Small amount of distinct short sound detectors that detect variety in sound. Sphynx uses 4000 distinct short sound detectors to compose detectors for triphones.

FILLERS: Non-linguisting sounds and words (breathing, um, uh)

UTTERANCES: Breaks in speech made by Fillers.

HIDDEN MARKOV MODEL:

Maven:

MIRROR: Where files are downloaded from on the internet. It is a http link. [Seen on Maven download site]


 
 
 

Comments


Featured Posts
Check back soon
Once posts are published, you’ll see them here.
Recent Posts
Archive
Search By Tags
Follow Us
  • Facebook Basic Square
  • Twitter Basic Square
  • Google+ Basic Square
  • Facebook Classic
  • Twitter Classic
  • Google Classic
  • RSS Classic

© 2023 by TOKYO DESIGN. Proudly created with Wix.com

bottom of page