Automatic Speech Recognition

Automatic Speech Recognition (ASR) is the technology that takes what you say into a microphone and transcribes it into text, a form the computer can process more easily. Systems that do automatic closed captioning or dictation use the same kind of technology.


Speech Emotion Recognition (SER), along with the encoding of meaningful prosody information, is still at a very early stage. Our ASR system has very limited emotion understanding. As a result, the AI can usually detect exclamations or questions, but it does not currently change its behavior based on how you say things.


"Don't take that tone with me!" ~Some AI in the future, but not today.