Google is really improving Android dictation capabilities, and while you’re not going to experience Dragon-like dictation, I think you’ll be pleasantly surprised with the results.
STT types what you tell it to
Android’s speech to text engine is quite literal. That means, when you say, “new paragraph,” the STT engine translates and types, “new paragraph.”
This “stupidity” happens for essentially two reasons:
- STT isn’t really designed for dictation (at this time); and
- Google’s interested in conversational context for its STT engine.
STT design
In the grand world of speech to text, dictation — at least as attorney’s think of dictation — takes a very distant place. The reality is that most Android consumers will never use their Android devices with the intent of dictating long, legalese-filled, memos and diatribes. The true purpose, at least in the STT world according to Google, is to translate and send short bursts of text (think text messages or Twitter updates — 140 characters), rather than lengthier discourses.
When STT accomplishes that goal, 95% or more of users are happy.
The conversation is the important factor
Those happy users, sending short bursts of messages, is what Google’s trying to capture. Google’s search engine is all about conversational elements, as opposed to choppy keyword searches.
If you remember conversational, it’s easier to get engaged with STT.
What commands work?
Ultimately, figuring out what commands work, and when, will drive you bat crazy. Here’s a list of usable commands to make STT work a little better.
Here are some commands that are guaranteed to work:
- Period = period (.)
- Comma = comma (,)
- Question mark = question mark (?)
- Exclamation or exclamation point = exclamation point (!)
- Apostrophe = apostrophe (‘)
- Enter or new line = moves to a new line
- New paragraph = new paragraph
- Tab key = tab
- Colon = colon (:)
- Dash = dash (-)
- Ellipsis or dot dot dot = ellipsis (…)
- Ampersand = ampersand (&)
- Asterisk = asterisk (*)
- At sign = at sign (@)
- Backslash = backslash (\)
- Forward slash = forward slash (/)
- Open bracket = open bracket ([)
- Close bracket = closed bracket (])
- Open parenthesis = open parenthesis (()
- Close parenthesis = close parenthesis ())
Making it all work together
I’ve found that the best way to get dictation working is to have a continual conversation with your device. This means that if you pause for long periods of time, usually 2 or more seconds, STT won’t connect your command as a command. This is especially true for commands like “new paragraph” and “new line” or “enter.”
I find that if you pause only slightly, STT will spell out “enter,” rather than execute the command. It’s a really finicky function, but if you do it right dictation works well.
Source: The Droid Lawyer