Topic: Microphone, Voice Encoding
As most of us know, the new microphone has, up to this point, been implemented with the following...
-- Push and hold button to start the recording from the mic.
-- Release the button to signal you are done recording.
-- The stored audio file (mp3 encoded?) is then sent back to the server
-- On the server, some decoding of speech to text is employed to extract the words spoken
-- The proper service or event is triggered (such as reading the weather, playing a radio station, etc).
So, a couple of questions I have that maybe you smart folks know the answer to...
-- What is the audio encoding format?
-- What is the time limit on how long you can speak (memory limitations)?
-- Can the audio upload and event triggered happen on the same connection to the server (I think it does this now)?
-- Can the 'record audio' button be 'pushed' during a server ping, allowing for automatic audio capture without user intervention?
Roy