add stt output to input prompt area #3269

yashschandra · 2025-02-19T01:43:34Z

Pull Request Type

Relevant Issues

resolves #3268

What is in this change?

Stopped sending voice command directly as prompt to LLM provider so that user can get a chance to edit or add to it if there was a long pause while speaking.

Additional Information

Before -

before.mov

Now -

after.mov

Additionally fixed a minor typo: endTTSSession -> endSTTSession

Developer Validations

I ran yarn lint from the root of the repo & committed changes
Relevant documentation has been updated
I have tested my code functionality
Docker build succeeds locally

timothycarambat · 2025-02-19T02:48:39Z

FWIW, it used to be this way, then people wanted it to autosubmit, now this PR would change it back

therealtimex · 2025-02-19T06:40:16Z

Can we implement two modes:

A long press of the microphone icon activates "continuous" mode, which enables autosubmit. The UI should visually indicate that this mode is active.
A short press of the microphone icon triggers manual submit.

yashschandra · 2025-02-19T09:33:09Z

FWIW, it used to be this way, then people wanted it to autosubmit, now this PR would change it back

I get this, but I feel for a non-native english speaking (like myself) it may be useful

Can we implement two modes:

A long press of the microphone icon activates "continuous" mode, which enables autosubmit. The UI should visually indicate that this mode is active.
A short press of the microphone icon triggers manual submit.

I was thinking having this configuration in Settings but this may also work. @timothycarambat any thoughts on this approach?

yashschandra · 2025-02-22T21:47:29Z

@timothycarambat is there any chance a feature like this (or something similar) can be included?

timothycarambat · 2025-02-25T20:35:25Z

@therealtimex That kind of UX is ambiguous and is bound to be non-discoverable. Will have to make this a setting in the Voice and Speech area or elsewhere so that we can stop flip-flipping PRs every couple months on this.

therealtimex · 2025-02-25T21:30:39Z

Agreed, setting is always a safe bet.

yashschandra · 2025-03-01T19:05:44Z

added a checkbox for Autosubmit in Voice and Speech settings section -

output.mp4

add stt output to input prompt area

45c44f4

yashschandra mentioned this pull request Feb 19, 2025

[FEAT]: Speech to text confirmation before submission #3268

Open

Merge branch 'master' into 3268-stt-confirmation

fb5d0a2

Merge branch 'Mintplex-Labs:master' into 3268-stt-confirmation

476d349

timothycarambat self-assigned this Feb 25, 2025

timothycarambat added the needs review label Feb 25, 2025

timothycarambat and others added 4 commits February 26, 2025 15:29

Merge branch 'master' into 3268-stt-confirmation

fbe6f1a

add speech to text autosubmit in settings

5690c84

merge conflict resolved with master

e7f96f6

string value comparison fix

773783d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add stt output to input prompt area #3269

add stt output to input prompt area #3269

yashschandra commented Feb 19, 2025 •

edited

Loading

timothycarambat commented Feb 19, 2025

therealtimex commented Feb 19, 2025

yashschandra commented Feb 19, 2025 •

edited

Loading

yashschandra commented Feb 22, 2025

timothycarambat commented Feb 25, 2025

therealtimex commented Feb 25, 2025

yashschandra commented Mar 1, 2025

add stt output to input prompt area #3269

Are you sure you want to change the base?

add stt output to input prompt area #3269

Conversation

yashschandra commented Feb 19, 2025 • edited Loading

Pull Request Type

Relevant Issues

What is in this change?

Additional Information

Developer Validations

timothycarambat commented Feb 19, 2025

therealtimex commented Feb 19, 2025

yashschandra commented Feb 19, 2025 • edited Loading

yashschandra commented Feb 22, 2025

timothycarambat commented Feb 25, 2025

therealtimex commented Feb 25, 2025

yashschandra commented Mar 1, 2025

yashschandra commented Feb 19, 2025 •

edited

Loading

yashschandra commented Feb 19, 2025 •

edited

Loading