on_speech_started event is unusable #504

Open
Vaibhav-Lodha opened this issue Feb 19, 2025 · 0 comments

What is the current behavior?

on_speech_started is triggered even when there is only background noise. This makes it unusable as a VAD signal, where we need it to tell us exactly when the user has started speaking.

Steps to reproduce

  1. Use the following config:

     sample_rate = 16000
     model = "nova-2"
     default_options = LiveOptions(
         language=Language.HI,
         model=model,
         sample_rate=sample_rate,
         vad_events=True,
         endpointing=300,
         utterance_end_ms="1000",
         interim_results=True,
         punctuate=False,
     )

  2. Register a handler for the on_speech_started event and log each time it fires. Without saying anything, the event keeps firing and the log keeps growing.
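For step 2, a handler along these lines makes the spurious events easy to count. This is a minimal sketch, not the reporter's code: `SpeechStartedCounter` and `counter` are hypothetical names, and the registration call in the final comment is an assumption about the SDK's event API that has not been checked against the version used here.

```python
import logging
import time

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("vad_repro")


class SpeechStartedCounter:
    """Callable event handler that logs and counts every invocation."""

    def __init__(self):
        self.timestamps = []

    def __call__(self, *args, **kwargs):
        # Accept any arguments so the handler does not depend on the exact
        # callback signature of the SDK version in use.
        self.timestamps.append(time.monotonic())
        log.info("on_speech_started fired (%d so far)", len(self.timestamps))


counter = SpeechStartedCounter()

# Assumed registration call (hypothetical `connection` object):
# connection.on(LiveTranscriptionEvents.SpeechStarted, counter)
```

Streaming a recording that contains only background noise and then inspecting `len(counter.timestamps)` would give a concrete number of false triggers to attach to the report.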

Expected behavior

  1. The event should fire only when actual speech (a spoken word) is detected.
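Until the event behaves this way, one possible client-side approximation is to treat a speech-start event as a candidate and confirm it only when a non-empty interim transcript follows (the config above already sets interim_results=True). The sketch below is an assumption-laden illustration, not part of the SDK: `SpeechStartGate` and its method names are hypothetical, and the caller is expected to wire them to the corresponding event handlers.

```python
from typing import Optional


class SpeechStartGate:
    """Confirm a speech start only once a non-empty transcript follows it."""

    def __init__(self) -> None:
        self.pending_start: Optional[float] = None
        self.in_speech = False

    def on_speech_started(self, timestamp: float) -> None:
        # Keep only the latest candidate; earlier ones were likely noise.
        if not self.in_speech:
            self.pending_start = timestamp

    def on_transcript(self, text: str) -> Optional[float]:
        """Return the confirmed start time when the first real words arrive."""
        if not text.strip() or self.in_speech or self.pending_start is None:
            return None
        self.in_speech = True
        start, self.pending_start = self.pending_start, None
        return start

    def on_utterance_end(self) -> None:
        # Reset so the next utterance can be confirmed again.
        self.in_speech = False
        self.pending_start = None
```

The trade-off is latency: the start is confirmed only after the first interim result arrives, so this does not replace an accurate VAD event.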

Please tell us about your environment

  • Operating System/Version: macOS 15.3 / Ubuntu 22.04
  • Python Version: 3.12