Detecting recordings containing human voices? #763

Answered by tphakala
esfel asked this question in Q&A

The BirdNET model includes human vocal detection. The v2.4 classifier has the following labels:

Human non-vocal_Human non-vocal
Human vocal_Human vocal
Human whistle_Human whistle

How reliable that detection is, however, is another matter. The v2.4 classifier isn't very good at detecting human voices, but human speech detection can be improved by appending additional speech training data to the default classifier.
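As a rough illustration of how the labels above could be used to flag recordings, here is a minimal Python sketch. It assumes you already have the classifier's results as (label, confidence) pairs; the `Detection` record, the `contains_human` helper, and the 0.5 threshold are hypothetical names and values made up for this example, not part of BirdNET or any of its tooling. Only the three label strings come from the answer.

```python
from typing import Iterable, NamedTuple

# Labels quoted in the answer above, in "<name>_<name>" form.
HUMAN_LABELS = frozenset({
    "Human non-vocal_Human non-vocal",
    "Human vocal_Human vocal",
    "Human whistle_Human whistle",
})


class Detection(NamedTuple):
    """Hypothetical record: one classifier result for one audio segment."""
    label: str
    confidence: float


def contains_human(
    detections: Iterable[Detection],
    min_confidence: float = 0.5,  # assumed threshold, tune for your own data
    labels: frozenset = HUMAN_LABELS,
) -> bool:
    """Return True if any detection carries a human label at or above the threshold."""
    return any(
        d.label in labels and d.confidence >= min_confidence
        for d in detections
    )


if __name__ == "__main__":
    # Made-up results for a single recording, for illustration only.
    results = [
        Detection("Turdus merula_Eurasian Blackbird", 0.91),
        Detection("Human vocal_Human vocal", 0.72),
    ]
    print(contains_human(results))
```

To match speech only, pass `labels=frozenset({"Human vocal_Human vocal"})`. Given the reliability caveat above, a flag like this is probably best treated as a hint for manual review rather than a definitive filter.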

Answer selected by esfel