1 - 30 of 100k+
Top Voice Computer Vision Models
The models below have been fine-tuned for various voice detection tasks. You can try out each model in your browser, or test an edge deployment solution (i.e. to an NVIDIA Jetson). You can use the datasets associated with the models below as a starting point for building your own voice detection model.
At the bottom of this page, we have guides on how to count voices in images and videos.
by caricon
665 images 60 classes
aauto-icon-calendar aauto-icon-customize aauto-icon-discord aauto-icon-exit aauto-icon-game aauto-icon-go_back aauto-icon-google-news aauto-icon-google-podcast aauto-icon-list-down aauto-icon-list-up aauto-icon-maps aauto-icon-menu-app_list aauto-icon-menu-home_grid aauto-icon-messager aauto-icon-messages aauto-icon-phone aauto-icon-phone-accept_call aauto-icon-phone-active_call aauto-icon-phone-contact aauto-icon-phone-contacts
106 images 3398 classes
38 images 3 classes
by db
3960 images 15 classes
600 images 58 classes
aauto-icon-calendar aauto-icon-customize aauto-icon-discord aauto-icon-exit aauto-icon-game aauto-icon-go_back aauto-icon-google-news aauto-icon-google-podcast aauto-icon-list-down aauto-icon-list-up aauto-icon-maps aauto-icon-menu-app_list aauto-icon-menu-home_grid aauto-icon-messager aauto-icon-messages aauto-icon-phone aauto-icon-phone-accept_call aauto-icon-phone-active_call aauto-icon-phone-contact aauto-icon-phone-contacts
1 - 30 of 100k+