50 - 100 of 500k+

Top Speaker Computer Vision Models

The models below have been fine-tuned for various speaker detection tasks. You can try out each model in your browser, or test an edge deployment solution (i.e. to an NVIDIA Jetson). You can use the datasets associated with the models below as a starting point for building your own speaker detection model.

At the bottom of this page, we have guides on how to count speakers in images and videos.

50 - 100 of 500k+