Top Drink Computer Vision Models

The models below have been fine-tuned for various drink detection tasks. You can try out each model in your browser, or test a deployment to an edge device (e.g. an NVIDIA Jetson). You can also use the datasets associated with these models as a starting point for building your own drink detection model.

At the bottom of this page, we have guides on how to count drinks in images and videos.

Object Detection Model yolov8 yolov8n snap

9744 images, 3 models

bag bread burger cake cheese chicken coke cup drink fork fresh lettuce milk pepper pizza sandwich spoon sugar tomato water

Object Detection Model snap

American-sign-language

MajorProject

9819 images, 1 model

bag bread burger cake cheese chicken coke cup drink fork fresh lettuce milk pepper pizza sandwich spoon sugar tomato water

Object Detection Model snap

ocean waste

research

9074 images, 2 models

bottle can cardboard drink can drink carton glass metal paper plastic plastic bottle sprite tire 1 2 4 Beverage Cans Bolsa Botella Bottle Bottle cap

Object Detection Model snap

singM8v2

KEVIN ponce

6594 images, 1 model

bag bread burger cake cheese chicken coke cup drink fork fresh lettuce milk pepper pizza sandwich spoon sugar tomato water

Object Detection Model

merged

yoloMultibleClassesTwo

7936 images, 1 model

bottle can carton cat cigarette cloth cotton cup glass metal paper plastic styrofoam Bottle Box Butt Cap Cup Glass jar Match_Box

Object Detection Model snap

asd

bumjune lee

3961 images, 1 model

drink abnormal-situation-while-driving normal-situation-while-driving

Object Detection Model snap

ASL

MajorProject

9648 images, 1 model

bag bread burger cake cheese chicken coke cup drink fork fresh lettuce milk pepper pizza sandwich spoon sugar tomato water

Object Detection Model snap

ingredients

Wonkeun Jung

9609 images, 1 model

apple avocado bakery banana beef bell pepper bottle bread broccoli butter cabbage can candy carrot cauliflower cheese cherry chicken chocolate coconut

Object Detection Model

Object detection

ProjDIT

2860 images, 1 model

apple background bag banana basket bear bed beef bench bike bird board boat book bottle box bread building bus cake

Object Detection Model

beh

674129530@qq.com

3495 images, 1 model

drink face phone smoke

Object Detection Model snap

ASL new

ASL 2

9036 images, 2 models

bag bread burger cake cheese chicken coke cup drink fork fresh lettuce milk pepper pizza sandwich spoon sugar tomato water

Object Detection Model

Chicken pose all

Cornell University

1711 images, 1 model

eat/drink moving rest

Object Detection Model snap yolonas-s yolonas

Appetity

2119 images, 3 models

apple avocado banana beef beer bread broccoli butter cabbage carrot cauliflower cheese chicken chocolate corn croissant cucumber egg eggplant fish

Object Detection Model snap

four classes detect

2534 images, 2 models

drink face phone smoke

Object Detection Model

MixCDN

GTnoodles

434 images, 1 model

Can Drink Food Non-Food Non-noodles Noodles

Object Detection Model

Drinks Image Recognition v2

School

50 images, 1 model

Energy Drink Soda Water

Object Detection Model

Vegetable 3

Upwork2

1375 images, 2 models

Apple Baked goods Ball Balloon Banana Bat (Animal) Beer Bell pepper Bench Bird Blender Bottle Bowl Box Boy Bread Broccoli Building Cabbage Cake

Object Detection Model snap yolov8 yolov8l

bsl-testing

CS230

120 images, 1 model

drink milk allDone changeDiaper eat more

Object Detection Model

food_detect

estesis

762 images, 1 model

bread drink fruit porridge salad tea bake garnish kompl meal soup sous

Object Detection Model

Multilingual sign language detection

Project

2588 images, 1 model

drink house stop I Play So-so Tomorrow Use Your before clean eat future good_morning happy hate hello help hour how

Object Detection Model

Mine

MyProject

1547 images, 3 models

drink Cover Good Happy Hello I No Recognize Time Tire Yes cover

Guide: How to Count Drinks with Computer Vision

With a model hosted on Roboflow like the ones above and the open source supervision Python package, you can count drinks in your images and videos.

The following code snippet counts the number of drinks present in an image.

To use the snippet below, you will need to run pip install roboflow supervision. Replace the workspace, project name, and version number with those of any model trained on Universe, such as those listed above.

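import supervision as sv
import roboflow

roboflow.login()
rf = roboflow.Roboflow()

# replace with the workspace, project, and version of the drink model you chose above
project = rf.workspace("mohamed-traore-2ekkp").project("taco-trash-annotations-in-context")
drink_model = project.version(16).model

# run inference on one image and convert the response to supervision Detections
# (newer supervision releases name this method sv.Detections.from_inference)
results = drink_model.predict("drink.jpg").json()
drinks = sv.Detections.from_roboflow(results)

# print the number of detections (this counts every class the model predicts)
print(len(drinks))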
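Many of the models listed above predict several classes besides drink (bag, bread, cup, and so on), so the length of the detections object counts every detected object. As a minimal sketch, the count can be restricted to one class and a minimum confidence by working directly on the prediction JSON. The field names ("predictions", "class", "confidence") follow the shape of the hosted API response as we understand it, and count_class and the sample values are hypothetical, for illustration only.

```python
from typing import Any, Dict, List


def count_class(
    result: Dict[str, Any],
    class_name: str = "drink",
    min_confidence: float = 0.5,
) -> int:
    """Count predictions of one class at or above a confidence threshold."""
    predictions: List[Dict[str, Any]] = result.get("predictions", [])
    return sum(
        1
        for p in predictions
        if p.get("class") == class_name
        and p.get("confidence", 0.0) >= min_confidence
    )


# hypothetical response, shaped like drink_model.predict("drink.jpg").json()
sample = {
    "predictions": [
        {"x": 120, "y": 80, "width": 40, "height": 90, "class": "drink", "confidence": 0.91},
        {"x": 300, "y": 95, "width": 60, "height": 60, "class": "cup", "confidence": 0.88},
        {"x": 410, "y": 70, "width": 38, "height": 85, "class": "drink", "confidence": 0.32},
    ]
}

# only the first prediction is a "drink" with confidence >= 0.5
print(count_class(sample))  # prints 1
```

In the snippet above, the same function could be called as count_class(results) right after the predict call; lowering min_confidence trades fewer missed drinks for more false positives.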

Guide: How to Count Drinks in a Zone

With a bit more code, you can count the number of drinks present in a specific zone of your image or video.

The following code snippet counts the number of drinks present in a zone in each frame of a video.

To use the snippet below, you will need to run pip install roboflow supervision. Replace the workspace, project name, and version number with those of any model trained on Universe, such as those listed above.

Read our blog post on counting objects in a zone

import numpy as np
import supervision as sv
import roboflow

SOURCE_VIDEO_PATH = "drink.mp4"
TARGET_VIDEO_PATH = "drink_out.mp4"

# use https://roboflow.github.io/polygonzone/ to get the points for your shape
polygon = np.array([
    # draw 50x50 box in top left corner
    [0, 0],
    [50, 0],
    [50, 50],
    [0, 50]
])

roboflow.login()
rf = roboflow.Roboflow()

# replace with the workspace, project, and version of the drink model you chose above
project = rf.workspace("mohamed-traore-2ekkp").project("taco-trash-annotations-in-context")
drink_model = project.version(16).model

# create ByteTrack instance
drink_tracker = sv.ByteTrack(track_thresh=0.25, track_buffer=30, match_thresh=0.8, frame_rate=30)

# create VideoInfo instance
video_info = sv.VideoInfo.from_video_path(SOURCE_VIDEO_PATH)

# create frame generator
generator = sv.get_video_frames_generator(SOURCE_VIDEO_PATH)

# create PolygonZone instance
zone = sv.PolygonZone(polygon=polygon, frame_resolution_wh=(video_info.width, video_info.height))

# create BoxAnnotator instance
box_annotator = sv.BoxAnnotator(thickness=4, text_thickness=4, text_scale=2)

colors = sv.ColorPalette.default()

# create PolygonZoneAnnotator instance
zone_annotator = sv.PolygonZoneAnnotator(thickness=4, text_thickness=4, text_scale=2, zone=zone, color=colors.colors[0])

# define callback function to be used in video processing
def callback(frame: np.ndarray, index: int) -> np.ndarray:
    # model prediction on single frame and conversion to supervision Detections
    results = drink_model.predict(frame).json()
    drinks = sv.Detections.from_roboflow(results)

    # show drink detections in real time
    print(drinks)

    # tracking drink detections
    drinks = drink_tracker.update_with_detections(drinks)

    # update the zone so its current count reflects this frame
    zone.trigger(detections=drinks)

    annotated_frame = box_annotator.annotate(scene=frame, detections=drinks)
    annotated_frame = zone_annotator.annotate(scene=annotated_frame)

    # return frame with box and zone annotated result
    return annotated_frame

# process the whole video
sv.process_video(
    source_path=SOURCE_VIDEO_PATH,
    target_path=TARGET_VIDEO_PATH,
    callback=callback
)
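To make the zone count less of a black box, here is a rough sketch of the underlying idea: take one anchor point per bounding box and test whether it falls inside the polygon. This is not the supervision implementation. The bottom-center anchor, the ray-casting test, and the helper names point_in_polygon and count_in_zone are assumptions chosen for illustration, and the boxes are made-up values.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def point_in_polygon(point: Point, polygon: Sequence[Point]) -> bool:
    """Ray-casting test: cast a ray to the right and count edge crossings."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge meets that horizontal line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def count_in_zone(boxes: Sequence[Box], polygon: Sequence[Point]) -> int:
    """Count boxes whose bottom-center anchor lies inside the polygon."""
    count = 0
    for x_min, y_min, x_max, y_max in boxes:
        anchor = ((x_min + x_max) / 2, y_max)
        if point_in_polygon(anchor, polygon):
            count += 1
    return count


# same 50x50 zone as the snippet above; boxes are hypothetical
zone_polygon = [(0, 0), (50, 0), (50, 50), (0, 50)]
boxes = [
    (10, 5, 30, 40),   # anchor (20, 40): inside the zone
    (40, 20, 80, 45),  # anchor (60, 45): right of the zone
    (5, 30, 25, 70),   # anchor (15, 70): below the zone
]

print(count_in_zone(boxes, zone_polygon))  # prints 1
```

A single anchor point keeps the test cheap, but it means a drink that overlaps the zone only partially is counted or not depending on where its anchor lands.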

Guide: How to Track Drinks Crossing a Line

You can count how many drinks have crossed a line using the supervision LineZone class (previously called LineCounter).

The following code snippet counts the number of drinks that cross a line in a video.

To use the snippet below, you will need to run pip install roboflow supervision. Replace the workspace, project name, and version number with those of any model trained on Universe, such as those listed above.

import numpy as np
import supervision as sv
import roboflow

SOURCE_VIDEO_PATH = "drink.mp4"
TARGET_VIDEO_PATH = "drink_out.mp4"

# use https://roboflow.github.io/polygonzone/ to get the points for your line
LINE_START = sv.Point(0, 300)
LINE_END = sv.Point(800, 300)

roboflow.login()
rf = roboflow.Roboflow()

# replace with the workspace, project, and version of the drink model you chose above
project = rf.workspace("mohamed-traore-2ekkp").project("taco-trash-annotations-in-context")
drink_model = project.version(16).model

# create ByteTrack instance
drink_tracker = sv.ByteTrack(track_thresh=0.25, track_buffer=30, match_thresh=0.8, frame_rate=30)

# create VideoInfo instance
video_info = sv.VideoInfo.from_video_path(SOURCE_VIDEO_PATH)

# create frame generator
generator = sv.get_video_frames_generator(SOURCE_VIDEO_PATH)

# create LineZone instance (previously called LineCounter)
line_zone = sv.LineZone(start=LINE_START, end=LINE_END)

# create BoxAnnotator instance
box_annotator = sv.BoxAnnotator(thickness=4, text_thickness=4, text_scale=2)

# create TraceAnnotator and LineZoneAnnotator instances
trace_annotator = sv.TraceAnnotator(thickness=4, trace_length=50)
line_zone_annotator = sv.LineZoneAnnotator(thickness=4, text_thickness=4, text_scale=2)

# define callback function to be used in video processing
def callback(frame: np.ndarray, index: int) -> np.ndarray:
    # model prediction on single frame and conversion to supervision Detections
    results = drink_model.predict(frame).json()
    drinks = sv.Detections.from_roboflow(results)

    # show drink detections in real time
    print(drinks)

    # tracking drink detections
    drinks = drink_tracker.update_with_detections(drinks)
    annotated_frame = trace_annotator.annotate(
        scene=frame.copy(),
        detections=drinks
    )
    annotated_frame = box_annotator.annotate(
        scene=annotated_frame,
        detections=drinks
    )

    # update line counter
    line_zone.trigger(drinks)

    # return frame with box and line annotated result
    return line_zone_annotator.annotate(annotated_frame, line_counter=line_zone)

# process the whole video
sv.process_video(
    source_path=SOURCE_VIDEO_PATH,
    target_path=TARGET_VIDEO_PATH,
    callback=callback
)
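The reason the tracker is needed for line counting can be illustrated with a simplified sketch: a crossing is detected when the same tracked object appears on one side of the line in one frame and on the other side in a later frame. The code below is not how supervision implements LineZone. It uses a single anchor point per object, treats the line as infinite, and the names side_of_line and SimpleLineCounter, the "in"/"out" direction convention, and the frame data are all assumptions made for illustration.

```python
from typing import Dict, Optional, Tuple

Point = Tuple[float, float]


def side_of_line(point: Point, start: Point, end: Point) -> float:
    """2D cross product: its sign tells which side of start->end the point is on."""
    return (end[0] - start[0]) * (point[1] - start[1]) - (
        end[1] - start[1]
    ) * (point[0] - start[0])


class SimpleLineCounter:
    """Counts tracked points that switch sides of an (infinite) line."""

    def __init__(self, start: Point, end: Point) -> None:
        self.start = start
        self.end = end
        self.in_count = 0
        self.out_count = 0
        self._last_side: Dict[int, bool] = {}

    def update(self, anchors: Dict[int, Point]) -> None:
        """anchors maps tracker id -> anchor point for the current frame."""
        for tracker_id, point in anchors.items():
            side = side_of_line(point, self.start, self.end) > 0
            previous: Optional[bool] = self._last_side.get(tracker_id)
            self._last_side[tracker_id] = side
            if previous is None or previous == side:
                continue  # first sighting, or still on the same side
            if side:
                self.in_count += 1
            else:
                self.out_count += 1


# same line as the snippet above; the tracked positions are hypothetical
counter = SimpleLineCounter(start=(0, 300), end=(800, 300))
frames = [
    {1: (100, 250), 2: (400, 350), 3: (600, 100)},
    {1: (100, 290), 2: (400, 310), 3: (600, 110)},
    {1: (100, 320), 2: (400, 280), 3: (600, 120)},
]
for anchors in frames:
    counter.update(anchors)

# track 1 moves from y < 300 to y > 300, track 2 does the opposite, track 3 never crosses
print(counter.in_count, counter.out_count)  # prints 1 1
```

Without stable tracker ids there would be no way to tell that the point below the line in one frame is the same drink that was above it in the previous frame, which is why detections pass through the tracker before the line zone is updated.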