MADA-driver-assistant-redu Computer Vision Project

Alberto Julian

Updated 2 months ago

0

views

0

downloads
Classes (23)
bicycle bus car construction
cycles crossing
dead end street
give way
go left
go right
motorcycle no entry
no left turn
no overtaking
no priority
no right turn
pedestrian crossing person roundabout
school crossing
speed limit stop traffic light truck

Metrics

Try This Model
Drop an image or
Description

MADA (Multimedia Agentic Driver Assistant) project's goal is having a minimum, but as functional as possible, driver assistant that works outdoor in real-time. Next link shows the MADA blocks.

MADA's Object Detector is based on a yolo model trained with COCO 2017 and fine-tuned with a dataset containing 23 classes from 4 categories:

  • Vehicles: 'bicycle', 'bus', 'car', 'motorcycle', 'truck'
  • People: 'person'
  • Traffic Signs: 'construction', 'cycles crossing', 'dead end street', 'give way', 'go left', 'go right', 'no entry', 'no left turn', 'no overtaking', 'no priority', 'no right turn', 'pedestrian crossing', 'roundabout', 'school crossing', 'speed limit', 'stop'
  • Traffic Lights: 'traffic light'

The images and initial annotations were selected from several datasets:

  • Microsoft's COCO (Common Objects in COntext): includes more than 80 general image types, from which a few are useful for MADA: person, car, bus, bicycle, truck, motorcycle.
  • GTSDB (German Traffic Sign Detection Benchmark): composed of images and annotations of more than 40 traffic sign classes, and some of them have been selected for MADA: speed limits, stop, give way, roundabout, pedestrian crossing...
  • DFG-TSD (DFG Traffic Sign Dataset; DFG is a Slovenian company): includes more than 200 traffic sign classes, some of which have been selected to complement those less represented in GTSDB or not included: some speed limits, dead end street, no left turn, no right turn, no priority.
  • S2TLD (SJTU Small Traffic Light Detection; SJTU is Shanghai Jiao Tong University): provides traffic light images and annotations, with separate types for red, green and yellow lights (finally I decided to merge them in a single traffic light class).

Since images from each of the datasets might have non-annotated instances of classes considered only in another dataset (for instance, the images in the S2TLD dataset have only annotations of traffic lights, but there are also non-annotated cars, crossing people and traffic signs), I had to review the whole dataset to add annotations (labels and bounding boxes) for the non-annotated instances. The merging of images and annotations (and the additions) from different datasets was done in the Roboflow platform.

References

COCO dataset:

- @article{lin2014microsoft,
  title={Microsoft COCO: Common Objects in Context},
  author={Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Hays, James and Perona, Pietro and Ramanan, Deva and Doll{\'a}r, Piotr and Zitnick, C. Lawrence},
  journal={arXiv preprint arXiv:1405.0312},
  year={2014}
}

GTSDB dataset:

@inproceedings{Houben-IJCNN-2013,
   author = {Sebastian Houben and Johannes Stallkamp and Jan Salmen and Marc Schlipsing and Christian Igel},
   booktitle = {International Joint Conference on Neural Networks},
   title = {Detection of Traffic Signs in Real-World Images: The {G}erman {T}raffic {S}ign {D}etection {B}enchmark},
   number = {1288},
   year = {2013},
}

DFG-TSD dataset:

@article{Tabernik2019ITS,
   title = {{Deep Learning for Large-Scale Traffic-Sign Detection and Recognition}},   
   author = {Tabernik, Domen and Sko{\v{c}}aj, Danijel},
   journal = {IEEE Transactions on Intelligent Transportation Systems},
   year = {2019},
   doi={10.1109/TITS.2019.2913588}, 
   ISSN={1524-9050}
}

S2TLD dataset:

@article{yang2022scrdet++,
  title={Scrdet++: Detecting small, cluttered and rotated objects via instance-level feature denoising and rotation loss smoothing},
  author={Yang, Xue and Yan, Junchi and Liao, Wenlong and Yang, Xiaokang and Tang, Jin and He, Tao},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2022},
  publisher={IEEE}
}

More details and code in MADA's GitHub repo.

Use This Trained Model

Try it in your browser, or deploy via our Hosted Inference API and other deployment methods.

Supervision

Build Computer Vision Applications Faster with Supervision

Visualize and process your model results with our reusable computer vision tools.

Cite This Project

LICENSE
CC BY 4.0

If you use this dataset in a research paper, please cite it using the following BibTeX:

                        @misc{
                            mada-driver-assistant-redu_dataset,
                            title = { MADA-driver-assistant-redu Dataset },
                            type = { Open Source Dataset },
                            author = { Alberto Julian },
                            howpublished = { \url{ https://universe.roboflow.com/alberto-julian-7z4dc/mada-driver-assistant-redu } },
                            url = { https://universe.roboflow.com/alberto-julian-7z4dc/mada-driver-assistant-redu },
                            journal = { Roboflow Universe },
                            publisher = { Roboflow },
                            year = { 2024 },
                            month = { oct },
                            note = { visited on 2024-12-24 },
                            }
                        
                    

Similar Projects

See More
939 images