People Detection System Using YOLOv3 Algorithm

In crowd security systems, precise real-time detection of people in images or videos can be very challenging especially in complex and dense crowds whereby some individuals could possibly be partly or entirely occluded for varying lengths of time. Thus, this paper presents a large Convolutional Neur...

詳細記述

書誌詳細
出版年:Proceedings - 10th IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2020
第一著者: 2-s2.0-85093873154
フォーマット: Conference paper
言語:English
出版事項: Institute of Electrical and Electronics Engineers Inc. 2020
オンライン・アクセス:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85093873154&doi=10.1109%2fICCSCE50387.2020.9204925&partnerID=40&md5=6716080f457c3fd088e028f2eb4eeed4
id Hassan N.I.; Tahir N.M.; Zaman F.H.K.; Hashim H.
spelling Hassan N.I.; Tahir N.M.; Zaman F.H.K.; Hashim H.
2-s2.0-85093873154
People Detection System Using YOLOv3 Algorithm
2020
Proceedings - 10th IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2020


10.1109/ICCSCE50387.2020.9204925
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85093873154&doi=10.1109%2fICCSCE50387.2020.9204925&partnerID=40&md5=6716080f457c3fd088e028f2eb4eeed4
In crowd security systems, precise real-time detection of people in images or videos can be very challenging especially in complex and dense crowds whereby some individuals could possibly be partly or entirely occluded for varying lengths of time. Thus, this paper presents a large Convolutional Neural Network (CNN) that is trained using a single step model, You Only Look Once version 3 (YOLOv3) on Google Colaboratory to process the images within a database and to accurately locate people within the images. YOLOv3 splits the image up into regions and predicts bounding boxes and predicts the probabilities for each region. These bounding boxes are weighted by the projected probabilities and finally, the model is able to make its detection based on the final weights. This model will be using a customised dataset from Google's Open Images with 500 high resolution images. Once trained, the neural network able to successfully generate the test data and achieve a mean average precision (mAP) of 78.3% and a final average loss of 0.6 on top of confidently detecting the people within the images. © 2020 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author 2-s2.0-85093873154
spellingShingle 2-s2.0-85093873154
People Detection System Using YOLOv3 Algorithm
author_facet 2-s2.0-85093873154
author_sort 2-s2.0-85093873154
title People Detection System Using YOLOv3 Algorithm
title_short People Detection System Using YOLOv3 Algorithm
title_full People Detection System Using YOLOv3 Algorithm
title_fullStr People Detection System Using YOLOv3 Algorithm
title_full_unstemmed People Detection System Using YOLOv3 Algorithm
title_sort People Detection System Using YOLOv3 Algorithm
publishDate 2020
container_title Proceedings - 10th IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2020
container_volume
container_issue
doi_str_mv 10.1109/ICCSCE50387.2020.9204925
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85093873154&doi=10.1109%2fICCSCE50387.2020.9204925&partnerID=40&md5=6716080f457c3fd088e028f2eb4eeed4
description In crowd security systems, precise real-time detection of people in images or videos can be very challenging especially in complex and dense crowds whereby some individuals could possibly be partly or entirely occluded for varying lengths of time. Thus, this paper presents a large Convolutional Neural Network (CNN) that is trained using a single step model, You Only Look Once version 3 (YOLOv3) on Google Colaboratory to process the images within a database and to accurately locate people within the images. YOLOv3 splits the image up into regions and predicts bounding boxes and predicts the probabilities for each region. These bounding boxes are weighted by the projected probabilities and finally, the model is able to make its detection based on the final weights. This model will be using a customised dataset from Google's Open Images with 500 high resolution images. Once trained, the neural network able to successfully generate the test data and achieve a mean average precision (mAP) of 78.3% and a final average loss of 0.6 on top of confidently detecting the people within the images. © 2020 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1828987872547438592