Search | Korea Science

Convolutional Neural Network-based System for Vehicle Front-Side Detection (컨볼루션 신경망 기반의 차량 전면부 검출 시스템)

Park, Young-Kyu;Park, Je-Kang;On, Han-Ik;Kang, Dong-Joong
- Journal of Institute of Control, Robotics and Systems
- /
- v.21 no.11
- /
- pp.1008-1016
- /
- 2015
This paper proposes a method for detecting the front side of vehicles. The method can find the car side with a license plate even with complicated and cluttered backgrounds. A convolutional neural network (CNN) is used to solve the detection problem as a unified framework combining feature detection, classification, searching, and localization estimation and improve the reliability of the system with simplicity of usage. The proposed CNN structure avoids sliding window search to find the locations of vehicles and reduces the computing time to achieve real-time processing. Multiple responses of the network for vehicle position are further processed by a weighted clustering and probabilistic threshold decision method. Experiments using real images in parking lots show the reliability of the method.
https://doi.org/10.5302/J.ICROS.2015.15.0163 인용 PDF KSCI

AR Anchor System Using Mobile Based 3D GNN Detection

Jeong, Chi-Seo;Kim, Jun-Sik;Kim, Dong-Kyun;Kwon, Soon-Chul;Jung, Kye-Dong
- International Journal of Internet, Broadcasting and Communication
- /
- v.13 no.1
- /
- pp.54-60
- /
- 2021
AR (Augmented Reality) is a technology that provides virtual content to the real world and provides additional information to objects in real-time through 3D content. In the past, a high-performance device was required to experience AR, but it was possible to implement AR more easily by improving mobile performance and mounting various sensors such as ToF (Time-of-Flight). Also, the importance of mobile augmented reality is growing with the commercialization of high-speed wireless Internet such as 5G. Thus, this paper proposes a system that can provide AR services via GNN (Graph Neural Network) using cameras and sensors on mobile devices. ToF of mobile devices is used to capture depth maps. A 3D point cloud was created using RGB images to distinguish specific colors of objects. Point clouds created with RGB images and Depth Map perform downsampling for smooth communication between mobile and server. Point clouds sent to the server are used for 3D object detection. The detection process determines the class of objects and uses one point in the 3D bounding box as an anchor point. AR contents are provided through app and web through class and anchor of the detected object.
https://doi.org/10.7236/IJIBC.2021.13.1.54 인용 PDF KSCI

Communication Structure for Smart Railway Network (스마트 철도 네트워크를 위한 통신 구조)

Kim, Young-dong
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.197-199
- /
- 2021
High speed railway system is progressed to SRN(Smart Railway Network) having entirely automation function beyond each componet automations. It is necessity to use mobile communication technology of LTE-R(Long Term Evolution - Railway) and 5G-R(5th Generation - Railway) and information technology of convergence based on AI, Big Data, Deep Learning to construct this smart railway networks. In this paper, a communication structure is suggested for SRN. This suggested communication structure for SRN is composed to include safety operation of high speed train, railway system management and customer services, and also have complexing function of these each functions. Results of this study can be used for SRN construction and opeation, and development of railway communication standards.
PDF

Development of an electric kick-board helmet recognition system based on deep learning (딥러닝 기반의 전동킥보드 헬멧착용 인식시스템 개발)

Park, Joon-Ho;Hwang, Ji-Min;Go, Yu-Jeong;Kim, Se-Ha;Lee, Hyun-Seo
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2022.01a
- /
- pp.281-282
- /
- 2022
현재 전동 킥보드 헬멧 미착용으로 인한 사고가 끊임없이 야기되고 있다. 개인형 이동장치 이용자 수가 증가함에 따라 법 개정을 통하여 헬멧 착용이 의무 사항이지만 여전히 낮은 착용률을 나타내고 있다. 본 논문에서는 모든 공유 킥보드 회사에서 사용 가능한 딥러닝 기반의 전동킥보드 헬멧 착용 인식시스템을 제시한다. 타 공유 전동킥보드 회사 앱에서 본 논문의 결과물을 사용할 때는 사용자가 타사 앱에서 헬멧 인식 요청 시 자사 앱에서 헬멧 착용 여부를 인식하여 결과를 전송한다. 자사 앱 사용자는 인식 기록을 조회할 수 있고, 타사 관리자는 사용자의 정보를 조회 및 관리할 수 있다. 본 시스템을 통해 전동킥보드 이용 시 헬멧 착용을 장려하여 착용률 증가와 사고 시 인명피해 감소를 기대한다.
PDF

Performance Analysis of Object Detection Method for Railway Track Equipment Based on YOLO (YOLO 기반 선로 고정장치 객체 탐지 기법의 성능 분석)

Junhwi Park;Changjoon Park;Namjung Kim;Jeonghwan Gwak
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.07a
- /
- pp.69-71
- /
- 2023
본 논문은 YOLO 기반 모델의 철도 시스템 내 선로 고정장치 탐지 성능을 비교하고 분석한다. 여기서 철도 시스템은 열차가 주행하기 위한 선로, 침목, 패스너 등의 구성요소를 포함한다. 침목은 지반과 직접적으로 연결되며, 선로를 지반 위에 안정적으로 지지하고 궤간을 정확하게 유지하는 역할을 한다. 또한, 패스너는 선로를 침목에 단단히 고정시키는 역할을 한다. 이러한 선로 고정장치의 부재는 인명 사고로 이어질 수 있어 지속적인 관리와 유지 보수가 필수적이다. 본 논문에서는 철도 시스템의 선로 고정장치 탐지를 위해 YOLO V5 및 V8 딥러닝 모델의 적용 가능성을 실험적으로 접근하며, 두 모델의 탐지 성능을 비교한다. 실험 결과, YOLO V8 및 V5 모델은 모두 뛰어난 성능을 보이는데, 특히 YOLO V8 모델이 더욱 우수한 성능을 보인다. 이로써 YOLO 알고리즘은 선로 고정장치 탐지에 적합하다는 것을 증명한다. 그러나 일부 False Positive Sample이 관측되었음을 확인하고, 이로부터 모델 성능의 개선이 필요하다는 결론을 도출하였다.
PDF

Dynamics-Based Location Prediction and Neural Network Fine-Tuning for Task Offloading in Vehicular Networks

Yuanguang Wu;Lusheng Wang;Caihong Kai;Min Peng
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.17 no.12
- /
- pp.3416-3435
- /
- 2023
Task offloading in vehicular networks is hot topic in the development of autonomous driving. In these scenarios, due to the role of vehicles and pedestrians, task characteristics are changing constantly. The classical deep learning algorithm always uses a pre-trained neural network to optimize task offloading, which leads to system performance degradation. Therefore, this paper proposes a neural network fine-tuning task offloading algorithm, combining with location prediction for pedestrians and vehicles by the Payne model of fluid dynamics and the car-following model, respectively. After the locations are predicted, characteristics of tasks can be obtained and the neural network will be fine-tuned. Finally, the proposed algorithm continuously predicts task characteristics and fine-tunes a neural network to maintain high system performance and meet low delay requirements. From the simulation results, compared with other algorithms, the proposed algorithm still guarantees a lower task offloading delay, especially when congestion occurs.
https://doi.org/10.3837/tiis.2023.12.011 인용 PDF HTML

Major Class Recommendation System based on Deep learning using Network Analysis (네트워크 분석을 활용한 딥러닝 기반 전공과목 추천 시스템)

Lee, Jae Kyu;Park, Heesung;Kim, Wooju
- Journal of Intelligence and Information Systems
- /
- v.27 no.3
- /
- pp.95-112
- /
- 2021
In university education, the choice of major class plays an important role in students' careers. However, in line with the changes in the industry, the fields of major subjects by department are diversifying and increasing in number in university education. As a result, students have difficulty to choose and take classes according to their career paths. In general, students choose classes based on experiences such as choices of peers or advice from seniors. This has the advantage of being able to take into account the general situation, but it does not reflect individual tendencies and considerations of existing courses, and has a problem that leads to information inequality that is shared only among specific students. In addition, as non-face-to-face classes have recently been conducted and exchanges between students have decreased, even experience-based decisions have not been made as well. Therefore, this study proposes a recommendation system model that can recommend college major classes suitable for individual characteristics based on data rather than experience. The recommendation system recommends information and content (music, movies, books, images, etc.) that a specific user may be interested in. It is already widely used in services where it is important to consider individual tendencies such as YouTube and Facebook, and you can experience it familiarly in providing personalized services in content services such as over-the-top media services (OTT). Classes are also a kind of content consumption in terms of selecting classes suitable for individuals from a set content list. However, unlike other content consumption, it is characterized by a large influence of selection results. For example, in the case of music and movies, it is usually consumed once and the time required to consume content is short. Therefore, the importance of each item is relatively low, and there is no deep concern in selecting. Major classes usually have a long consumption time because they have to be taken for one semester, and each item has a high importance and requires greater caution in choice because it affects many things such as career and graduation requirements depending on the composition of the selected classes. Depending on the unique characteristics of these major classes, the recommendation system in the education field supports decision-making that reflects individual characteristics that are meaningful and cannot be reflected in experience-based decision-making, even though it has a relatively small number of item ranges. This study aims to realize personalized education and enhance students' educational satisfaction by presenting a recommendation model for university major class. In the model study, class history data of undergraduate students at University from 2015 to 2017 were used, and students and their major names were used as metadata. The class history data is implicit feedback data that only indicates whether content is consumed, not reflecting preferences for classes. Therefore, when we derive embedding vectors that characterize students and classes, their expressive power is low. With these issues in mind, this study proposes a Net-NeuMF model that generates vectors of students, classes through network analysis and utilizes them as input values of the model. The model was based on the structure of NeuMF using one-hot vectors, a representative model using data with implicit feedback. The input vectors of the model are generated to represent the characteristic of students and classes through network analysis. To generate a vector representing a student, each student is set to a node and the edge is designed to connect with a weight if the two students take the same class. Similarly, to generate a vector representing the class, each class was set as a node, and the edge connected if any students had taken the classes in common. Thus, we utilize Node2Vec, a representation learning methodology that quantifies the characteristics of each node. For the evaluation of the model, we used four indicators that are mainly utilized by recommendation systems, and experiments were conducted on three different dimensions to analyze the impact of embedding dimensions on the model. The results show better performance on evaluation metrics regardless of dimension than when using one-hot vectors in existing NeuMF structures. Thus, this work contributes to a network of students (users) and classes (items) to increase expressiveness over existing one-hot embeddings, to match the characteristics of each structure that constitutes the model, and to show better performance on various kinds of evaluation metrics compared to existing methodologies.
https://doi.org/10.13088/jiis.2021.27.3.095 인용 PDF KSCI

Convolutional Neural Network based Audio Event Classification

Lim, Minkyu;Lee, Donghyun;Park, Hosung;Kang, Yoseb;Oh, Junseok;Park, Jeong-Sik;Jang, Gil-Jin;Kim, Ji-Hwan
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.6
- /
- pp.2748-2760
- /
- 2018
This paper proposes an audio event classification method based on convolutional neural networks (CNNs). CNN has great advantages of distinguishing complex shapes of image. Proposed system uses the features of audio sound as an input image of CNN. Mel scale filter bank features are extracted from each frame, then the features are concatenated over 40 consecutive frames and as a result, the concatenated frames are regarded as an input image. The output layer of CNN generates probabilities of audio event (e.g. dogs bark, siren, forest). The event probabilities for all images in an audio segment are accumulated, then the audio event having the highest accumulated probability is determined to be the classification result. This proposed method classified thirty audio events with the accuracy of 81.5% for the UrbanSound8K, BBC Sound FX, DCASE2016, and FREESOUND dataset.
https://doi.org/10.3837/tiis.2018.06.017 인용 PDF KSCI

Image Recognition and Clustering for Virtual Reality based on Cognitive Rehabilitation Contents (가상현실 기반 인지재활 콘텐츠를 위한 영상 인식 및 군집화)

Choi, KwonTaeg
- Journal of Digital Contents Society
- /
- v.18 no.7
- /
- pp.1249-1257
- /
- 2017
Due to the 4th industrial revolution and an aged society, many studies are being conducted to apply virtual reality to medical field. Research on dementia is especially active. This paper proposes virtual reality based on cognitive rehabilitation contents using image recognition and clustering method to improve cognitive and physical disabilities caused by dementia. Unlike the existing cognitive rehabilitation system, this paper uses travel photos that reflect the memories of the subjects to be treated. In order to generate automated cognitive rehabilitation contents, we extract face information, food pictures, place information, and time information from photographs, and normalization is performed for clustering. And we present scenarios that can be used as cognitive rehabilitation contents using travel photos in virtual reality space.
https://doi.org/10.9728/dcs.2017.18.7.1249 인용 PDF KSCI

Camera-based Dog Unwanted Behavior Detection (영상 기반 강아지의 이상 행동 탐지)

Atif, Othmane;Lee, Jonguk;Park, Daehee;Chung, Yongwha
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.05a
- /
- pp.419-422
- /
- 2019
The recent increase in single-person households and family income has led to an increase in the number of pet owners. However, due to the owners' difficulty to communicate with them for 24 hours, pets, and especially dogs, tend to display unwanted behavior that can be harmful to themselves and their environment when left alone. Therefore, detecting those behaviors when the owner is absent is necessary to suppress them and prevent any damage. In this paper, we propose a camera-based system that detects a set of normal and unwanted behaviors using deep learning algorithms to monitor dogs when left alone at home. The frames collected from the camera are arranged into sequences of RGB frames and their corresponding optical flow sequences, and then features are extracted from each data flow using pre-trained VGG-16 models. The extracted features from each sequence are concatenated and input to a bi-directional LSTM network that classifies the dog action into one of the targeted classes. The experimental results show that our method achieves a good performance exceeding 0.9 in precision, recall and f-1 score.
https://doi.org/10.3745/PKIPS.y2019m05a.419 인용 PDF

Search Result 1,194, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)