• Title/Summary/Keyword: Image Recognition System

Object Detection on the Road Environment Using Attention Module-based Lightweight Mask R-CNN (주의 모듈 기반 Mask R-CNN 경량화 모델을 이용한 도로 환경 내 객체 검출 방법)

  • Song, Minsoo;Kim, Wonjun;Jang, Rae-Young;Lee, Ryong;Park, Min-Woo;Lee, Sang-Hwan;Choi, Myung-seok
    • Journal of Broadcast Engineering / v.25 no.6 / pp.944-953 / 2020
  • Object detection plays a crucial role in self-driving systems. With advances in image recognition based on deep convolutional neural networks, research on object detection has been actively pursued. In this paper, we propose a lightweight version of Mask R-CNN, which has been most widely used for object detection, to efficiently predict the location and shape of various objects in the road environment. Furthermore, feature maps are adaptively re-calibrated to improve detection performance by applying an attention module to the network layers that play different roles within Mask R-CNN. Experimental results on real driving scenes demonstrate that the proposed method maintains high detection performance with significantly fewer network parameters.
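
The abstract does not say which attention module is used, so the following is only a minimal sketch of one common choice, squeeze-and-excitation style channel attention, to illustrate what "adaptively re-calibrating feature maps" can mean. The function name `channel_attention`, the weight matrices, and the reduction ratio `r` are all hypothetical and are not taken from the paper.

```python
import numpy as np


def channel_attention(features, w_reduce, w_expand):
    """Re-weight the channels of a (C, H, W) feature map with learned gates.

    Assumption: squeeze-and-excitation style attention; the paper's actual
    module may differ.
    """
    # Squeeze: global average pooling gives one descriptor per channel.
    squeezed = features.mean(axis=(1, 2))                # shape (C,)
    # Excitation: bottleneck layer with ReLU, then sigmoid gates in (0, 1).
    hidden = np.maximum(w_reduce @ squeezed, 0.0)        # shape (C // r,)
    gates = 1.0 / (1.0 + np.exp(-(w_expand @ hidden)))   # shape (C,)
    # Re-calibration: scale every channel by its gate.
    return features * gates[:, None, None], gates


# Toy input with random (untrained) weights, purely for illustration.
rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2
features = rng.standard_normal((C, H, W))
w_reduce = rng.standard_normal((C // r, C)) * 0.1
w_expand = rng.standard_normal((C, C // r)) * 0.1

recalibrated, gates = channel_attention(features, w_reduce, w_expand)
print(recalibrated.shape, gates.round(3))
```

In a trained network the two weight matrices would be learned, so informative channels receive gates close to 1 and less useful ones are suppressed, at the cost of only a few extra parameters.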

Fat Client-Based Abstraction Model of Unstructured Data for Context-Aware Service in Edge Computing Environment (에지 컴퓨팅 환경에서의 상황인지 서비스를 위한 팻 클라이언트 기반 비정형 데이터 추상화 방법)

  • Kim, Do Hyung;Mun, Jong Hyeok;Park, Yoo Sang;Choi, Jong Sun;Choi, Jae Young
    • KIPS Transactions on Computer and Communication Systems / v.10 no.3 / pp.59-70 / 2021
  • With recent advances in the Internet of Things, context-aware systems that provide customized services have become important. Existing context-aware systems analyze data generated around the user and abstract context information that expresses the state of a situation. However, such data is mostly unstructured and difficult to process with simple approaches, so context-aware services built on it need a simplified way of managing it. Deep learning applications are one example that handles unstructured data. In these applications the steps from data acquisition to analysis are tightly coupled, which makes them inflexible when the target analysis model or application is modified for functional scalability. Therefore, an abstraction model that separates the phases and processes unstructured data for analysis is proposed. The proposed abstraction uses a description language named Analysis Model Description Language (AMDL) to deploy the analysis phases to fat clients, where each fat client is an instance specifically designed for resource-oriented tasks in edge computing environments, and it describes how to handle different analysis applications and their factors using AMDL and fat client profiles. The experiment demonstrates functional scalability through examples of AMDL and fat client profiles targeting a vehicle image recognition model for a vehicle access control notification service, and conducts process-by-process monitoring of the collection, preprocessing, and analysis of unstructured data.

Object Detection Based on Hellinger Distance IoU and Objectron Application (Hellinger 거리 IoU와 Objectron 적용을 기반으로 하는 객체 감지)

  • Kim, Yong-Gil;Moon, Kyung-Il
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.2 / pp.63-70 / 2022
  • Although 2D object detection has improved greatly in recent years with advances in deep learning methods and the use of large labeled image datasets, 3D object detection from 2D imagery remains a challenging problem in applications such as robotics, due to the lack of data and the diversity of appearances and shapes of objects within a category. Google has announced the launch of Objectron, which has a novel data pipeline using mobile augmented reality session data. However, it is also a 2D-driven 3D object detection technique. This study explores a more mature 2D object detection method and applies its 2D projection to the Objectron 3D lifting system. Most object detection methods use bounding boxes to encode and represent object shape and location. In this work, we explore a stochastic representation of object regions using Gaussian distributions. We also present a similarity measure for the Gaussian distributions based on the Hellinger distance, which can be viewed as a stochastic Intersection-over-Union. Our experimental results show that the proposed Gaussian representations are closer to the annotated segmentation masks in available datasets. Thus, the low accuracy that is one of several limitations of Objectron can be alleviated.
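
The idea of a Hellinger-distance-based "stochastic IoU" can be sketched as follows. This is an illustration under stated assumptions, not the paper's exact formulation: each axis-aligned box `(cx, cy, w, h)` is mapped to a 2D Gaussian with diagonal covariance `(w²/12, h²/12)` (the variance of a uniform distribution over the box), and the similarity is taken to be one minus the Hellinger distance. The function names are hypothetical.

```python
import math


def box_to_gaussian(cx, cy, w, h):
    # Assumption: mean at the box centre, variance of a uniform
    # distribution over the box extent along each axis.
    return (cx, cy), (w * w / 12.0, h * h / 12.0)


def bhattacharyya_coefficient(g1, g2):
    """Closed-form overlap of two Gaussians with diagonal covariance."""
    (mu1, var1), (mu2, var2) = g1, g2
    bc = 1.0
    for m1, m2, v1, v2 in zip(mu1, mu2, var1, var2):
        # Shape term: equals 1 when the variances match.
        bc *= math.sqrt(2.0 * math.sqrt(v1 * v2) / (v1 + v2))
        # Location term: decays as the means move apart.
        bc *= math.exp(-((m1 - m2) ** 2) / (4.0 * (v1 + v2)))
    return bc


def hellinger_similarity(box_a, box_b):
    """Return 1 - H, where H is the Hellinger distance in [0, 1]."""
    bc = bhattacharyya_coefficient(box_to_gaussian(*box_a),
                                   box_to_gaussian(*box_b))
    distance = math.sqrt(max(0.0, 1.0 - bc))
    return 1.0 - distance


reference = (50.0, 50.0, 20.0, 40.0)
near = (52.0, 51.0, 22.0, 38.0)
far = (120.0, 50.0, 20.0, 40.0)
print(hellinger_similarity(reference, near))
print(hellinger_similarity(reference, far))
```

Unlike the conventional IoU, this score stays smooth and non-zero in its gradient even when two boxes do not overlap, which is one reason distribution-based measures are attractive as regression targets.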

The Effect of Rice Co-Brand Assets, Trust, and Attachment on Loyalty (쌀 공동브랜드의 자산, 신뢰, 애착이 충성도에 미치는 영향)

  • Kim, Shine
    • Journal of Digital Convergence / v.20 no.5 / pp.401-410 / 2022
  • This study examines the relationships among trust, attachment, and brand loyalty for co-brands of rice, the staple food of the people. Hypotheses were established on the basis of prior research, and a survey was developed. The survey was distributed to and collected from 163 rice farmers in Buyeo-gun, Chungcheongnam-do, and the responses were analyzed. The empirical results are as follows. First, hypothesis 1, that rice brand assets (brand awareness and image) have a positive relationship with trust, was statistically supported. In particular, the t values indicated that awareness had a greater effect on consumer trust than image. Second, hypothesis 2, that trust in agricultural rice brands has a positive influence on attachment and loyalty, was statistically supported. Here, brand trust had a greater effect on loyalty than on attachment. Third, the hypothesis that attachment to agricultural rice brands has a positive influence on loyalty was statistically supported. The strategic implications of this study are as follows. First, consumers should be given cues of trust (e.g., GAP of National Approval Licensing, Fam Tour), as they distrust the perceived quality of rice in the market. Second, because the effect of the origin of rice is questionable, the spread of production history systems should be used to prevent the mixing of rice varieties.

Intelligent Motion Pattern Recognition Algorithm for Abnormal Behavior Detections in Unmanned Stores (무인 점포 사용자 이상행동을 탐지하기 위한 지능형 모션 패턴 인식 알고리즘)

  • Young-june Choi;Ji-young Na;Jun-ho Ahn
    • Journal of Internet Computing and Services / v.24 no.6 / pp.73-80 / 2023
  • The recent steep increase in the minimum hourly wage has raised the burden of labor costs, and the share of unmanned stores has grown in the aftermath of COVID-19. As a result, theft targeting unmanned stores is also increasing. To prevent such theft, "Just Walk Out" systems have been introduced, and LiDAR sensors, weight sensors, and similar devices are used, or stores are checked manually through continuous CCTV monitoring. However, the more expensive the sensors, the higher the initial and operating costs of the store, and CCTV verification is of limited use because managers cannot monitor around the clock. In this paper, we propose an AI image-processing fusion algorithm that reduces this dependence on sensors and human monitoring, detects customers who perform abnormal behaviors such as theft at low cost in unmanned stores, and provides cloud-based notifications. In addition, this paper verifies the accuracy of each algorithm (motion capture using MediaPipe, object detection using YOLO, and the fusion algorithm) on behavior pattern data collected from unmanned stores, and demonstrates the performance of the fusion algorithm through various scenario designs.

Digital Library Interface Research Based on EEG, Eye-Tracking, and Artificial Intelligence Technologies: Focusing on the Utilization of Implicit Relevance Feedback (뇌파, 시선추적 및 인공지능 기술에 기반한 디지털 도서관 인터페이스 연구: 암묵적 적합성 피드백 활용을 중심으로)

  • Hyun-Hee Kim;Yong-Ho Kim
    • Journal of the Korean Society for Information Management / v.41 no.1 / pp.261-282 / 2024
  • This study proposed and evaluated electroencephalography (EEG)-based and eye-tracking-based methods to determine relevance by utilizing users' implicit relevance feedback while navigating content in a digital library. For this, EEG/eye-tracking experiments were conducted on 32 participants using video, image, and text data. To assess the usefulness of the proposed methods, deep learning-based artificial intelligence (AI) techniques were used as a competitive benchmark. The evaluation results showed that EEG component-based methods (av_P600 and f_P3b components) demonstrated high classification accuracy in selecting relevant videos and images (faces/emotions). In contrast, AI-based methods, specifically object recognition and natural language processing, showed high classification accuracy for selecting images (objects) and texts (newspaper articles). Finally, guidelines for implementing a digital library interface based on EEG, eye-tracking, and artificial intelligence technologies have been proposed. Specifically, a system model based on implicit relevance feedback has been presented. Moreover, to enhance classification accuracy, methods suitable for each media type have been suggested, including EEG-based, eye-tracking-based, and AI-based approaches.

Development and mathematical performance analysis of custom GPTs-Based chatbots (GPTs 기반 문제해결 맞춤형 챗봇 제작 및 수학적 성능 분석)

  • Kwon, Misun
    • Education of Primary School Mathematics / v.27 no.3 / pp.303-320 / 2024
  • This study presents the development and performance evaluation of a custom GPT-based chatbot tailored to provide solutions following Polya's problem-solving stages. A beta version of the chatbot was initially deployed to assess its mathematical capabilities, followed by iterative error identification and correction, leading to the final version. The completed chatbot demonstrated an accuracy rate of approximately 89.0%, correctly solving an average of 57.8 out of 65 image-based problems from a 6th-grade elementary mathematics textbook, reflecting a 4 percentage point improvement over the beta version. For a subset of 50 problems, where images were not critical for problem resolution, the chatbot achieved an accuracy rate of approximately 91.0%, solving an average of 45.5 problems correctly. Predominant errors included problem recognition issues, particularly with complex or poorly recognizable images, along with concept confusion and comprehension errors. The custom chatbot exhibited superior mathematical performance compared to the general-purpose ChatGPT. Additionally, its solution process can be adapted to various grade levels, facilitating personalized student instruction. The ease of chatbot creation and customization underscores its potential for diverse applications in mathematics education, such as individualized teacher support and personalized student guidance.

Feasibility of Deep Learning Algorithms for Binary Classification Problems (이진 분류문제에서의 딥러닝 알고리즘의 활용 가능성 평가)

  • Kim, Kitae;Lee, Bomi;Kim, Jong Woo
    • Journal of Intelligence and Information Systems / v.23 no.1 / pp.95-108 / 2017
  • Recently, AlphaGo, the Baduk (Go) artificial intelligence program developed by Google DeepMind, won a decisive victory against Lee Sedol. Many people thought that machines would not be able to beat a human at Go because, unlike chess, the number of possible paths for each move exceeds the number of atoms in the universe, but the result was the opposite of what people predicted. After the match, artificial intelligence was highlighted as a core technology of the fourth industrial revolution and attracted attention from various application domains. In particular, deep learning has drawn attention as the core artificial intelligence technology used in the AlphaGo algorithm. Deep learning is already being applied to many problems and performs especially well in image recognition. It also performs well on high-dimensional data such as voice, images, and natural language, where it was difficult to achieve good performance with existing machine learning techniques. In contrast, it is difficult to find deep learning research on traditional business data and structured data analysis. In this study, we examine whether the deep learning techniques studied so far can be used not only for the recognition of high-dimensional data but also for binary classification problems in traditional business data analysis, such as customer churn analysis, marketing response prediction, and default prediction. We also compare the performance of the deep learning techniques with that of traditional artificial neural network models. The experimental data is the telemarketing response data of a bank in Portugal. It has input variables such as age, occupation, loan status, and the number of previous telemarketing contacts, and a binary target variable that records whether the customer intends to open an account.
In this study, to evaluate the feasibility of deep learning algorithms and techniques for binary classification, we compared the performance of various models using CNN, LSTM, and dropout, which are widely used in deep learning, with that of MLP models, a traditional artificial neural network model. However, since not every network design alternative can be tested, owing to the nature of artificial neural networks, the experiment was conducted under restricted settings for the number of hidden layers, the number of neurons in each hidden layer, the number of outputs (filters), and the conditions for applying dropout. The F1 score, rather than overall accuracy, was used to evaluate how well the models classify the class of interest. The detailed methods for applying each deep learning technique in the experiment are as follows. The CNN algorithm reads values adjacent to a given value and recognizes features from them, but in business data the distance between fields carries no meaning because each field is usually independent. In this experiment, we set the filter size of the CNN to the number of fields so that it learns the characteristics of the whole record at once, and added a hidden layer to make decisions based on the additional features. For the model with two LSTM layers, the input direction of the second layer is reversed relative to the first layer in order to reduce the influence of the position of each field. For dropout, we set neurons to be dropped with a probability of 0.5 in each hidden layer. The experimental results show that the model with the highest F1 score was the CNN model using dropout, and the next best was the MLP model with two hidden layers using dropout.
In this study, we obtained several findings as the experiment proceeded. First, models using dropout make slightly more conservative predictions than those without it, and generally show better classification performance. Second, CNN models show better classification performance than MLP models. This is interesting because CNN has performed well on binary classification problems, to which it has rarely been applied, as well as in the fields where its effectiveness has been proven. Third, the LSTM algorithm seems to be unsuitable for binary classification problems because the training time is too long relative to the performance improvement. From these results, we can confirm that some deep learning algorithms can be applied to solve business binary classification problems.
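
The choice of the F1 score over overall accuracy can be illustrated with a small sketch. The labels below are made up for illustration and are not the bank telemarketing data; the example simply assumes the class of interest (label 1) is a minority, in which case a model that never predicts it can still look accurate.

```python
def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# Hypothetical data: 10 responders (1) among 100 customers.
y_true = [1] * 10 + [0] * 90
# Baseline that always predicts "no response".
always_negative = [0] * 100
# Model that finds 6 of the 10 responders and raises 4 false alarms.
model = [1] * 6 + [0] * 4 + [1] * 4 + [0] * 86

print(accuracy(y_true, always_negative), f1_score(y_true, always_negative))
print(accuracy(y_true, model), f1_score(y_true, model))
```

The two predictors differ by only two points of accuracy, while the F1 score separates them clearly, which is the behavior the abstract relies on when ranking the CNN, LSTM, and MLP models.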

Analyze Technologies and Trends in Commercialized Radiology Artificial Intelligence Medical Device (상용화된 영상의학 인공지능 의료기기의 기술 및 동향 분석)

  • Chang-Hwa Han
    • Journal of the Korean Society of Radiology / v.17 no.6 / pp.881-887 / 2023
  • This study aims to analyze the development and current trends of AI-based medical imaging devices commercialized in South Korea. As of September 30, 2023, there were a total of 186 AI-based medical devices licensed, certified, and reported to the Korean Ministry of Food and Drug Safety, of which 138 were related to imaging. The study comprehensively examined the yearly approval trends, equipment types, application areas, and key functions from 2018 to 2023. The study found that the number of AI medical devices started from four products in 2018 and grew steadily until 2023, with a sharp increase after 2020. This can be attributed to the interaction between the advancement of AI technology and the increasing demand in the medical field. By equipment, AI medical devices were developed in the order of CT, X-ray, and MR, which reflects the characteristics and clinical importance of the images of each equipment. This study found that the development of AI medical devices for specific areas such as the thorax, cranial nerves, and musculoskeletal system is active, and the main functions are medical image analysis, detection and diagnosis assistance, and image transmission. These results suggest that AI's pattern recognition and data analysis capabilities are playing an important role in the medical imaging field. In addition, this study examined the number of Korean products that have received international certifications, particularly the US FDA and European CE. The results show that many products have been certified by both organizations, indicating that Korean AI medical devices are in line with international standards and are competitive in the global market. By analyzing the impact of AI technology on medical imaging and its potential for development, this study provides important implications for future research and development directions. 
However, challenges such as regulatory aspects, data quality and accessibility, and clinical validity are also pointed out, requiring continued research and improvement on these issues.

A Comparative Study of Food Habits and Body Satisfaction of Middle School Students According to Clinical Symptoms (일부 남녀 중학생의 건강 관련 임상증상에 따른 식습관과 체헝관심도에 관한 연구)

  • Sung, Chung-Ja
    • Journal of the Korean Society of Food Science and Nutrition / v.34 no.2 / pp.202-208 / 2005
  • This study was conducted to examine the food habits, knowledge of nutrition, and actual food intake of adolescent middle school students based on questionnaire answers. Questionnaires were completed by 524 students, divided into a healthy group (n=289) and an unhealthy group (n=235) according to clinical signs. Further questions were asked of the two groups in the areas of food habits, knowledge of nutrition, and nutritional attitude. The results were as follows: Mean age of all subjects was 14; heights for male and female students were 162.0 cm and 157.2 cm, and weights were 53.4 kg and 49.4 kg, respectively. Heights and weights of male students were greater than those of female students. The body mass index (BMI) for male and female students was 20.3 kg/$m^2$ and 20.0 kg/$m^2$, respectively, and all data were within normal ranges. There were no significant differences in mean age, height, weight, and BMI between the healthy and unhealthy groups. There was no significant difference in body image recognition between the two groups, although the ratio of dissatisfaction with their own body shape was significantly higher in the female unhealthy group (46.1%) than in the female healthy group (33.0%) (p<0.05). In the area of the struggle to control body weight during the previous year, the female unhealthy group (59.4%) was higher than the female healthy group (38.4%) (p<0.01). There was no significant difference in the scores between the two groups in the areas of knowledge of nutrition and nutritional attitude. Meal frequency and meal patterns showed that having breakfast less than 4x/week was significantly more common in the female unhealthy group (44.0%) than in the female healthy group (30.7%) (p<0.01). For suppers (<4x/week), the female unhealthy group (18.8%) was also higher than the female healthy group (10.7%). Therefore, the unhealthy group exhibited a higher pattern of missing both breakfast and supper.
The male unhealthy group (16.7%) dined out more frequently than the male healthy group (12.3%) (p<0.01), and the female unhealthy group also indulged in snacking significantly more frequently than the female healthy group. The unhealthy group also ate only one item for meals more frequently than the healthy group, but the difference was not significant. The conclusion of this study is that adolescent Korean middle school students who showed a higher incidence of clinical symptoms, representing an unhealthy status, missed breakfast and supper, dined out, and indulged in snacking more frequently. Their quality of breakfast and satisfaction with body image were also lower than those of the healthy group. These results indicate that there is a high correlation between a Korean adolescent's health status, food habits, and body image satisfaction. It is recommended that a more intense program of nutritional education and monitoring be introduced into the current Korean middle-school system in order to optimally support and maximize the health potential of the current population of Korean students.