• Title/Summary/Keyword: recognized images

Search Result 509, Processing Time 0.022 seconds

Development of Convolutional Neural Network Basic Practice Cases (합성곱 신경망 기초 실습 사례 개발)

  • Hur, Kyeong
    • Journal of Practical Engineering Education
    • /
    • v.14 no.2
    • /
    • pp.279-285
    • /
    • 2022
  • In this paper, as a liberal arts course for non-majors, we developed a basic practice case for convolutional neural networks, which is essential for designing a basic convolutional neural network course curriculum. The developed practice case focuses on understanding the working principle of the convolutional neural network and uses a spreadsheet to check the entire visualized process. The developed practice case consisted of generating supervised learning method image training data, implementing the input layer, convolution layer (convolutional layer), pooling layer, and output layer sequentially, and testing the performance of the convolutional neural network on new data. By extending the practice cases developed in this paper, the number of images to be recognized can be expanded, or basic practice cases can be made to create a convolutional neural network that increases the compression rate for high-quality images. Therefore, it can be said that the utility of this convolutional neural network basic practice case is high.

A Study on Vision-based Calibration Method for Bin Picking Robots for Semiconductor Automation (반도체 자동화를 위한 빈피킹 로봇의 비전 기반 캘리브레이션 방법에 관한 연구)

  • Kyo Mun Ku;Ki Hyun Kim;Hyo Yung Kim;Jae Hong Shim
    • Journal of the Semiconductor & Display Technology
    • /
    • v.22 no.1
    • /
    • pp.72-77
    • /
    • 2023
  • In many manufacturing settings, including the semiconductor industry, products are completed by producing and assembling various components. Sorting out from randomly mixed parts and classification operations takes a lot of time and labor. Recently, many efforts have been made to select and assemble correct parts from mixed parts using robots. Automating the sorting and classification of randomly mixed components is difficult since various objects and the positions and attitudes of robots and cameras in 3D space need to be known. Previously, only objects in specific positions were grasped by robots or people sorting items directly. To enable robots to pick up random objects in 3D space, bin picking technology is required. To realize bin picking technology, it is essential to understand the coordinate system information between the robot, the grasping target object, and the camera. Calibration work to understand the coordinate system information between them is necessary to grasp the object recognized by the camera. It is difficult to restore the depth value of 2D images when 3D restoration is performed, which is necessary for bin picking technology. In this paper, we propose to use depth information of RGB-D camera for Z value in rotation and movement conversion used in calibration. Proceed with camera calibration for accurate coordinate system conversion of objects in 2D images, and proceed with calibration of robot and camera. We proved the effectiveness of the proposed method through accuracy evaluations for camera calibration and calibration between robots and cameras.

  • PDF

Enhancing Alzheimer's Disease Classification using 3D Convolutional Neural Network and Multilayer Perceptron Model with Attention Network

  • Enoch A. Frimpong;Zhiguang Qin;Regina E. Turkson;Bernard M. Cobbinah;Edward Y. Baagyere;Edwin K. Tenagyei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.11
    • /
    • pp.2924-2944
    • /
    • 2023
  • Alzheimer's disease (AD) is a neurological condition that is recognized as one of the primary causes of memory loss. AD currently has no cure. Therefore, the need to develop an efficient model with high precision for timely detection of the disease is very essential. When AD is detected early, treatment would be most likely successful. The most often utilized indicators for AD identification are the Mini-mental state examination (MMSE), and the clinical dementia. However, the use of these indicators as ground truth marking could be imprecise for AD detection. Researchers have proposed several computer-aided frameworks and lately, the supervised model is mostly used. In this study, we propose a novel 3D Convolutional Neural Network Multilayer Perceptron (3D CNN-MLP) based model for AD classification. The model uses Attention Mechanism to automatically extract relevant features from Magnetic Resonance Images (MRI) to generate probability maps which serves as input for the MLP classifier. Three MRI scan categories were considered, thus AD dementia patients, Mild Cognitive Impairment patients (MCI), and Normal Control (NC) or healthy patients. The performance of the model is assessed by comparing basic CNN, VGG16, DenseNet models, and other state of the art works. The models were adjusted to fit the 3D images before the comparison was done. Our model exhibited excellent classification performance, with an accuracy of 91.27% for AD and NC, 80.85% for MCI and NC, and 87.34% for AD and MCI.

Ethics for Artificial Intelligence: Focus on the Use of Radiology Images (인공지능 의료윤리: 영상의학 영상데이터 활용 관점의 고찰)

  • Seong Ho Park
    • Journal of the Korean Society of Radiology
    • /
    • v.83 no.4
    • /
    • pp.759-770
    • /
    • 2022
  • The importance of ethics in research and the use of artificial intelligence (AI) is increasingly recognized not only in the field of healthcare but throughout society. This article intends to provide domestic readers with practical points regarding the ethical issues of using radiological images for AI research, focusing on data security and privacy protection and the right to data. Therefore, this article refers to related domestic laws and government policies. Data security and privacy protection is a key ethical principle for AI, in which proper de-identification of data is crucial. Sharing healthcare data to develop AI in a way that minimizes business interests is another ethical point to be highlighted. The need for data sharing makes the data security and privacy protection even more important as data sharing increases the risk of data breach.

A Study on the correlation between a streetscape image and a signboard density - Focused on roadside buildings occupation density of signboard in the business area - (가로경관이미지와 간판밀도와의 상관관계에 관한 연구 - 상업지역 연도건물의 간판 점유밀도를 중심으로 -)

  • Kim, Yun-Hee;Rhee, Jae-Won
    • Archives of design research
    • /
    • v.18 no.4 s.62
    • /
    • pp.287-296
    • /
    • 2005
  • The street image in a business area is so much affected by Facade that the front side of a roadside building makes. Recently, for the indiscreet and intemperate advertising signboard of the front side of roadside buildings, a streetscape becomes more disordered than before, so now we need to do research about signboards of roadside buildings for a streetscape image. In this research, we focused on a streetscape with difference of occupation density of signboard in the business area via investigation and analysis about occupation density of signboards of the front side of roadside buildings, and we suggested optimum occupation density of signboards for supporting the road image positively. An object of research is the street in the business area that has many pedestrians and active passing zone of cars. We investigated and analyzed how to feel street images on the rate of occupation density of roadside building's signboards of in the chosen street. As a result of using an adjective that we use for estimating street view images for extraction of street images, we could know 2 factors. We named that one is the image of recognition, and the other is the image of feelings. We knew that signboard density of street of heavily recognized images is from 20% to 30% and, signboard density of street of heavily feeling images is from 50% to 60%. We also could know that people feel both images of recognition and images of feeling in specific density, 30 to 50%. Through this result of research, we can suggest Facade on signboard density with the recognition and the feeling and use images of the street view as materials to be more specific and more special.

  • PDF

Optical Music Score Recognition System for Smart Mobile Devices

  • Han, SeJin;Lee, GueeSang
    • International Journal of Contents
    • /
    • v.10 no.4
    • /
    • pp.63-68
    • /
    • 2014
  • In this paper, we propose a smart system that can optically recognize a music score within a document and can play the music after recognition. Many historic handwritten documents have now been digitalized. Converting images of a music score within documents into digital files is particularly difficult and requires considerable resources because a music score consists of a 2D structure with both staff lines and symbols. The proposed system takes an input image using a mobile device equipped with a camera module, and the image is optimized via preprocessing. Binarization, music sheet correction, staff line recognition, vertical line detection, note recognition, and symbol recognition processing are then applied, and a music file is generated in an XML format. The Music XML file is recorded as digital information, and based on that file, we can modify the result, logically correct errors, and finally generate a MIDI file. Our system reduces misrecognition, and a wider range of music score can be recognized because we have implemented distortion correction and vertical line detection. We show that the proposed method is practical, and that is has potential for wide application through an experiment with a variety of music scores.

Development of a 2D Posture Measurement System to Evaluate Musculoskeletal Workload (근골격계 부하 평가를 위한 2차원 자세 측정 시스템 개발)

  • Park, Sung-Joon;Park, Jae-Kyu;Choe, Jae-Ho
    • Journal of the Ergonomics Society of Korea
    • /
    • v.24 no.3
    • /
    • pp.43-52
    • /
    • 2005
  • A two-dimensional posture measurement system was developed to evaluate the risks of work-related musculoskeletal disorders(MSDs) easily on various conditions of work. The posture measurement system is an essential tool to analyze the workload for preventing work-related musculoskeletal disorders. Although several posture measurement systems have been developed for workload assessment, some restrictions in industry still exist because of its difficulty on measuring work postures. In this study, an image recognition algorithm was developed based on a neural network method to measure work posture. Each joint angle of human body was automatically measured from the recognized images through the algorithm, and the measurement system makes it possible to evaluate the risks of work-related musculoskeletal disorders easily on various working conditions. The validation test on upper body postures was carried out to examine the accuracy of the measured joint angle data from the system, and the results showed good measuring performance for each joint angle. The differences between the joint angles measured directly and the angles measured by posture measurement software were not statistically significant. It is expected that the result help to properly estimate physical workload and can be used as a postural analysis system to evaluate the risk of work-related musculoskeletal disorders in industry.

An Analysis of the Repetition Pattern in Green facade focusing on the Biophilic Design (벽면녹화의 패턴 표현방법에 따른 반복패턴 디자인 특성 분석)

  • Jung, HeeYoung;Lee, Hyunsoo
    • Korean Institute of Interior Design Journal
    • /
    • v.25 no.1
    • /
    • pp.81-92
    • /
    • 2016
  • Within green facades, the greening side of the wall is first recognized by people, leaving a visually lasting impression. As facades play a crucial factor in forming street image, their design can be considered most important. In modern days, 'patterns,' one of the popular elements of design, has developed into becoming a method of expressing architects' emotions or images as well as ways of seeking satisfaction. As opposed to recent overseas movement where patterned green facades have been widely utilized, the domestic trend still remains in outdoor green facades without patterns. This study, focusing on overseas patterned green facades, classifies the facade pattern's expressive methods into two greater parts, and four categories. Furthermore, among the elements and properties from Biophilic pattern guidelines, we specifically focus on 'Repetition Pattern,' which corresponds to 'Complexity & Order.' Biophilic design has the notion of pursuing an environment that aids modern people's comfort and well being. Providing information on patterned green facades that have largely gained popularity, this study also presents its aesthetic directions that may be applicable domestically in the future.

Development of a 3D Modeling System using a variety of images based on Ubiquitous Environment (유비쿼터스 기반의 다양한 영상을 활용한 3D Modeling System의 구축)

  • Kim, Woo-Sun;Heo, Joon;Shim, Jae-Hyun;Choi, Woo-Jung
    • 한국방재학회:학술대회논문집
    • /
    • 2007.02a
    • /
    • pp.418-421
    • /
    • 2007
  • It is important to maintain information by application or 3D modeling through the satellite and UAV image which is a real world. The prevention business has recognized the need for accurate 3-D geospatial information around the disaster region to identify objects to 3D modeling. In this paper, we presented an approach to create 3D model and loading, processing the image using GIS techniques, and the digital topographic maps were used for the DEM and the features of the area. The result is a implementation of the simple application that illustrates the objects in 3-D. The presented approach will be used for identifying objects and assisting in regional planning around the airfields.

  • PDF

Recognition of the Passport by Using Fuzzy Binarization and Enhanced Fuzzy Neural Networks

  • Kim, Kwang-Baek
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.603-607
    • /
    • 2003
  • The judgment of forged passports plays an important role in the immigration control system, for which the automatic and accurate processing is required because of the rapid increase of travelers. So, as the preprocessing phase for the judgment of forged passports, this paper proposed the novel method for the recognition of passport based on the fuzzy binarization and the fuzzy RBF neural network newly proposed. first, for the extraction of individual codes being recognized, the paper extracts code sequence blocks including individual codes by applying the Sobel masking, the horizontal smearing and the contour tracking algorithm in turn to the passport image, binarizes the extracted blocks by using the fuzzy binarization based on the membership function of trapezoid type, and, as the last step, recovers and extracts individual codes from the binarized areas by applying the CDM masking and the vertical smearing. Next, the paper proposed the enhanced fuzzy RBF neural network that adapts the enhanced fuzzy ART network to the middle layer and applied to the recognition of individual codes. The results of the experiment for performance evaluation on the real passport images showed that the proposed method in the paper has the improved performance in the recognition of passport.

  • PDF