• Title/Summary/Keyword: Image-based Recognition Technology

Search Result 583, Processing Time 0.028 seconds

Design Plan for Digital Textbooks Applying Augmented Reality Image Recognition Technology -A Study on the Digital Textbooks for Middle School Science 1- (증강현실(AR) 영상인식 기술을 적용한 디지털 교과서 디자인 기획 -중학교 과학1 디지털 교과서 중심으로-)

  • Yoo, Young-Mi;Jo, Seong-Hwan
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.6
    • /
    • pp.353-363
    • /
    • 2018
  • According to the Digi Capital forecast, the global augmented reality market is expected to grow rapidly by 2020 to reach 150 billion dollars. In particular, high value added effects are expected in education. As ICT advances, digital textbooks are also leading innovative education by adding interactive functions. Advanced countries, including the U.S., are already using digital textbooks that use augmented reality technology in their classes. In line with this technological outlook, the ministry proposed a design plan that applies augmented reality technology to middle school science 1 digital textbooks. A study on middle school science 1 digital textbooks showed that each unit provided short videos. In addition, an investigation into the augmented reality class case showed that it was difficult to establish experimental equipment, lack of equipment (devices), and 3D design contents that did not continue despite the excellence of learning effects. Based on this demand, we designed an augmented reality scenario and system configuration to be applied to the instrument-specific experiments of middle school science 1 digital textbooks to explore and explore the contents of augmented reality by students. This research will replace the dangerous experiments and time consuming experiments for teachers and students by applying augmented reality to science subjects that are essential for the development of digital textbooks.

Development of Android Smartphone App for Corner Point Feature Extraction using Remote Sensing Image (위성영상정보 기반 코너 포인트 객체 추출 안드로이드 스마트폰 앱 개발)

  • Kang, Sang-Goo;Lee, Ki-Won
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.1
    • /
    • pp.33-41
    • /
    • 2011
  • In the information communication technology, it is world-widely apparent that trend movement from internet web to smartphone app by users demand and developers environment. So it needs kinds of appropriate technological responses from geo-spatial domain regarding this trend. However, most cases in the smartphone app are the map service and location recognition service, and uses of geo-spatial contents are somewhat on the limited level or on the prototype developing stage. In this study, app for extraction of corner point features using geo-spatial imagery and their linkage to database system are developed. Corner extraction is based on Harris algorithm, and all processing modules in database server, application server, and client interface composing app are designed and implemented based on open source. Extracted corner points are applied LOD(Level of Details) process to optimize on display panel. Additional useful function is provided that geo-spatial imagery can be superimposed with the digital map in the same area. It is expected that this app can be utilized to automatic establishment of POI (Point of Interests) or point-based land change detection purposes.

A System of Audio Data Analysis and Masking Personal Information Using Audio Partitioning and Artificial Intelligence API (오디오 데이터 내 개인 신상 정보 검출과 마스킹을 위한 인공지능 API의 활용 및 음성 분할 방법의 연구)

  • Kim, TaeYoung;Hong, Ji Won;Kim, Do Hee;Kim, Hyung-Jong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.30 no.5
    • /
    • pp.895-907
    • /
    • 2020
  • With the recent increasing influence of multimedia content other than the text-based content, services that help to process information in content brings us great convenience. These services' representative features are searching and masking the sensitive data. It is not difficult to find the solutions that provide searching and masking function for text information and image. However, even though we recognize the necessity of the technology for searching and masking a part of the audio data, it is not easy to find the solution because of the difficulty of the technology. In this study, we propose web application that provides searching and masking functions for audio data using audio partitioning method. While we are achieving the research goal, we evaluated several speech to text conversion APIs to choose a proper API for our purpose and developed regular expressions for searching sensitive information. Lastly we evaluated the accuracy of the developed searching and masking feature. The contribution of this work is in design and implementation of searching and masking a sensitive information from the audio data by the various functionality proving experiments.

A Study on the Type and Sense of Place of the Lighting Design of Urban Public Space (도시 공공공간 조명디자인 유형과 장소성에 관한 연구)

  • Ma, Dong Qing;Yoon, Ji Young
    • Korea Science and Art Forum
    • /
    • v.27
    • /
    • pp.101-114
    • /
    • 2017
  • Based on the relationship between urban public space, urban lighting and the sense of place, this paper aims to analyze the lighting environment types with the sense of place and their characteristics. First, with the theory study as the research foundation, it extracts six spatial factors of public space lighting design and then analyzes 12 relevant cases on the basis. Finally, it divides the 12 cases into four types, Basic types, Storytelling, Interactive and Multi-Media and analyzes the core design factor and characteristics of various types. The results show that: first, functionality, sustainability and aesthetics are the basic factors to realize the urban public space lighting places. Second, the six cases of "Storytelling" show that the theme of specific areas, namely the exploration of "story" is conducive for lighting design to form clear and definite environment recognition. Third, for "Interactive" and "Multi-Media", the intervention of new media technology and new lighting way has made the wide expansion of urban lighting design connotation and extension. The research results show that strengthening the urban location performance by the lighting design could improve the city image, which provides the basis for the development of urban public space lighting design.

A Study on Automatic Precision Landing for Small UAV's Industrial Application (소형 UAV의 산업 응용을 위한 자동 정밀 착륙에 관한 연구)

  • Kim, Jong-Woo;Ha, Seok-Wun;Moon, Yong-Ho
    • Journal of Convergence for Information Technology
    • /
    • v.7 no.3
    • /
    • pp.27-36
    • /
    • 2017
  • In almost industries, such as the logistics industry, marine fisheries, agriculture, industry, and services, small unmanned aerial vehicles are used for aerial photographing or closing flight in areas where human access is difficult or CCTV is not installed. Also, based on the information of small unmanned aerial photographing, application research is actively carried out to efficiently perform surveillance, control, or management. In order to carry out tasks in a mission-based manner in which the set tasks are assigned and the tasks are automatically performed, the small unmanned aerial vehicles must not only fly steadily but also be able to charge the energy periodically, In addition, the unmanned aircraft need to land automatically and precisely at certain points after the end of the mission. In order to accomplish this, an automatic precision landing method that leads landing by continuously detecting and recognizing a marker located at a landing point from a video shot of a small UAV is required. In this paper, it is shown that accurate and stable automatic landing is possible even if simple template matching technique is applied without using various recognition methods that require high specification in using low cost general purpose small unmanned aerial vehicle. Through simulation and actual experiments, the results show that the proposed method will be made good use of industrial fields.

Comparative Analysis of CNN Deep Learning Model Performance Based on Quantification Application for High-Speed Marine Object Classification (고속 해상 객체 분류를 위한 양자화 적용 기반 CNN 딥러닝 모델 성능 비교 분석)

  • Lee, Seong-Ju;Lee, Hyo-Chan;Song, Hyun-Hak;Jeon, Ho-Seok;Im, Tae-ho
    • Journal of Internet Computing and Services
    • /
    • v.22 no.2
    • /
    • pp.59-68
    • /
    • 2021
  • As artificial intelligence(AI) technologies, which have made rapid growth recently, began to be applied to the marine environment such as ships, there have been active researches on the application of CNN-based models specialized for digital videos. In E-Navigation service, which is combined with various technologies to detect floating objects of clash risk to reduce human errors and prevent fires inside ships, real-time processing is of huge importance. More functions added, however, mean a need for high-performance processes, which raises prices and poses a cost burden on shipowners. This study thus set out to propose a method capable of processing information at a high rate while maintaining the accuracy by applying Quantization techniques of a deep learning model. First, videos were pre-processed fit for the detection of floating matters in the sea to ensure the efficient transmission of video data to the deep learning entry. Secondly, the quantization technique, one of lightweight techniques for a deep learning model, was applied to reduce the usage rate of memory and increase the processing speed. Finally, the proposed deep learning model to which video pre-processing and quantization were applied was applied to various embedded boards to measure its accuracy and processing speed and test its performance. The proposed method was able to reduce the usage of memory capacity four times and improve the processing speed about four to five times while maintaining the old accuracy of recognition.

Feasibility of Deep Learning Algorithms for Binary Classification Problems (이진 분류문제에서의 딥러닝 알고리즘의 활용 가능성 평가)

  • Kim, Kitae;Lee, Bomi;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.95-108
    • /
    • 2017
  • Recently, AlphaGo which is Bakuk (Go) artificial intelligence program by Google DeepMind, had a huge victory against Lee Sedol. Many people thought that machines would not be able to win a man in Go games because the number of paths to make a one move is more than the number of atoms in the universe unlike chess, but the result was the opposite to what people predicted. After the match, artificial intelligence technology was focused as a core technology of the fourth industrial revolution and attracted attentions from various application domains. Especially, deep learning technique have been attracted as a core artificial intelligence technology used in the AlphaGo algorithm. The deep learning technique is already being applied to many problems. Especially, it shows good performance in image recognition field. In addition, it shows good performance in high dimensional data area such as voice, image and natural language, which was difficult to get good performance using existing machine learning techniques. However, in contrast, it is difficult to find deep leaning researches on traditional business data and structured data analysis. In this study, we tried to find out whether the deep learning techniques have been studied so far can be used not only for the recognition of high dimensional data but also for the binary classification problem of traditional business data analysis such as customer churn analysis, marketing response prediction, and default prediction. And we compare the performance of the deep learning techniques with that of traditional artificial neural network models. The experimental data in the paper is the telemarketing response data of a bank in Portugal. It has input variables such as age, occupation, loan status, and the number of previous telemarketing and has a binary target variable that records whether the customer intends to open an account or not. In this study, to evaluate the possibility of utilization of deep learning algorithms and techniques in binary classification problem, we compared the performance of various models using CNN, LSTM algorithm and dropout, which are widely used algorithms and techniques in deep learning, with that of MLP models which is a traditional artificial neural network model. However, since all the network design alternatives can not be tested due to the nature of the artificial neural network, the experiment was conducted based on restricted settings on the number of hidden layers, the number of neurons in the hidden layer, the number of output data (filters), and the application conditions of the dropout technique. The F1 Score was used to evaluate the performance of models to show how well the models work to classify the interesting class instead of the overall accuracy. The detail methods for applying each deep learning technique in the experiment is as follows. The CNN algorithm is a method that reads adjacent values from a specific value and recognizes the features, but it does not matter how close the distance of each business data field is because each field is usually independent. In this experiment, we set the filter size of the CNN algorithm as the number of fields to learn the whole characteristics of the data at once, and added a hidden layer to make decision based on the additional features. For the model having two LSTM layers, the input direction of the second layer is put in reversed position with first layer in order to reduce the influence from the position of each field. In the case of the dropout technique, we set the neurons to disappear with a probability of 0.5 for each hidden layer. The experimental results show that the predicted model with the highest F1 score was the CNN model using the dropout technique, and the next best model was the MLP model with two hidden layers using the dropout technique. In this study, we were able to get some findings as the experiment had proceeded. First, models using dropout techniques have a slightly more conservative prediction than those without dropout techniques, and it generally shows better performance in classification. Second, CNN models show better classification performance than MLP models. This is interesting because it has shown good performance in binary classification problems which it rarely have been applied to, as well as in the fields where it's effectiveness has been proven. Third, the LSTM algorithm seems to be unsuitable for binary classification problems because the training time is too long compared to the performance improvement. From these results, we can confirm that some of the deep learning algorithms can be applied to solve business binary classification problems.

Analyze Technologies and Trends in Commercialized Radiology Artificial Intelligence Medical Device (상용화된 영상의학 인공지능 의료기기의 기술 및 동향 분석)

  • Chang-Hwa Han
    • Journal of the Korean Society of Radiology
    • /
    • v.17 no.6
    • /
    • pp.881-887
    • /
    • 2023
  • This study aims to analyze the development and current trends of AI-based medical imaging devices commercialized in South Korea. As of September 30, 2023, there were a total of 186 AI-based medical devices licensed, certified, and reported to the Korean Ministry of Food and Drug Safety, of which 138 were related to imaging. The study comprehensively examined the yearly approval trends, equipment types, application areas, and key functions from 2018 to 2023. The study found that the number of AI medical devices started from four products in 2018 and grew steadily until 2023, with a sharp increase after 2020. This can be attributed to the interaction between the advancement of AI technology and the increasing demand in the medical field. By equipment, AI medical devices were developed in the order of CT, X-ray, and MR, which reflects the characteristics and clinical importance of the images of each equipment. This study found that the development of AI medical devices for specific areas such as the thorax, cranial nerves, and musculoskeletal system is active, and the main functions are medical image analysis, detection and diagnosis assistance, and image transmission. These results suggest that AI's pattern recognition and data analysis capabilities are playing an important role in the medical imaging field. In addition, this study examined the number of Korean products that have received international certifications, particularly the US FDA and European CE. The results show that many products have been certified by both organizations, indicating that Korean AI medical devices are in line with international standards and are competitive in the global market. By analyzing the impact of AI technology on medical imaging and its potential for development, this study provides important implications for future research and development directions. However, challenges such as regulatory aspects, data quality and accessibility, and clinical validity are also pointed out, requiring continued research and improvement on these issues.

A Study on the Causes That Have Influence over the Effect of PPL in the Game (게임 속 PPL의 효과에 영향을 미치는 요인에 관한 연구)

  • Kim, Young-Rak;Cho, Youn-Gon;Choi, Gui-Young
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.8
    • /
    • pp.1256-1262
    • /
    • 2010
  • The game has been rapidly evolving through various media such as computer, console game machine, cellular phone and PMP based on the advanced development of scientific technology. In terms of demand, the interest in and desire to consume the game as a way of spending people's spare time have been on the increase constantly while the level of income has been improved. Eventually, the game has gradually expanded its scope of supply and demand, has established its own status as one of the media that is scientifically-intensive and has been developed into a game industry, a large-scale industry. Unlike image media, the methods of exposure in PPL are varied in accordance with the genre of games. This study divides the causes that have influence over the effect of PPL in the game into the genre of game and the skill of gamer. The results of the experiment on how much the aforementioned two elements have influence over the effect of PPL are as in the following: It has been demonstrated that the effect of PPL could appear different according to the genre of game and the skill of gamer on the game. Besides, the genre of game that is dynamic in its screen change in the game has relatively lower effect of PPL than that is not dynamic. Meanwhile, the persons who are highly skilled in the game have higher degree of recognition and preference to the inserted PPL than those who are lowly skilled. In this regard, it has given us a theoretical ground that the fees system for PPL ads should be established variously in accordance with the genre of game and the level of online game users.

A Comparative Study on the Effective Deep Learning for Fingerprint Recognition with Scar and Wrinkle (상처와 주름이 있는 지문 판별에 효율적인 심층 학습 비교연구)

  • Kim, JunSeob;Rim, BeanBonyka;Sung, Nak-Jun;Hong, Min
    • Journal of Internet Computing and Services
    • /
    • v.21 no.4
    • /
    • pp.17-23
    • /
    • 2020
  • Biometric information indicating measurement items related to human characteristics has attracted great attention as security technology with high reliability since there is no fear of theft or loss. Among these biometric information, fingerprints are mainly used in fields such as identity verification and identification. If there is a problem such as a wound, wrinkle, or moisture that is difficult to authenticate to the fingerprint image when identifying the identity, the fingerprint expert can identify the problem with the fingerprint directly through the preprocessing step, and apply the image processing algorithm appropriate to the problem. Solve the problem. In this case, by implementing artificial intelligence software that distinguishes fingerprint images with cuts and wrinkles on the fingerprint, it is easy to check whether there are cuts or wrinkles, and by selecting an appropriate algorithm, the fingerprint image can be easily improved. In this study, we developed a total of 17,080 fingerprint databases by acquiring all finger prints of 1,010 students from the Royal University of Cambodia, 600 Sokoto open data sets, and 98 Korean students. In order to determine if there are any injuries or wrinkles in the built database, criteria were established, and the data were validated by experts. The training and test datasets consisted of Cambodian data and Sokoto data, and the ratio was set to 8: 2. The data of 98 Korean students were set up as a validation data set. Using the constructed data set, five CNN-based architectures such as Classic CNN, AlexNet, VGG-16, Resnet50, and Yolo v3 were implemented. A study was conducted to find the model that performed best on the readings. Among the five architectures, ResNet50 showed the best performance with 81.51%.