• Title/Summary/Keyword: Training data generation

Development of Deep Learning Structure for Defective Pixel Detection of Next-Generation Smart LED Display Board using Imaging Device (영상장치를 이용한 차세대 스마트 LED 전광판의 불량픽셀 검출을 위한 딥러닝 구조 개발)

  • Sun-Gu Lee;Tae-Yoon Lee;Seung-Ho Lee
    • Journal of IKEEE / v.27 no.3 / pp.345-349 / 2023
  • This paper proposes a deep learning structure for detecting defective pixels on next-generation smart LED display boards using an imaging device. The technique combines imaging devices and deep learning to automatically detect defects in outdoor LED billboards, with the aim of managing the billboards effectively and resolving various errors and issues. The research process consists of three stages. First, planarized image data of the billboard is calibrated to completely remove the background and then preprocessed as needed to generate a training dataset. Second, the generated dataset is used to train an object recognition network composed of a Backbone and a Head. The Backbone uses CSP-Darknet to extract feature maps, and the Head performs object detection on the extracted feature maps. Throughout this process, the network is adjusted according to the Confidence score and the Intersection over Union (IoU) error, and training continues. Third, the trained model is used to automatically detect defective pixels on actual outdoor LED billboards. In accredited measurement experiments, the proposed method achieved 100% detection of defective pixels on real LED billboards, confirming improved efficiency in managing and maintaining LED billboards. These findings are expected to substantially advance the management of LED billboards.
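
The abstract names Intersection over Union (IoU) as one of the quantities the detector is adjusted against. As a minimal sketch of that measure only, the function below computes IoU for two axis-aligned boxes. The `(x1, y1, x2, y2)` box layout and the function name `iou` are assumptions made for illustration; the paper's actual loss is not described in the abstract.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes.

    Each box is (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    This layout is an assumption for the sketch, not taken from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap; zero when the boxes do not intersect.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union = sum of both areas minus the overlap counted twice.
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


# A predicted defective-pixel box against a hypothetical labeled box.
print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
```

An IoU of 1.0 means the predicted and labeled boxes coincide, and 0.0 means they do not overlap at all.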

Quality Evaluation of Automatically Generated Metadata Using ChatGPT: Focusing on Dublin Core for Korean Monographs (ChatGPT가 자동 생성한 더블린 코어 메타데이터의 품질 평가: 국내 도서를 대상으로)

  • SeonWook Kim;HyeKyung Lee;Yong-Gu Lee
    • Journal of the Korean Society for Information Management / v.40 no.2 / pp.183-209 / 2023
  • The purpose of this study is to evaluate the Dublin Core metadata generated by ChatGPT from the book covers, title pages, and colophons of a collection of books. To this end, we collected book covers, title pages, and colophons from 90 books and input them into ChatGPT to generate Dublin Core metadata. Performance was evaluated in terms of completeness and accuracy. The overall results showed a satisfactory level of completeness at 0.87 and accuracy at 0.71. Among the individual elements, Title, Creator, Publisher, Date, Identifier, Rights, and Language exhibited higher performance. The Subject and Description elements showed relatively lower completeness and accuracy, but the results confirmed the generative capability known to be an inherent strength of ChatGPT. On the other hand, books in the social sciences and technology sections of DDC showed slightly lower accuracy in the Contributor element. This was attributed to ChatGPT's attribution extraction errors, omissions in the original bibliographic description contents for metadata, and the language composition of the training data used by ChatGPT.
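
The abstract reports completeness and accuracy scores but does not define how they were computed. The sketch below shows one plausible reading, stated as an assumption: completeness as the share of expected elements that received a non-empty value, and accuracy as the share of filled elements that exactly match a reference record. The function names, the exact-match rule, and the sample records are all hypothetical.

```python
def completeness(record, elements):
    """Share of expected elements with a non-empty value (assumed definition)."""
    filled = sum(1 for e in elements if record.get(e, "").strip())
    return filled / len(elements)


def accuracy(record, reference, elements):
    """Share of filled elements that exactly match the reference (assumed definition)."""
    filled = [e for e in elements if record.get(e, "").strip()]
    if not filled:
        return 0.0
    correct = sum(
        1 for e in filled if record[e].strip() == reference.get(e, "").strip()
    )
    return correct / len(filled)


# Hypothetical records covering four Dublin Core elements.
elements = ["title", "creator", "publisher", "date"]
generated = {"title": "A", "creator": "B", "publisher": "", "date": "2020"}
reference = {"title": "A", "creator": "B", "publisher": "P", "date": "2021"}

print(completeness(generated, elements))
print(accuracy(generated, reference, elements))
```

A human-graded rubric, which the study may well have used, would replace the exact string comparison here.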

A Study on Dataset Generation Method for Korean Language Information Extraction from Generative Large Language Model and Prompt Engineering (생성형 대규모 언어 모델과 프롬프트 엔지니어링을 통한 한국어 텍스트 기반 정보 추출 데이터셋 구축 방법)

  • Jeong Young Sang;Ji Seung Hyun;Kwon Da Rong Sae
    • KIPS Transactions on Software and Data Engineering / v.12 no.11 / pp.481-492 / 2023
  • This study explores how to build a Korean dataset for extracting information from text using generative large language models. In modern society, mixed information circulates rapidly, and effectively categorizing and extracting it is crucial to the decision-making process. However, Korean datasets for training are still lacking. To overcome this, this study extracts information through text-based zero-shot learning with a generative large language model in order to build a purpose-specific Korean dataset. The language model is instructed to output the desired result through prompt engineering in the form of "system"-"instruction"-"source input"-"output format", and the dataset is built by utilizing the in-context learning characteristics of the language model through input sentences. We validate our approach by comparing the generated dataset with an existing benchmark dataset, and achieve 25.47% higher performance than the KLUE-RoBERTa-large model on the relation information extraction task. The results of this study are expected to contribute to AI research by showing the feasibility of extracting knowledge elements from Korean text. Furthermore, this methodology can be used in various fields and for various purposes, and has potential for building various Korean datasets.
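
The four-part prompt form named in the abstract ("system"-"instruction"-"source input"-"output format") can be illustrated with a small prompt builder. This is a sketch under assumptions: the bracketed section labels, the function name `build_prompt`, and all example text are hypothetical, and the paper's actual prompt wording is not given in the abstract.

```python
def build_prompt(system, instruction, source_input, output_format):
    """Assemble a prompt from the four parts named in the abstract.

    The bracketed labels and the blank-line separator are illustrative choices.
    """
    sections = [
        ("system", system),
        ("instruction", instruction),
        ("source input", source_input),
        ("output format", output_format),
    ]
    return "\n\n".join(f"[{name}]\n{text}" for name, text in sections)


# Hypothetical relation-extraction request for one Korean sentence.
prompt = build_prompt(
    system="You are an assistant that extracts relations from Korean text.",
    instruction="Extract every subject-relation-object triple from the sentence.",
    source_input="홍길동은 서울에서 태어났다.",
    output_format="A JSON list of objects with keys subject, relation, object.",
)
print(prompt)
```

Keeping the four parts as separate arguments makes it easy to hold the system and output-format text fixed while iterating over source sentences to build a dataset.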

Exploring the power of physics-informed neural networks for accurate and efficient solutions to 1D shallow water equations (물리 정보 신경망을 이용한 1차원 천수방정식의 해석)

  • Nguyen, Van Giang;Nguyen, Van Linh;Jung, Sungho;An, Hyunuk;Lee, Giha
    • Journal of Korea Water Resources Association / v.56 no.12 / pp.939-953 / 2023
  • Shallow water equations (SWE) are fundamental equations governing the movement of water. Traditional numerical approaches for solving these equations generally face various challenges, such as sensitivity to mesh generation and numerical oscillation, and can become computationally unstable around shocks and discontinuities. In this study, we present a novel approach that leverages physics-informed neural networks (PINNs) to approximate the solution of the SWE. PINNs integrate physical laws directly into the neural network architecture, enabling accurate approximation of solutions to the SWE. We provide a comprehensive methodology for formulating the SWE within the PINNs framework, encompassing network architecture, training strategy, and data generation techniques. The experiments showed that PINNs can provide an accurate solution of the SWE when their results are compared with those of the analytical method. In addition, PINNs performed better than an Artificial Neural Network. This study highlights the potential of PINNs for water resources research, offering a new paradigm for accurate and efficient solutions to the SWE.
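
For orientation, the 1D shallow water equations are commonly written in the conservative form below, with water depth h, velocity u, and gravitational acceleration g. This sketch assumes a flat, frictionless bed so that the source terms vanish; the paper may include bed slope or friction terms. The second expression is a typical PINN training loss, in which the equation residuals r_1 and r_2 are evaluated at N_r collocation points and added to a data-mismatch term; the weights \lambda_d and \lambda_r are assumed symbols, not taken from the paper.

```latex
\begin{aligned}
r_1 &:= \frac{\partial h}{\partial t} + \frac{\partial (hu)}{\partial x} = 0, \\
r_2 &:= \frac{\partial (hu)}{\partial t}
      + \frac{\partial}{\partial x}\left( h u^2 + \tfrac{1}{2} g h^2 \right) = 0, \\
\mathcal{L}(\theta) &= \lambda_d \, \mathcal{L}_{\text{data}}(\theta)
      + \frac{\lambda_r}{N_r} \sum_{j=1}^{N_r}
        \left[ r_1(x_j, t_j; \theta)^2 + r_2(x_j, t_j; \theta)^2 \right].
\end{aligned}
```

Here \mathcal{L}_{\text{data}} collects the mismatch at initial, boundary, or observed points, and the derivatives in r_1 and r_2 are obtained by automatic differentiation of the network outputs with respect to x and t.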

Fundamental research to investigate methods of vocational competency enforcement in field of home economics education - revision of the current NCS based vocational highschool education curriculum and investigation in change of direction in vocational home economics education - (가정과교육에서의 직업역량 강화 방안 탐색을 위한 기초 연구 - NCS 기반 고교 직업교육과정 개정과 가사실업계 직업교육의 변화 방향 탐색 -)

  • Jang, Myung Hee
    • Journal of Korean Home Economics Education Association / v.26 no.4 / pp.129-146 / 2014
  • This study is fundamental research on strengthening vocational competencies in the field of home economics education. It was carried out to examine recent economic and social environmental changes and the management system related to vocational training in home economics education, and it explores a change in direction in relation to the National Competency Standard (NCS) based on revisions to the educational system. The study relied mainly on literature and data analysis, expert consultation, and a public hearing. The main results are as follows. First, the main environmental change factors related to vocational training were consolidated into changes in population structure, gender-related economic activity, generational composition, communications technology, and innovation in living technology. These factors point to innovation in related industries, lifestyle changes, and changes in the demand for manpower and in the capabilities required for each profession. Second, according to the analysis of current home economics vocational education, vocational home economics high schools account for 9.4% of all specialized high schools, and their 8 standard departments are specialized and characterized under 137 different department names. Despite differences among departments, the overall employment rate of graduates was 44.7%, above the entrance rate of 41.9%. These figures represent a great change since 2010 (overall employment rate 16.9%, entrance rate 75.2%), a meaningful outcome resulting from changes in policy from the previous employment-centered education system. Third, regarding the NCS-based revision of the high school vocational home economics curriculum and the exploration of a change in direction for vocational home economics education, this study attempts to provide background for the revision based on the development of the NCS. It also offers proposals for restructuring the current classification and departments of home economics education, as well as suggestions for future research.

A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)

  • Yang, Yunseok;Lee, Hyun Jun;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems / v.25 no.2 / pp.25-38 / 2019
  • Selecting high-quality information that meets users' interests and needs from the overflowing content is becoming more important over time. Amid this flood of information, efforts are being made to better reflect the user's intention in search results rather than treating an information request as a simple string. Large IT companies such as Google and Microsoft also focus on developing knowledge-based technologies, including search engines, that provide users with satisfaction and convenience. Finance in particular is a field where text data analysis is expected to be useful and promising, because new information is constantly generated and the earlier the information, the more valuable it is. Automatic knowledge extraction can be effective in areas such as the financial sector, where the flow of information is vast and new information continues to emerge. However, automatic knowledge extraction faces several practical difficulties. First, it is difficult to build corpora from different fields with the same algorithm and to extract good-quality triples. Second, producing labeled text data manually becomes more difficult as the extent and scope of knowledge increase and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, defining the problem for automatic knowledge extraction is not easy because of the ambiguous conceptual characteristics of knowledge. To overcome these limits and improve the semantic performance of stock-related information search, this study attempts to extract knowledge entities using a neural tensor network and evaluates their performance. Unlike previous work, this study aims to extract knowledge entities related to individual stock items. 
Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous research and to enhance the effectiveness of the model. These processes give the study three points of significance. First, it presents a practical and simple automatic knowledge extraction method that can be readily applied. Second, it shows that performance evaluation is possible through a simple problem definition. Finally, the expressiveness of the knowledge is increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and an objective performance evaluation method are also presented. For the empirical study confirming the usefulness of the presented model, experts' reports on 30 individual stocks, the top 30 items by publication frequency from May 30, 2017 to May 21, 2018, are used. The total number of reports is 5,600; 3,074 reports, about 55% of the total, are designated as the training set, and the remaining 45% are designated as the testing set. Before constructing the model, all reports in the training set are classified by stock, and their entities are extracted using KKMA, a named entity recognition tool. For each stock, the top 100 entities by appearance frequency are selected and vectorized using one-hot encoding. Then, using the neural tensor network, as many score functions as there are stocks are trained. Thus, when a new entity from the testing set appears, its score is calculated with every score function, and the stock whose function yields the highest score is predicted as the item related to the entity. To evaluate the presented model, we assess its predictive power and whether the score functions are well constructed by calculating the hit ratio over all reports in the testing set. 
As a result of the empirical study, the presented model shows 69.3% hit accuracy on the testing set, which consists of 2,526 reports. This hit ratio is meaningfully high despite some constraints on conducting the research. Looking at the prediction performance for each stock, only three stocks, LG ELECTRONICS, KiaMtr, and Mando, show performance far below average. This result may be due to interference from other similar items and to the generation of new knowledge. In this paper, we propose a methodology for finding the key entities, or combinations of entities, needed to search for related information in accordance with the user's investment intention. Graph data is generated using only the named entity recognition tool and applied to the neural tensor network without learning a corpus or word vectors for the field. The empirical test confirms the effectiveness of the presented model as described above. However, some limitations and points for improvement remain. Most notably, the fact that model performance is especially poor for only some stocks shows the need for further research. Finally, the empirical study confirmed that the learning method presented here can be used to semantically match new text information with the related stocks.
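
The prediction and evaluation steps described in this abstract (score a new entity with every per-stock score function, pick the stock with the highest score, then compute the hit ratio) can be sketched as follows. The score functions here are simple keyword stand-ins, not trained neural tensor networks, and the stock names, entities, and function names are all hypothetical.

```python
def predict_stock(entity, score_functions):
    """Return the stock whose score function gives the entity the highest score."""
    return max(score_functions, key=lambda stock: score_functions[stock](entity))


def hit_ratio(samples, score_functions):
    """Fraction of (entity, true_stock) pairs whose predicted stock is correct."""
    hits = sum(
        1
        for entity, true_stock in samples
        if predict_stock(entity, score_functions) == true_stock
    )
    return hits / len(samples)


# Stand-in score functions: 1.0 if the entity is in a stock's keyword set.
# In the paper these would be score functions trained with a neural tensor network.
score_functions = {
    "StockA": lambda e: 1.0 if e in {"battery", "ev"} else 0.0,
    "StockB": lambda e: 1.0 if e in {"tv", "oled"} else 0.0,
}

# Hypothetical test pairs; the last one is deliberately mislabeled for StockB's keyword.
samples = [("battery", "StockA"), ("oled", "StockB"), ("tv", "StockA")]
print(hit_ratio(samples, score_functions))
```

With one score function per stock, prediction reduces to an argmax over stocks, which is why the hit ratio doubles as a check on whether the score functions separate the stocks well.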

Estimate Saliency map based on Multi Feature Assistance of Learning Algorithm (다중 특징을 지원하는 학습 기반의 saliency map에 관한 연구)

  • Han, Hyun-Ho;Lee, Gang-Seong;Park, Young-Soo;Lee, Sang-Hun
    • Journal of the Korea Convergence Society / v.8 no.6 / pp.29-36 / 2017
  • In this paper, we propose a method for generating an improved saliency map by learning multiple features, in order to improve the accuracy and reliability of saliency maps that approximate human visual perception. To overcome the inaccurate results of existing saliency map generation, such as reverse selection or partial loss in color-based salient area estimation, the proposed method generates multi-feature data based on learning. The features to be considered in the image are analyzed by distinguishing color patterns and distinctive regions in the original image, and the learning data is composed by combining the definition of similar salient (protruding) areas with the distinctive areas, using color analysis based on the LAB color space. After combining the training data with the extrinsic information obtained from low-level features such as frequency, color, and focus information, we reconstructed the final saliency map to minimize inaccurate saliency areas. For the experiment, we compared the experimental results with the ground truth images and obtained precision-recall values.
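
The precision-recall comparison against ground truth mentioned in this abstract can be illustrated with binary masks. This is a minimal sketch that assumes the saliency map has already been thresholded to 0/1 values; the threshold choice, the function name, and the small example masks are hypothetical.

```python
def precision_recall(pred_mask, gt_mask):
    """Pixel-wise precision and recall of a binary saliency mask against ground truth."""
    tp = fp = fn = 0
    for pred_row, gt_row in zip(pred_mask, gt_mask):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                tp += 1  # salient in both prediction and ground truth
            elif p and not g:
                fp += 1  # predicted salient, but not in ground truth
            elif g and not p:
                fn += 1  # salient in ground truth, but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical 2x3 masks: 1 marks a salient pixel.
pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]
print(precision_recall(pred, gt))
```

Sweeping the binarization threshold and recording one precision-recall pair per threshold yields the usual precision-recall curve for a saliency method.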

Deep Learning-based Professional Image Interpretation Using Expertise Transplant (전문성 이식을 통한 딥러닝 기반 전문 이미지 해석 방법론)

  • Kim, Taejin;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.26 no.2 / pp.79-104 / 2020
  • Recently, as deep learning has attracted attention, it is being considered as a method for solving problems in various fields. In particular, deep learning is known to perform well when applied to unstructured data such as text, sound, and images, and many studies have proven its effectiveness. Owing to the remarkable development of deep learning technology for text and images, interest in image captioning technology and its applications is rapidly increasing. Image captioning is a technique that automatically generates relevant captions for a given image by handling image comprehension and text generation simultaneously. In spite of its high entry barrier, in that analysts must be able to process both image and text data, image captioning has established itself as one of the key fields in A.I. research owing to its wide applicability. In addition, much research has been conducted to improve the performance of image captioning in various respects. Recent studies attempt to create advanced captions that not only describe an image accurately but also convey the information it contains in a more sophisticated way. Despite many recent efforts to improve the performance of image captioning, it is difficult to find research that interprets images from the perspective of domain experts in each field rather than from the perspective of the general public. Even for the same image, the parts of interest may differ according to the professional field of the person viewing it. Moreover, the way of interpreting and expressing the image also differs according to the level of expertise. The public tends to recognize an image from a holistic and general perspective, that is, by identifying the image's constituent objects and their relationships. 
In contrast, domain experts tend to recognize an image by focusing on the specific elements needed to interpret it based on their expertise. This implies that the meaningful parts of an image differ depending on the viewer's perspective, even for the same image. Image captioning therefore needs to reflect this phenomenon. In this study, we propose a method that generates captions specialized for each domain by utilizing the expertise of experts in that domain. Specifically, after pre-training on a large amount of general data, the expertise of the field is transplanted through transfer learning with a small amount of expertise data. However, a simple application of transfer learning using expertise data may cause another type of problem. Simultaneous learning with captions of various characteristics may cause the so-called 'inter-observation interference' problem, which makes it difficult to learn each characteristic point of view purely. When learning with a vast amount of data, most of this interference is self-purified and has little impact on the learning results. In contrast, in fine-tuning, where learning is performed on a small amount of data, the impact of such interference can be relatively large. To solve this problem, we propose a novel 'Character-Independent Transfer-learning' that performs transfer learning independently for each character. To confirm the feasibility of the proposed methodology, we performed experiments using the results of pre-training on the MSCOCO dataset, which comprises 120,000 images and about 600,000 general captions. Additionally, following the advice of an art therapist, about 300 pairs of 'image / expertise captions' were created, and this data was used for the expertise transplantation experiments. 
The experiments confirmed that captions generated with the proposed methodology reflect the perspective of the implanted expertise, whereas captions generated through learning on general data contain much content irrelevant to expert interpretation. In this paper, we propose a novel approach to specialized image interpretation. To achieve this goal, we present a method that uses transfer learning to generate captions specialized for a specific domain. By applying the proposed methodology to expertise transplantation in various fields, we expect that much research will be actively conducted in the future to solve the problem of insufficient expertise data and to improve the performance of image captioning.

A Development of Curriculum for Information Security Professional Manpower Training (정보보안 전문인력 양성을 위한 교육과정 개발)

  • Lee, Moongoo
    • Journal of the Institute of Electronics and Information Engineers / v.54 no.1 / pp.46-52 / 2017
  • Social attention to the information security field is growing, and the forecast demand for personnel in this area is rising. This study surveyed the information security knowledge of practitioners working in information security areas such as computer and network systems. We analyzed the connections among the survey data, the information protection job system suggested by NICE, the IT skills classified by NCS and KISA, and the security field classification system. Based on the analyzed data, this study proposes a curriculum for training professionals who perform duties in the field of information security. The proposed curriculum can be applied to 2-year, 3-year, and 4-year colleges. It provides the courses that students who want to work in information security must take during college. The proposed courses are closely connected to the related field, and a detailed teaching guideline is given for each course. Such a curriculum is needed; it combines foundational theoretical education with practical education, so that it is neither weighted toward theory nor focused only on learning simple commands. The curriculum is designed to teach students hacking countermeasures and security defense based on scenarios tied to practical execution ability. It helps students earn field-relevant certificates rather than mere paper qualifications. We also expect the curriculum to help train convergence-oriented information security professionals for the next generation.

Current Transmission of Intangible Cultural Property in Ibuk 5-do and Proposal for its Development (이북5도 무형문화재의 전승현황과 발전방안)

  • Cha, Ji-eon
    • The Journal of the Korea Contents Association / v.19 no.8 / pp.653-666 / 2019
  • This study examines the protection system for intangible cultural property under current national policy and analyzes its fundamental problems, in order to suggest a future vision for transmitting traditional cultural heritage to descendants, based on policy support for the intangible cultural property under the Commission for Ibuk 5-do (the five northern provinces of the Korean peninsula). The first research issue is the aging of the holders of important cultural properties in Ibuk 5-do and the overcoming of the incomplete transmission system. The second is the policy for protecting transmission from a national perspective and the justification for government financial support. The third is the development of cultural content using the intangible cultural properties of Ibuk 5-do and suggestions for creative development that turns culture into a resource. As for research methods, reference research was conducted, interview data and data from related entities were analyzed, and the results were actively adopted. The results show that the following are required. First, the aged holders of cultural properties should be treated well, and at the same time the holders recognized as having the ability to transmit should be dually designated. Second, a training system that enables the younger generation to preserve the traditional cultural heritage should be established. Third, educational programs and school education should be extended so that the importance of transmitting the intangible traditional cultural properties of Ibuk 5-do is recognized. Fourth, a recording system should be constructed, along with a system for its application. Finally, for these development measures to be implemented effectively, active government support is urgently required.