• Title/Summary/Keyword: Text image

Search Result 969, Processing Time 0.025 seconds

Considerations for Applying Korean Natural Language Processing Technology in Records Management (기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항)

  • Haklae, Kim
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.22 no.4
    • /
    • pp.129-149
    • /
    • 2022
  • Records have temporal characteristics, including the past and present; linguistic characteristics not limited to a specific language; and various types categorized in a complex way. Processing records such as text, video, and audio in the life cycle of records' creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and analog digitization. In particular, Korean deep learning-based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The process of using NLP technologies, such as machine translation and optical character recognition for digital conversion of records, is introduced as an example implemented in the Python environment. In contrast, a plan to improve environmental factors and record digitization guidelines for applying NLP technology in the records management field is proposed for utilizing NLP technology.

A Study on the Image of Kim Soo-young in the Media -Focused on the drama "The Count of Myeong-dong"(2004) (영상매체에 나타난 김수영 이미지 연구 -드라마 <명동백작>(2004)을 중심으로)

  • Son, Mi-young
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.89-96
    • /
    • 2022
  • This study examines the strategy of delivering the drama's poet Kim Soo-young and his literary works to the public through the drama (2004). This drama shows Kim's inner self and his literary view by inserting poems into scenes where the poet suffers internal conflict, while presenting relatively less well-known poems to broaden the public's understanding of poetry. In addition, the drama maintains viewers' interest by properly placing elements of conflict, and effectively shows how the conflict affected his life and the world of time. Therefore, the drama is a meaningful text that embodies a poet named Kim Soo-young in three dimensions along with the historical transformation and social problems of the time and the literary chapter of the time through the video.

The Edge Computing System for the Detection of Water Usage Activities with Sound Classification (음향 기반 물 사용 활동 감지용 엣지 컴퓨팅 시스템)

  • Seung-Ho Hyun;Youngjoon Chee
    • Journal of Biomedical Engineering Research
    • /
    • v.44 no.2
    • /
    • pp.147-156
    • /
    • 2023
  • Efforts to employ smart home sensors to monitor the indoor activities of elderly single residents have been made to assess the feasibility of a safe and healthy lifestyle. However, the bathroom remains an area of blind spot. In this study, we have developed and evaluated a new edge computer device that can automatically detect water usage activities in the bathroom and record the activity log on a cloud server. Three kinds of sound as flushing, showering, and washing using wash basin generated during water usage were recorded and cut into 1-second scenes. These sound clips were then converted into a 2-dimensional image using MEL-spectrogram. Sound data augmentation techniques were adopted to obtain better learning effect from smaller number of data sets. These techniques, some of which are applied in time domain and others in frequency domain, increased the number of training data set by 30 times. A deep learning model, called CRNN, combining Convolutional Neural Network and Recurrent Neural Network was employed. The edge device was implemented using Raspberry Pi 4 and was equipped with a condenser microphone and amplifier to run the pre-trained model in real-time. The detected activities were recorded as text-based activity logs on a Firebase server. Performance was evaluated in two bathrooms for the three water usage activities, resulting in an accuracy of 96.1% and 88.2%, and F1 Score of 96.1% and 87.8%, respectively. Most of the classification errors were observed in the water sound from washing. In conclusion, this system demonstrates the potential for use in recording the activities as a lifelog of elderly single residents to a cloud server over the long-term.

Design and Implementation of a Data Visualization Assessment Module in Jupyter Notebook

  • HakNeung Go;Youngjun Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.9
    • /
    • pp.167-176
    • /
    • 2023
  • In this paper, we designed and implemented a graph assessment module that can evaluate graphs in an programming assessment system based on text and numbers. The assessment method of the graph assessment module is self-evaluation that outputs two graphs generated by codes submitted by learners and by answers, automatic-evaluation that converts each graph image into an array, and gives feedback if it is wrong. The data used to generate the graph can be inputted directly or used from external data, and the method of generatng graph that can be evaluated is MATLAB style in matplotlib, and the graph shape that can be evaluated is presented in mathematics and curriculum. Through expert review, it was confirmed that the content elements of the assessment module, the possibility of learning, and the validity of the learner's needs were met. The graph assessment module developed in this study has expanded the evaluation area of the programming automatic asssessment system and is expected to help students learn data visualization.

Implication of Physical Education Based on the Zhaung Zi's Philosophy (장자철학의 체육적 함의)

  • Lee, Hak-Jun
    • 한국체육학회지인문사회과학편
    • /
    • v.51 no.4
    • /
    • pp.23-31
    • /
    • 2012
  • The purpose of this study was to inquiry implication of physical education based on the Zhaung Zi's philosophy. In order to investigate this purpose, I analyzed the text of Zhaung Zi. The result of the study was as followed. Zhaung-Zi oriented ideal image of human beings who has attained the stage of play(遊). He is an acquaintance (至仁), a man of god(神人), a true man(眞人). The purpose of physical education in Zhuang-Zi is paly(遊) which play well naturally. 'play is the gaming and play in which we can see the true face of the world and ourselves and can become on with the object in the world. Forget-enjoyment(至樂) of victory, records, and results are the purpose of physical education which can be found in Zhuang-Zi. The methods of physical education is the whole mind(專一), xixium(虛心), the feast of mind(心齋), forgetting everything(坐忘) etc. Physical education is to harmonious study with nature not artificially. The relation between teacher(coach) and student(player) is a relationship of mutual respect and consideration. The teacher(player) have to find the potential ability of student and he can help student realize potential ability of them.

Analysis of research status on domestic AI education (국내 인공지능 교육에 대한 연구 현황 분석)

  • Park, Mingyu;Han, Kyujung;Sin, Subeom
    • 한국정보교육학회:학술대회논문집
    • /
    • 2021.08a
    • /
    • pp.69-76
    • /
    • 2021
  • The purpose of this study is to identify research trends on artificial intelligence education. We analyzed 164 domestic journal papers related to AI education published since 2016. The criteria for thesis analysis are number of publications by year, journal name, research topic, research type, data collection method, research subject, and subject. The main research areas and areas that require further research are reviewed. The method of the study was analyzed based on the topic and summary of the selected thesis, but the text was checked if it was unclear. As a result of the study, research on 'artificial intelligence education' started in earnest after 2017, and has been rapidly increasing in recent years. As a result of the analysis, there were many studies on artificial intelligence education programs and content development, and artificial intelligence perception and image. As for the type of research, there were many quantitative studies, and the development research method was used a lot as a data collection method. In the study subjects, elementary school had a high proportion, and in subject, it was found that there were many practicial subject(technology) dealing with artificial intelligence contents.

  • PDF

Research on Core patent mining methods based on key components of Generative AI (생성형 인공지능 기술의 핵심 구성 요소 기반 주요 특허 발굴 방법에 관한 연구)

  • Gayun Kim;Beom-Seok Kim;Jinhong Yang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.5
    • /
    • pp.292-300
    • /
    • 2023
  • This paper proposes a patent discovery method and strategy for Generative AI-related patents by utilizing qualitative evaluation indicators established based on the core components of the technology. Currently, the evaluation of patent quality relies on quantitative indicators, but existing quantitative indicators cannot represent the characteristics of Generative AI technology, making it difficult to accurately evaluate. Therefore, there is a need for additional qualitative indicators that consider technical characteristics based on patent claims, which can reveal the actual strength of the patent. In this paper, we propose a new evaluation index considering the technical characteristics of Generative AI. Core patents were selected using the proposed evaluation index, and the appropriateness of the proposed index was verified through the existing quantitative evaluation method for the selected core patents.

Deep-Learning-based smartphone application for automatic recognition of ingredients on curved containers (곡면 용기에 표시된 성분표 자동 인식을 위한 인공지능 기반 스마트폰 애플리케이션)

  • Hieyong Jeong;Choonsung Shin
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.6
    • /
    • pp.29-43
    • /
    • 2023
  • Consumers should look at the ingredients of cosmetics or food for their health and purchase them after checking whether they contain allergy-causing ingredients. Therefore, this paper aimed to develop an artificial intelligence-based smartphone application for automatically recognizing the ingredients displayed on a curved container and delivering it to consumers in an easy-to-understand manner. The app needs to allow consumers to immediately comprehend the restricted ingredients by recognizing the ingredients' words in the cropped image. Two major issues should be solved during the development process: First, although there were flat containers for cosmetics or food, most were curved containers. Thus, it was necessary to recognize the ingredient table displayed on the curved containers. Second, since the ingredients' words were displayed on the curved surface, the transformed or line-changed words also needed to be recognized. The proposed new methods were enough to solve the above two problems. The application developed through various tests verified that there was no problem recognizing the ingredients' words contained in a cylindrical curved container.

Generating Radiology Reports via Multi-feature Optimization Transformer

  • Rui Wang;Rong Hua
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.10
    • /
    • pp.2768-2787
    • /
    • 2023
  • As an important research direction of the application of computer science in the medical field, the automatic generation technology of radiology report has attracted wide attention in the academic community. Because the proportion of normal regions in radiology images is much larger than that of abnormal regions, words describing diseases are often masked by other words, resulting in significant feature loss during the calculation process, which affects the quality of generated reports. In addition, the huge difference between visual features and semantic features causes traditional multi-modal fusion method to fail to generate long narrative structures consisting of multiple sentences, which are required for medical reports. To address these challenges, we propose a multi-feature optimization Transformer (MFOT) for generating radiology reports. In detail, a multi-dimensional mapping attention (MDMA) module is designed to encode the visual grid features from different dimensions to reduce the loss of primary features in the encoding process; a feature pre-fusion (FP) module is constructed to enhance the interaction ability between multi-modal features, so as to generate a reasonably structured radiology report; a detail enhanced attention (DEA) module is proposed to enhance the extraction and utilization of key features and reduce the loss of key features. In conclusion, we evaluate the performance of our proposed model against prevailing mainstream models by utilizing widely-recognized radiology report datasets, namely IU X-Ray and MIMIC-CXR. The experimental outcomes demonstrate that our model achieves SOTA performance on both datasets, compared with the base model, the average improvement of six key indicators is 19.9% and 18.0% respectively. These findings substantiate the efficacy of our model in the domain of automated radiology report generation.

Application Design for Food Allergy Management (식품 알레르기 관리에 관한 애플리케이션 설계)

  • Ji-Uk Han;Nam-Bin Kim;Ye-Won Lee;Byeong-Seung Yang;Won-Whoi Huh
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.2
    • /
    • pp.197-203
    • /
    • 2024
  • Food allergies are common and accidents occur annually. However, many people lack knowledge of the severity of allergies and food ingredients. Allergy-related applications currently on the market have problems such as providing information by relying only on certain certified products, food ingredients, and barcodes. This design plans a customized service application for food allergy patients. In this application, after extracting the text of the image using OCR technology, the food ingredients were read and displayed in large letters. In addition, if the user selects an ingredient that cannot be consumed through filtering technology, the restricted food is quickly and conveniently shown when searching for food ingredients. Finally, when scanning a barcode or searching for a product, food ingredient information is provided through barcode scanning and search engine technology that provides ingredient information of the product. Therefore, the purpose of this paper is to design an app in which users with food allergies can easily check food ingredients and avoid allergic reactions using databases and various information search methods.