• Title/Summary/Keyword: semantic category

Search Result 119, Processing Time 0.027 seconds

The Method of Hierarchical Emotion Evaluation using Intuitive Categorization (직감적 범주화를 이용한 계층적 감성평가방법)

  • Kim, Don-Han
    • Science of Emotion and Sensibility
    • /
    • v.12 no.1
    • /
    • pp.45-54
    • /
    • 2009
  • Categorization in a vital means for dealing with the multitudes of entities in the world surrounding people. Among others, the perceptual and the evaluative similarities factors strongly affect categorization. The conventional SD-type procedure are insufficient in this regard, since it requires an individual subject to make isolated judgments about each stimulus to identify categorization in terms of a group tendency. It disregards the individual categorization in which the similarities are of great importance. Thus in this study the phased emotional evaluation method is suggested based on the intuitive categorization of stimuli and on the similarity judgement of representative/ non-representative case in each category. To verify the effectiveness of the suggested evaluation method the scanned jewelry images are selected as test stimuli for emotional evaluation experiment. As a result of the evaluation experiment, the conventional SD-type procedure is complemented by the emotional evaluation method in phases of the task of intuitive categorization, the selection of the representative images and the setup of the evaluation score of the representative images to internally supplied anchors of evaluating non-representative images.

  • PDF

Functional Expansion of Morphological Analyzer Based on Longest Phrase Matching For Efficient Korean Parsing (효율적인 한국어 파싱을 위한 최장일치 기반의 형태소 분석기 기능 확장)

  • Lee, Hyeon-yoeng;Lee, Jong-seok;Kang, Byeong-do;Yang, Seung-weon
    • Journal of Digital Contents Society
    • /
    • v.17 no.3
    • /
    • pp.203-210
    • /
    • 2016
  • Korean is free of omission of sentence elements and modifying scope, so managing it on morphological analyzer is better than parser. In this paper, we propose functional expansion methods of the morphological analyzer to ease the burden of parsing. This method is a longest phrase matching method. When the series of several morpheme have one syntax category by processing of Unknown-words, Compound verbs, Compound nouns, Numbers and Symbols, our method combines them into a syntactic unit. And then, it is to treat by giving them a semantic features as syntax unit. The proposed morphological analysis method removes unnecessary morphological ambiguities and deceases results of morphological analysis, so improves accuracy of tagger and parser. By empirical results, we found that our method deceases 73.4% of Parsing tree and 52.4% of parsing time on average.

A Study on Deep Learning Optimization by Land Cover Classification Item Using Satellite Imagery (위성영상을 활용한 토지피복 분류 항목별 딥러닝 최적화 연구)

  • Lee, Seong-Hyeok;Lee, Moung-jin
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.6_2
    • /
    • pp.1591-1604
    • /
    • 2020
  • This study is a study on classifying land cover by applying high-resolution satellite images to deep learning algorithms and verifying the performance of algorithms for each spatial object. For this, the Fully Convolutional Network-based algorithm was selected, and a dataset was constructed using Kompasat-3 satellite images, land cover maps, and forest maps. By applying the constructed data set to the algorithm, each optimal hyperparameter was calculated. Final classification was performed after hyperparameter optimization, and the overall accuracy of DeeplabV3+ was calculated the highest at 81.7%. However, when looking at the accuracy of each category, SegNet showed the best performance in roads and buildings, and U-Net showed the highest accuracy in hardwood trees and discussion items. In the case of Deeplab V3+, it performed better than the other two models in fields, facility cultivation, and grassland. Through the results, the limitations of applying one algorithm for land cover classification were confirmed, and if an appropriate algorithm for each spatial object is applied in the future, it is expected that high quality land cover classification results can be produced.

A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment

  • Dongdong Jia;Meili Zhou;Wei WEI;Dong Wang;Zongwen Bai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.12
    • /
    • pp.3383-3397
    • /
    • 2023
  • Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation is often compromised when working with biased data in real-world situations. While many existing systems focus on a single stage of learning for both feature extraction and classification, some employ Class-Balancing strategies, such as Re-weighting, Data Resampling, and Transfer Learning from head to tail. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture and design an adaptive calibration function specifically for predicate classification. This function enables us to dynamically adjust the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that effectively balances the class distribution after the feature extraction phase reaches a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome and several popular models, we achieved significant improvements over the previous state-of-the-art methods with our model. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.

The meaning based on Yin-Yang and Five Elements Principle in Semantic Landscape Composition of 'the Forty Eight Poems of Soswaewon' ('소쇄원(瀟灑園) 48영'의 의미경관 구성에 있어서 음양오행론적(陰陽五行論的) 의미(意味))

  • Jang, Il-Young;Shin, Sang-Sup
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.31 no.2
    • /
    • pp.43-57
    • /
    • 2013
  • The purpose of this study is to identify potential semantic landscape makeup of "the Forty Eight Poems of Soswaewon" according to Yin-Yang and Five Elements Principle(陰陽五行論). that speculation system between human's nature and cosmical universal order. Existing academic discussions made so far concerning this topic can be summed up as follows: 1. Among Yin-Yang-based landscape makeups of the Forty Eight Poems of Soswaewon, poetic writings for embodiment of interactions between nature and human behaviors focused on depicting dynamic aspects of a poetic narrator when he appreciates or explores hills and streams as of to live free from worldly cares. Primarily, many of those writings were created on the east and south primarily through assignment of yang. On the other hand, poetic writings for embodiment of nature and seasonal scenery - as static landscape makeup of yin - were often created on or near the north and west for many times. Those writings focusing on embodiment of nature and artificial scenery as a work are divided into two categories: One category refers to author Kim In-hu's expression of semantic landscape from seasonal scenery in nature. The other refers to his depiction of realistic garden images as they are. In the Forty Eight Poems of Soswaewon, the poetic writings show that author Kim focused on embodying seasonal scenery rather than expressing human behaviors. In addition, both Poem No. 1 and Poem No. 48(last poem; titled 'Jangwon Jeyeong') were created in a same place, which author Kim sought to understand the place as a space of beginning and end where yin and yang - i.e. the principle of natural cycle - are inherent. 2. According to construction about landscape in the Forty Eight Poems of Soswaewon on the basis of Ohaeng-ron (five natural element principle), it was found that tree(木) and fire(火) are typical examples of a world combined by emanation. First, many of poetic writings depicting the sentiments of tree focused on embodying seasonal scenery and were located in the place of Ogogmun(五曲門) area in the east, from overall perspective of Soswaewon. The content of these poems shows generation and curve / straightness in flexibility and simplicity. Many of poems depicting the sentiments of fire(火) focused on embodying human behaviors, and they were created in Aeyangdan area on the south of Soswaewon over which sun rises at noon. These poems are all on a status of side movement that is characterized by emanation and ascension which belong to attributes of yang. 3. With regard to Ohaeng-ron's interpretation about landscape in the Forty Eight Poems of Soswaewon, it was found that metal(金) and water(水) are typical examples of world combined by convergence. First, it was found that all of poems depicting sentiments of metal focused on embodying seasonal scenery, and were created in a bamboo grove area on the west from overall perspective of Soswaewon. They represent scenery of autumn among 4 seasons to symbolize faithfulness vested in a man of virtue(seonbi) with integrity and righteousness. Poems depicting sentiments of water were created in vicinity of Jewoldang on the north, possibly topmost of Soswaewon. They were divided into two categories: One category refers to poems embodying actions of welcoming the first full moon deep in the night after sunset, and the other refers to poems embodying natural scenery of snowscape. All of those poems focused on expressing any atmosphere of turning into yin via convergence. 4. With regard to Ohaeng-ron's interpretation of landscape in the Forty Eight Poems of Soswaewon, it was found that poems depicting sentiments of earth(土), a complex body of convergence and emanation, were created in vicinity of mountain stream around Gwangpunggak which is located in the center of Soswaewon. These poems focused on carrying actions of author Kim by way of natural phenomena and artificial scenery.

A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, it has been employed to discover new market and/or technology opportunities and support rational decision making of business participants. The market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been a continuous demand in various fields for specific product level-market information. However, the information has been generally provided at industry level or broad categories based on classification standards, making it difficult to obtain specific and proper information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than that of previously offered. We applied Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows: First, the data related to product information is collected, refined, and restructured into suitable form for applying Word2Vec model. Next, the preprocessed data is embedded into vector space by Word2Vec and then the product groups are derived by extracting similar products names based on cosine similarity calculation. Finally, the sales data on the extracted products is summated to estimate the market size of the product groups. As an experimental data, text data of product names from Statistics Korea's microdata (345,103 cases) were mapped in multidimensional vector space by Word2Vec training. We performed parameters optimization for training and then applied vector dimension of 300 and window size of 15 as optimized parameters for further experiments. We employed index words of Korean Standard Industry Classification (KSIC) as a product name dataset to more efficiently cluster product groups. The product names which are similar to KSIC indexes were extracted based on cosine similarity. The market size of extracted products as one product category was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For the performance verification, the results were compared with actual market size of some items. The Pearson's correlation coefficient was 0.513. Our approach has several advantages differing from the previous studies. First, text mining and machine learning techniques were applied for the first time on market size estimation, overcoming the limitations of traditional sampling based- or multiple assumption required-methods. In addition, the level of market category can be easily and efficiently adjusted according to the purpose of information use by changing cosine similarity threshold. Furthermore, it has a high potential of practical applications since it can resolve unmet needs for detailed market size information in public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support program conducted by governmental institutions, as well as business strategies consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic-based word embedding module can be advanced by giving a proper order in the preprocessed dataset or by combining another algorithm such as Jaccard similarity with Word2Vec. Also, the methods of product group clustering can be changed to other types of unsupervised machine learning algorithm. Our group is currently working on subsequent studies and we expect that it can further improve the performance of the conceptually proposed basic model in this study.

A Study of the Definition and Components of Data Literacy for K-12 AI Education (초·중등 AI 교육을 위한 데이터 리터러시 정의 및 구성 요소 연구)

  • Kim, Seulki;Kim, Taeyoung
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.5
    • /
    • pp.691-704
    • /
    • 2021
  • The development of AI technology has brought about a big change in our lives. The importance of AI and data education is also growing as AI's influence from life to society to the economy grows. In response, the OECD Education Research Report and various domestic information and curriculum studies deal with data literacy and present it as an essential competency. However, the definition of data literacy and the content and scope of the components vary among researchers. Thus, we analyze the semantic similarity of words through Word2Vec deep learning natural language processing methods along with the definitions of key data literacy studies and analysis of word frequency utilized in components, to present objective and comprehensive definition and components. It was revised and supplemented by expert review, and we defined data literacy as the 'basic ability of knowledge construction and communication to collect, analyze, and use data and process it as information for problem solving'. Furthermore we propose the components of each category of knowledge, skills, values and attitudes. We hope that the definition and components of data literacy derived from this study will serve as a good foundation for the systematization and education research of AI education related to students' future competency.

A Comparison of Image Classification System for Building Waste Data based on Deep Learning (딥러닝기반 건축폐기물 이미지 분류 시스템 비교)

  • Jae-Kyung Sung;Mincheol Yang;Kyungnam Moon;Yong-Guk Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.3
    • /
    • pp.199-206
    • /
    • 2023
  • This study utilizes deep learning algorithms to automatically classify construction waste into three categories: wood waste, plastic waste, and concrete waste. Two models, VGG-16 and ViT (Vision Transformer), which are convolutional neural network image classification algorithms and NLP-based models that sequence images, respectively, were compared for their performance in classifying construction waste. Image data for construction waste was collected by crawling images from search engines worldwide, and 3,000 images, with 1,000 images for each category, were obtained by excluding images that were difficult to distinguish with the naked eye or that were duplicated and would interfere with the experiment. In addition, to improve the accuracy of the models, data augmentation was performed during training with a total of 30,000 images. Despite the unstructured nature of the collected image data, the experimental results showed that VGG-16 achieved an accuracy of 91.5%, and ViT achieved an accuracy of 92.7%. This seems to suggest the possibility of practical application in actual construction waste data management work. If object detection techniques or semantic segmentation techniques are utilized based on this study, more precise classification will be possible even within a single image, resulting in more accurate waste classification

A Study on Types and Characteristics of 'Cultural Landscapes' with Big Data Analysis: Focusing on the Case of Shinan-gun, Jeollanam-do (빅데이터 분석을 통한 '문화경관' 유형과 특성 연구: 전라남도 신안군 사례를 중심으로)

  • OH Jungshim
    • Korean Journal of Heritage: History & Science
    • /
    • v.56 no.1
    • /
    • pp.162-180
    • /
    • 2023
  • The World Heritage Committee decided to make "cultural landscapes" a world heritage category in the 16th Session of the UNESCO General Conference. The decision was made from a recognition of the importance of interactions between human beings and the natural environment or between cultural heritage and natural heritage. Many countries have created policies and institutions to protect their own cultural landscapes along with the changing times. Korea, however, has not obviously defined the concepts and categories of its cultural landscapes, but manages policies and institutions based on the concept of a scenic spot, which has some similar meanings. In addition, it even borrows the "list of landscape adjectives," one of the representative methods for managing landscapes, from foreign countries. With this background, this paper suggested how to define cultural landscapes according to the global development flow. It created a list of cultural landscape adjectives by gathering the adjectives that can properly express local cultural landscapes in Korea. In particular, it collected 4,556 articles from a local newspaper by focusing on the case of Shinan-gun, Jeollanam-do, and analyzed key words and adjectives included in them by using big data analysis. The results suggested by this paper, such as the "classification table of cultural landscape types," "list of cultural landscape adjectives" and "network map of nouns/adjectives" can be applied to research on other localities, and furthermore, used as basic data for finding and protecting the characteristics of local cultural landscapes in Korea.

Analyzing Different Contexts for Energy Terms through Text Mining of Online Science News Articles (온라인 과학 기사 텍스트 마이닝을 통해 분석한 에너지 용어 사용의 맥락)

  • Oh, Chi Yeong;Kang, Nam-Hwa
    • Journal of Science Education
    • /
    • v.45 no.3
    • /
    • pp.292-303
    • /
    • 2021
  • This study identifies the terms frequently used together with energy in online science news articles and topics of the news reports to find out how the term energy is used in everyday life and to draw implications for science curriculum and instruction about energy. A total of 2,171 online news articles in science category published by 11 major newspaper companies in Korea for one year from March 1, 2018 were selected by using energy as a search term. As a result of natural language processing, a total of 51,224 sentences consisting of 507,901 words were compiled for analysis. Using the R program, term frequency analysis, semantic network analysis, and structural topic modeling were performed. The results show that the terms with exceptionally high frequencies were technology, research, and development, which reflected the characteristics of news articles that report new findings. On the other hand, terms used more than once per two articles were industry-related terms (industry, product, system, production, market) and terms that were sufficiently expected as energy-related terms such as 'electricity' and 'environment.' Meanwhile, 'sun', 'heat', 'temperature', and 'power generation', which are frequently used in energy-related science classes, also appeared as terms belonging to the highest frequency. From a network analysis, two clusters were found including terms related to industry and technology and terms related to basic science and research. From the analysis of terms paired with energy, it was also found that terms related to the use of energy such as 'energy efficiency,' 'energy saving,' and 'energy consumption' were the most frequently used. Out of 16 topics found, four contexts of energy were drawn including 'high-tech industry,' 'industry,' 'basic science,' and 'environment and health.' The results suggest that the introduction of the concept of energy degradation as a starting point for energy classes can be effective. It also shows the need to introduce high-tech industries or the context of environment and health into energy learning.