• Title/Summary/Keyword: Semantic Values

Reconsideration of the Linguistic Category of Mediation in Language: a Comparative Approach between French and Korean (언어의 '매개작용' 범주 고찰: 프랑스어와 한국어 비교 연구)

  • Suh, Jungyeon
    • Cross-Cultural Studies
    • /
    • v.46
    • /
    • pp.297-325
    • /
    • 2017
  • This paper reconsiders the evidential category (or mediation category) in languages with language-specific values, focusing on Korean and French evidentials. We analyze how evidentials are represented in both languages, including their linguistic markers (grammatical, lexical, or discursive) and their semantic meanings. Building on previous studies in general linguistics, we reconsider the semantic meanings of the grammatical markers in both languages, namely the so-called Korean retrospective marker '-te-' and the French conditional, within the framework of the enunciative operation theory of Desclés & Guentchéva (2000), which proposes classifying types of discourse with language-independent descriptive tools derived from the enunciation theories of Bally (1965), Benveniste (1956), and Culioli (1973). Through this approach, we aim to help establish a linguistic basis both for general linguistic research that seeks the invariant meaning of linguistic evidentials and their system, and for applied linguistics in the field of language engineering.

Hierarchical Overlapping Clustering to Detect Complex Concepts (중복을 허용한 계층적 클러스터링에 의한 복합 개념 탐지 방법)

  • Hong, Su-Jeong;Choi, Joong-Min
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.111-125
    • /
    • 2011
  • Clustering is a process of grouping similar or relevant documents into a cluster and assigning a meaningful concept to the cluster. By this process, clustering facilitates fast and correct search for the relevant documents by narrowing down the range of searching only to the collection of documents belonging to related clusters. For effective clustering, techniques are required for identifying similar documents and grouping them into a cluster, and discovering a concept that is most relevant to the cluster. One of the problems often appearing in this context is the detection of a complex concept that overlaps with several simple concepts at the same hierarchical level. Previous clustering methods were unable to identify and represent a complex concept that belongs to several different clusters at the same level in the concept hierarchy, and also could not validate the semantic hierarchical relationship between a complex concept and each of simple concepts. In order to solve these problems, this paper proposes a new clustering method that identifies and represents complex concepts efficiently. We developed the Hierarchical Overlapping Clustering (HOC) algorithm that modified the traditional Agglomerative Hierarchical Clustering algorithm to allow overlapped clusters at the same level in the concept hierarchy. The HOC algorithm represents the clustering result not by a tree but by a lattice to detect complex concepts. We developed a system that employs the HOC algorithm to carry out the goal of complex concept detection. This system operates in three phases; 1) the preprocessing of documents, 2) the clustering using the HOC algorithm, and 3) the validation of semantic hierarchical relationships among the concepts in the lattice obtained as a result of clustering. The preprocessing phase represents the documents as x-y coordinate values in a 2-dimensional space by considering the weights of terms appearing in the documents. 
First, it refines the documents by applying stopword removal and stemming to extract index terms. Then, each index term is assigned a TF-IDF weight, and the x-y coordinate value of each document is determined by combining the TF-IDF values of the terms in it. The clustering phase uses the HOC algorithm, in which the similarity between documents is calculated using the Euclidean distance. Initially, a cluster is generated for each document by grouping the documents that are closest to it. Then, the distance between every pair of clusters is measured, and the closest clusters are grouped into a new cluster. This process is repeated until the root cluster is generated. In the validation phase, feature selection is applied to check whether the cluster concepts built by the HOC algorithm have meaningful hierarchical relationships. Feature selection is a method of extracting key features from a document by identifying important and representative terms in the document and assigning weights to them. To select key features correctly, a method is needed to determine how much each term contributes to the class of the document. Among several methods that achieve this goal, this paper adopts the $\chi^2$ statistic, which measures the degree of dependency between a term t and a class c and represents the relationship between t and c as a numerical value. To demonstrate the effectiveness of the HOC algorithm, a series of performance evaluations is carried out using the well-known Reuters-21578 news collection. The results show that the HOC algorithm contributes greatly to detecting and producing complex concepts by generating the concept hierarchy as a lattice structure.
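The $\chi^2$ dependency between a term t and a class c mentioned in this abstract can be illustrated with a short sketch. It uses the common 2x2 contingency-table form of the statistic from the text categorization literature; the abstract does not give the paper's exact formula, so the function name, the argument names, and the choice of this form are assumptions made for illustration.

```python
def chi_square(n_t_c: int, n_t_notc: int, n_nott_c: int, n_nott_notc: int) -> float:
    """Chi-square statistic for a term t and a class c.

    n_t_c       : documents in class c that contain t
    n_t_notc    : documents outside c that contain t
    n_nott_c    : documents in class c that do not contain t
    n_nott_notc : documents outside c that do not contain t
    (Hypothetical argument names; the counts form a 2x2 contingency table.)
    """
    n = n_t_c + n_t_notc + n_nott_c + n_nott_notc
    denom = (
        (n_t_c + n_nott_c)          # documents in c
        * (n_t_notc + n_nott_notc)  # documents outside c
        * (n_t_c + n_t_notc)        # documents containing t
        * (n_nott_c + n_nott_notc)  # documents not containing t
    )
    if denom == 0:
        # A degenerate table carries no evidence of dependency.
        return 0.0
    return n * (n_t_c * n_nott_notc - n_nott_c * n_t_notc) ** 2 / denom
```

A value of zero means the term occurs independently of the class; larger values indicate that the term is more characteristic of (or more strongly excluded from) the class.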

Selective Word Embedding for Sentence Classification by Considering Information Gain and Word Similarity (문장 분류를 위한 정보 이득 및 유사도에 따른 단어 제거와 선택적 단어 임베딩 방안)

  • Lee, Min Seok;Yang, Seok Woo;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.105-122
    • /
    • 2019
  • Dimensionality reduction is one of the methods for handling big data in text mining. For dimensionality reduction, we should consider the density of the data, which has a significant influence on the performance of sentence classification. Higher-dimensional data requires a great deal of computation, which can lead to high computational cost and overfitting in the model. Thus, a dimension reduction process is necessary to improve the performance of the model. Diverse methods have been proposed, ranging from merely reducing noise in the data, such as misspellings or informal text, to incorporating semantic and syntactic information. In addition, the representation and selection of text features affect the performance of the classifier in sentence classification, which is one of the fields of Natural Language Processing. The common goal of dimension reduction is to find a latent space that is representative of the raw data in the observation space. Existing methods use various algorithms for dimensionality reduction, such as feature extraction and feature selection. In addition to these algorithms, word embeddings, which learn low-dimensional vector space representations of words and can capture semantic and syntactic information from data, are also used. To improve performance, recent studies have suggested methods in which the word dictionary is modified according to the positive and negative scores of pre-defined words. The basic idea of this study is that similar words have similar vector representations. Once the feature selection algorithm identifies words that are not important, we assume that words similar to those words also have no impact on sentence classification. This study proposes two ways to achieve more accurate classification: conducting selective word elimination under specific rules, and constructing word embeddings based on Word2Vec. 
To select words of low importance from the text, we use information gain to measure importance and cosine similarity to search for similar words. First, we eliminate words that have comparatively low information gain values from the raw text and build the word embedding. Second, we additionally select words that are similar to the words with low information gain values and build the word embedding. Finally, the filtered text and word embeddings are fed to two deep learning models: a Convolutional Neural Network and an Attention-Based Bidirectional LSTM. This study uses customer reviews of Kindle on Amazon.com, IMDB, and Yelp as datasets and classifies each dataset using the deep learning models. Reviews that received more than five helpful votes and whose ratio of helpful votes exceeded 70% were classified as helpful reviews. Yelp, however, shows only the number of helpful votes. We extracted 100,000 reviews that received more than five helpful votes by random sampling from 750,000 reviews. Minimal preprocessing, such as removing numbers and special characters from the text, was applied to each dataset. To evaluate the proposed methods, we compared their performance with that of Word2Vec and GloVe word embeddings that used all the words. We showed that one of the proposed methods performs better than the embeddings with all the words. Removing unimportant words improves performance; however, removing too many words lowers it. Future research should consider diverse preprocessing methods and an in-depth analysis of word co-occurrence for measuring similarity among words. Also, we applied the proposed method only with Word2Vec. Other embedding methods such as GloVe, fastText, and ELMo can be combined with the proposed methods, making it possible to identify the best combinations of word embedding methods and elimination methods.
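The two elimination steps described in this abstract (dropping low information gain words, then also dropping words whose vectors are close to them) can be sketched as follows. This is a minimal illustration, not the authors' implementation: documents are assumed to be sets of tokens, `vectors` is a hypothetical dictionary of pre-trained word vectors, and both thresholds are invented parameters.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(docs, labels, word):
    """Reduction in label entropy from knowing whether `word` occurs in a document."""
    with_word = [y for d, y in zip(docs, labels) if word in d]
    without_word = [y for d, y in zip(docs, labels) if word not in d]
    total = len(labels)
    conditional = (
        len(with_word) / total * entropy(with_word)
        + len(without_word) / total * entropy(without_word)
    )
    return entropy(labels) - conditional


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def words_to_remove(docs, labels, vectors, ig_threshold, sim_threshold):
    """Return (low-gain words, extra words similar to a low-gain word)."""
    vocab = set().union(*docs)
    # Step 1: words whose information gain falls below the threshold.
    low = {w for w in vocab if information_gain(docs, labels, w) < ig_threshold}
    # Step 2: remaining words whose vector is close to some low-gain word.
    similar = {
        w
        for w in vocab - low
        if w in vectors
        and any(
            u in vectors and cosine(vectors[w], vectors[u]) >= sim_threshold
            for u in low
        )
    }
    return low, similar
```

The first method in the abstract would remove only the first returned set before training the embedding; the second would remove both sets.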

Signification Education for Communication of Creative Semiotic System on Social and Cultural Value - Focused on Advertising Story - ('사회문화적 가치'에 대한 창조적 기호계(semiosphere)와 의사소통을 위한 의미 표현 교육 - 광고스토리를 중심으로 -)

  • Lim, Ji-Won
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.5
    • /
    • pp.145-153
    • /
    • 2019
  • The present study is a discussion in which the flow of 'social and cultural values' inherent in the creative advertising story is considered against Barthes's semiotics and the creative symbol system, and attempted to reproduce the work through the cognitive thinking of the inmates. The interaction of correct social and cultural communication is not just a strategy for persuasion and effectiveness. Starting with these issues, I thought that experiencing the 'symbolic production' and 'cognition interpretation' of the most creative, aesthetic and implicit advertising stories was the realization of concrete cultural values. The reason why I pay attention to advertising as a target tool of the original school is that it gives anyone access to the social and cultural values based on the productivity of meaning, the sharing of meaning and social small-call work by paying attention to the most implicit symbols in a short period of time. I also think that with the trend of the times, it is well worth it as a tool of positive communication for social and cultural member harmony and solving future problems. The reality of social and cultural advertising stories conducted in conjunction with the analysis of meaning at the cognitive thought level is very appropriate to apply in creative classes for college students. The Dong-A Ilbo is a discussion that suggested that the work of realizing the cognitive meaning of advertising stories, a "symbol complex" based on creativity in a complex, multi-media era, will become an age-old communication tool to join university students' strategies for solving future problems.

Semantics-Preserving Mutation-Based Fuzzing on JavaScript Interpreters (자바스크립트 엔진에 대한 시맨틱 보존적 변이기반 퍼징)

  • Oh, DongHyeon;Choi, JaeSeung;Cha, SangKil
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.30 no.4
    • /
    • pp.573-582
    • /
    • 2020
  • Fuzzing is a method of testing software by randomly generating test cases. Since its introduction, a variety of fuzzing techniques have been studied. Among them, mutation-based fuzzing is an efficient method that finds real-world bugs even though it uses simple approaches such as probabilistic bit-flipping and character substitution. However, general mutation techniques are difficult to apply to interpreter fuzzing because an interpreter requires syntactically and semantically correct inputs. In this paper, we present a novel mutation-based fuzzing technique for JavaScript interpreters that uses dynamic data flow analysis. To this end, we implement JMFuzzer, which takes syntax and semantics into account to generate various types of mutated test cases that run normally, without runtime errors, in a JavaScript interpreter. As a result, we found numerous unknown vulnerabilities in the latest JavaScript interpreters and reported all of them to the vendors.
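For context, the general-purpose mutations that this abstract contrasts with its own approach, bit-flipping and character substitution, can be sketched in a few lines. This is not JMFuzzer and does not preserve syntax or semantics; the function names and the substitution alphabet are assumptions chosen for illustration.

```python
import random

# Hypothetical substitution alphabet; a real fuzzer would choose its own.
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789(){};=+-*/ "


def flip_random_bit(data: bytes, rng: random.Random) -> bytes:
    """Return a copy of `data` with exactly one randomly chosen bit flipped."""
    if not data:
        return data
    buf = bytearray(data)
    pos = rng.randrange(len(buf))
    buf[pos] ^= 1 << rng.randrange(8)
    return bytes(buf)


def substitute_char(text: str, rng: random.Random, alphabet: str = ALPHABET) -> str:
    """Return a copy of `text` with one position replaced by a random character."""
    if not text:
        return text
    pos = rng.randrange(len(text))
    return text[:pos] + rng.choice(alphabet) + text[pos + 1:]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are reproducible
    seed_input = "var x = 1; print(x + 2);"
    for _ in range(3):
        print(substitute_char(seed_input, rng))
```

Mutants produced this way are frequently rejected by a JavaScript parser before any interesting code runs, which is the limitation the abstract describes.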

Students' Perception of Smart Learning in Distance Higher Education (스마트러닝에 대한 원격대학 학습자의 인식)

  • Choi, Hyoseon;Woo, Younghee;Jung, Hyojung
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.10
    • /
    • pp.584-593
    • /
    • 2013
  • The purpose of this research is to analyze students' perceptions of smart learning, focusing on its definitions, roles, and values in distance higher education. A total of 1,950 students of open university 'A' participated in the online survey. The results show that the students viewed smart learning as more 'absorbing', 'interactive', and 'collaborative' than existing e-learning, as it compiles their experiences into learning. However, the respondents' perceptions of smart learning varied among age groups: more students in their 40s and 50s than students in their 20s and 30s responded that smart learning was 'customized', 'humanlike', 'interactive', 'comfortable', 'stable', 'familiar', 'unstressful', and 'practical', and they tended to view the compilation of learner experiences as the main feature of smart learning.

A Study on the Interpretation of Amenity Structure for the Creation of Urban Landscape (쾌적한 도시환경의 창출을 위한 도시 어메니티 구조에 관한 연구)

  • 김승환;변문기
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.18 no.4
    • /
    • pp.101-115
    • /
    • 1991
  • This study establishes a method for evaluating the urban amenity structure of Pusan city. Survey sites were selected from 41 regions on the basis of questionnaires: Taejong-dae and Haeun-dae as the seascape type, Pumosa and Daesin-park as the mountain type, Daechong-park and Seongjigok-park as the mixed type, and Chungryulsa, Yongdoosan-park, and the U.N. Cemetery as the urban type. The amenity elements extracted were natural environment elements, including convex landforms such as beaches, reservoirs, valleys, and mountains, and plant elements, including woods and flower beds, which raised amenity; social environment elements, including children's play and rest for the aged; and structural elements, including historic and memorial structures and high buildings. The amenity elements making up each space by region were extracted using the Semantic Differential (SD) method. According to the factor analysis of the SD scale values, Kaiser's measure of sampling adequacy for the 24 variables is 0.8602, which is very high. Four factors, namely pleasantness, healthiness, convenience, and safety, accounted for 54.42 percent of the total variance. Multiple regression yielded the following model: $Y = 1.6636 + 0.3684X_{4} + 0.1955X_{11} + 0.1614X_{15} - 0.1688X_{23} + 0.1468X_{24}$, where $Y$ is amenity, $X_{4}$ is beautiful-ugly, $X_{11}$ is clean-dirty, $X_{15}$ is creative-imitative, $X_{23}$ is cozy-dreary, and $X_{24}$ is free-restrained. All variables in the model were significant at the 0.001 level. According to the results of the regression on satisfaction, the satisfaction variables affecting amenity are the size of green space, the condition of management, and harmony with the surroundings. Taking these factors into account could improve the amenity of each region and, further, of Pusan city.
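The regression model reported in this abstract can be evaluated directly. The sketch below only encodes the published intercept and coefficients; the function name and the dictionary keys are hypothetical, and the numeric range of the SD scale scores is not stated in the abstract, so any input values are assumptions.

```python
# Intercept and coefficients as reported in the abstract.
INTERCEPT = 1.6636
COEFFICIENTS = {
    "X4": 0.3684,    # beautiful-ugly
    "X11": 0.1955,   # clean-dirty
    "X15": 0.1614,   # creative-imitative
    "X23": -0.1688,  # cozy-dreary
    "X24": 0.1468,   # free-restrained
}


def predict_amenity(scores: dict) -> float:
    """Predicted amenity Y for SD scale scores keyed by variable name."""
    return INTERCEPT + sum(
        coef * scores[name] for name, coef in COEFFICIENTS.items()
    )
```

Because $X_{23}$ (cozy-dreary) carries the only negative coefficient, raising that score lowers the predicted amenity while raising any of the other four increases it.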

Color Recommendation for Text Based on Colors Associated with Words

  • Liba, Saki;Nakamura, Tetsuaki;Sakamoto, Maki
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.17 no.1
    • /
    • pp.21-29
    • /
    • 2012
  • In this paper, we propose a new method for selecting colors that represent the meaning of text content, based on the cognitive relation between words and colors. Our method builds on a previous study revealing that certain words are crucial for estimating the colors associated with the meaning of text content. Using the associative probability of each color with a given word and the strength of the word's color association, we estimate the probability of colors associated with a given text. The goal of this study is to propose a system that recommends cognitively plausible colors for the meaning of the input text. To build a versatile and efficient database for our system, two psychological experiments were conducted using news site articles. In experiment 1, we collected 498 words that participants chose as having a strong association with color. In experiment 2, we investigated which color was associated with each word. In addition to those data, we employed estimated values of the strength of color association and of the associated colors for the words in a very large newspaper corpus (approximately 130,000 words), based on the similarity between words obtained by Latent Semantic Analysis (LSA). Our method therefore allows us to select colors for a large variety of words or sentences. Finally, by comparing the colors answered by participants with the colors estimated by our method, we verified that our system succeeded in proposing colors that are cognitively associated with the meaning of the input text. Our system is expected to be useful in various situations, such as data visualization, information retrieval, and art or web page design.
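One plausible way to combine the two quantities named in this abstract, the associative probability of each color with a word and the strength of the word's color association, is a strength-weighted average over the words of the text. The abstract does not state the actual combination rule, so the weighting below, the function name, and the data layout are all assumptions made for illustration.

```python
def color_distribution(words, color_given_word, strength):
    """Estimate a probability distribution over colors for a text.

    words            : tokens of the input text
    color_given_word : {word: {color: associative probability}}
    strength         : {word: strength of the word's color association}
    (Hypothetical data layout; the weighting scheme is an assumed one.)
    """
    totals = {}
    for word in words:
        if word not in color_given_word:
            continue  # words without color data contribute nothing
        weight = strength.get(word, 0.0)
        for color, prob in color_given_word[word].items():
            totals[color] = totals.get(color, 0.0) + weight * prob
    norm = sum(totals.values())
    if norm == 0:
        return {}
    # Normalize so the estimated color probabilities sum to one.
    return {color: value / norm for color, value in totals.items()}
```

Under this scheme, words with a strong color association dominate the estimate, which matches the abstract's point that some words are crucial for estimating the colors of a text.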

A Study on Craft Design Using Storytelling -Focusing on the story of the Byeoljubujeon- (스토리텔링을 활용한 공예디자인에 관한 연구 -별주부전 이야기를 중심으로-)

  • Choi, Jung-Hwa
    • Journal of Digital Convergence
    • /
    • v.15 no.8
    • /
    • pp.359-366
    • /
    • 2017
  • As the quality of life rises and the aesthetic value of products is emphasized, designers tend to consider consumers' sensibilities when designing goods. Storytelling is becoming more important as the factor driving product purchases shifts from the product itself to the emotions it evokes. The purpose of this paper, which aims to develop emotional-marketing craft design using storytelling, is to understand the concept of storytelling and to analyze cases of craft design that use it. Craft is a field of design in which the use value or aesthetic value of goods, along with diversity and originality, can be pursued, and it has unlimited potential. Beyond semantic value, there is now a need to develop expressive value that meets consumers' sensibilities, and this research could serve as a foundation for establishing the craft design industry as a high value-added industry.

A Study of the normativeness on the Influence of the Memphis on the Comtemporary Fashion Design - Focused on the End of the 20th Century - (멤피스(Memphis)디자인이 현대 패션에 미친 조형적 특징에 관한 연구 -20세기말을 중심으로-)

  • 임영자;한윤숙
    • Journal of the Korean Society of Costume
    • /
    • v.51 no.1
    • /
    • pp.5-20
    • /
    • 2001
  • The purpose of this study is to suggest fashion as communication for 21st-century fashion. In particular, Memphis fashion has the possibility of communicating through objects. The results of this study are as follows. First, the Memphis idea is to make design into a sophisticated, conscious instrument of communication. As Memphis fashion points out, design is an extraordinary tool for communicating because its intrinsic characteristic is that it is used and distributed anyway, even without communicating anything. Memphis fashion tries to connect design and industry to the broader culture within which fashion moves. Second, using different materials provides not only new structural possibilities but, above all, new semantic and metaphoric possibilities, other modes of communication, another language, and even a change of direction, a broadening of perspective, the appropriation and digestion of new values, and the concomitant rejection of traditional structures that renewal always involves. Memphis fashion works on the fabric of contemporaneity (lurex yarn, latex, chrome metal, and steel), and contemporaneity means computers, electronics, a new awareness of the body, mass exercise, and tourism. Third, color in Memphis has never been an ideological vehicle. As with decoration, it is born with the design, forming an integral part of the structure. It alters the object's molecules. It works as a mass, as an intrinsic feature of a certain form and volume. Memphis fashion introduced ultramodern science into experimental and creative work, such as optical motifs and the brilliant colors of electronic media, in addition to metallic fabrics and high-tech synthetic fibers. It is a color with pop-culture connotations that wavers between technological allusions and McDonald's.
