• Title/Summary/Keyword: word-form

Search Result 382, Processing Time 0.032 seconds

Attention-based word correlation analysis system for big data analysis (빅데이터 분석을 위한 어텐션 기반의 단어 연관관계 분석 시스템)

  • Chi-Gon, Hwang;Chang-Pyo, Yoon;Soo-Wook, Lee
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.27 no.1
    • /
    • pp.41-46
    • /
    • 2023
  • Recently, big data analysis can use various techniques according to the development of machine learning. Big data collected in reality lacks an automated refining technique for the same or similar terms based on semantic analysis of the relationship between words. Since most of the big data is described in general sentences, it is difficult to understand the meaning and terms of the sentences. To solve these problems, it is necessary to understand the morphological analysis and meaning of sentences. Accordingly, NLP, a technique for analyzing natural language, can understand the word's relationship and sentences. Among the NLP techniques, the transformer has been proposed as a way to solve the disadvantages of RNN by using self-attention composed of an encoder-decoder structure of seq2seq. In this paper, transformers are used as a way to form associations between words in order to understand the words and phrases of sentences extracted from big data.

A Study on the Use of Stopword Corpus for Cleansing Unstructured Text Data (비정형 텍스트 데이터 정제를 위한 불용어 코퍼스의 활용에 관한 연구)

  • Lee, Won-Jo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.891-897
    • /
    • 2022
  • In big data analysis, raw text data mostly exists in various unstructured data forms, so it becomes a structured data form that can be analyzed only after undergoing heuristic pre-processing and computer post-processing cleansing. Therefore, in this study, unnecessary elements are purified through pre-processing of the collected raw data in order to apply the wordcloud of R program, which is one of the text data analysis techniques, and stopwords are removed in the post-processing process. Then, a case study of wordcloud analysis was conducted, which calculates the frequency of occurrence of words and expresses words with high frequency as key issues. In this study, to improve the problems of the "nested stopword source code" method, which is the existing stopword processing method, using the word cloud technique of R, we propose the use of "general stopword corpus" and "user-defined stopword corpus" and conduct case analysis. The advantages and disadvantages of the proposed "unstructured data cleansing process model" are comparatively verified and presented, and the practical application of word cloud visualization analysis using the "proposed external corpus cleansing technique" is presented.

Analysis of Descriptive Lectures Evaluation using Text Mining: Comparative analysis pre and post COVID-19

  • Lee, Sang-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.10
    • /
    • pp.211-222
    • /
    • 2022
  • The purpose of this study is to indicate the direction of the future university classes in the post-COVID era, comparing and analyzing lecture evaluation of pre and post COVID-19. To this end, 4 yeard data were used from 2018 to 2019 for pre COVID-19 and form 2020 to 2021 data for post COVID-19. The results were as follows. In the case of liberal arts, "assignments" was the word with the highest frequency and degree centrality(DC) regardless of pre and post-COVID-19 In the major, "understanding" appeared as the most important word. The result of the ego network analysis indicated that "video lecture" and "non-face-to-face classes" were difficult and "interaction" between the professor and the students was important. As a results, it is important to reduce the weight of assignments and increase interaction with students in liberal arts classes. In the case of majors, it is necessary to operate face-to-face classes rather than non-face-to-face classes, and to organize the contents of videos without difficulty.

Poem in Ca Trù: Type, Structure, Content (베트남의 음악시, 까쭈: 형식, 구조, 내용)

  • Nguyen, Duc Mau
    • SUVANNABHUMI
    • /
    • v.2 no.1
    • /
    • pp.95-110
    • /
    • 2010
  • Poem plays an important role in Ca trù. Many music researchers say that singing Ca trù is singing poem. Of 46 tunes of Ca trù, there are more than 10 tunes expressed in available poems or styles of poetry; for example: in the tune Tỳ bà, the performer could sing Tỳ bà hành by Bạch Cư Dị being converted into seven-seven-six-eight-word-meter; in the reciting poem tunes, just reciting 5 Thien thai poems by Tào Đường or 3 Thanh Bình tune poems by Lý Bạch; in the reciting poetic essay (phú), reciting Tien Xich Bich and Hau Xich Bich by Tô Đông Pha. Others like bắc phản, cung bắc sometimes used six-eight word meters. Structurally, those are available for familiar types and beyond the scope of particular creativity because they do not originate from Ca trù's activity environment like recitative. Recitative is the main tune of Ca trù and has become an independent poem type. In terms of literature, recitative has a particular form structure and special type content. Unlike other tunes of Ca trù that only stop at some fixed works, recitative has increased to thousands of works in quantity and has been composed for many centuries. For those reasons, we confined ourselves the research to the creation which is the most typical of Ca tru: The recitative.

  • PDF

Morphological Parafoveal Preview Benefit Effects in Reading Korean (우리글 읽기에서 형태소정보의 미리보기 효과)

  • Lee, Sangeun;Choo, Hyeree;Koh, Sungryong
    • Korean Journal of Cognitive Science
    • /
    • v.31 no.2
    • /
    • pp.25-54
    • /
    • 2020
  • While there is no evidence for parafoveal processing in alphabetic languages such as English and Finnish, there is some evidence that morphological information is processed in syllabic languages like Chinese. Korean writing system, Hangul, would be able to provide morphological preview benefit effects since it is an "alphabetic syllabary" which contains both alphabetic and syllabic features. This study explored morphological parafoveal preview benefit effects during reading Korean using irregular verbs, which have phonological and orthographical differences between fundamental and conjugated forms. In the Experiment, the target word was irregular conjugated form, and there were four preview conditions: identical (e.g. 구워), fundamental form (e.g. 굽다), orthographically related (e.g. 굼다), and unrelated control (e.g. 죨어). In the result of study, identical was shortest and morphological, orthographical, unrelated preview were followed. Moreover, measures of first-pass reading of morphological preview were significantly shorter than those of unrelated control preview. This results support the hypothesis of morphological preview benefit effects in Korean. The implications of the results are discussed.

Some Developments at the Thirty-Fourth Session of the UNCITRAL Working Group II(Arbitration and Conciliation) (UNCITRAL 제2 실무작업반의 제34차 회의 동향)

  • 강병근
    • Journal of Arbitration Studies
    • /
    • v.11 no.1
    • /
    • pp.181-215
    • /
    • 2001
  • The thirty-fourth session of UNCITRAL Working Group on Arbitration was held in New York. Among the topics discussed at the session, many delegations agreed to reform the article 7 of the UNCITRAL Model Law on International Commercial Arbitration in light of the development of electronic commerce. As for the article 2(2) of the New York Convention, it was agreed to reflect the changes of the article 7 not in the form of a treaty amendment but in the form of an interpretative statement. The topic as to provisional measures has been found so difficult to reach an agreement that most of its texts submitted by the secretariat were left untouched for the lack of time. However, most provisions of the legislative texts on conciliation were dealt with by delegations. The next session is to be held in Vienna. While the Korean Arbitration Act of 1966 was fully amended in 1999, it seems interesting to look at the development in which the arbitration community of the world has already begun discussing the new dimension of the law and practice of international commercial arbitration. It may be considered early to start a new project of reforming the Korean Arbitration Act at this time when only three years passed after it was fully amended. It is, however, worthwhile to remember that some progressive efforts were aborted in amending the Arbitration Act of 1966. One of them is about the same issue on the insertion of some provisions on the enforcement of interim measures of protection to which the priority is given by the Working Group. It seems fair to say that it would not be dangerous to follow the developments and to adapt ourselves to such trends shown in the session. In Korea, the words “arbitration” and “conciliation” are misleadingly interchanged although these two words should be differentiated from each other in the sense of third-party binding decision. It is self-evident from the Korean Arbitration Act and judicial decisions that arbitral awards bind the disputing parties and are to be treated as final judgements by the competent courts. It is, however, not uncommon to find that the word “arbitration” is misinterpreted as having the same meaning of the word “conciliation”. One of the reasons for the confusion is that many legislations in Korea provide for conciliation as having the meaning of arbitration and vice versa. It may be probable that the proposed legislative texts on conciliation could be a kind of useful method to prevent such confusion from being uncontrollable. It is, therefore, necessary that the legislative texts should be introduced into Korea as a legislation on conciliation.

  • PDF

디자인 지식창출을 위한 검색시스템 구축

  • 임옥수;오민권;정인수;유의상
    • Archives of design research
    • /
    • v.16 no.1
    • /
    • pp.35-44
    • /
    • 2003
  • In the past era, acquisition and utilization of useful information was the main origin of competition. Nowadays, unlike that era, is the era of knowledge information(management) in which we should create a new knowledge on the basis of information and apply it to the field of practice. And more acquisition of information is no more the competitive power of any person, any company and any nation because in such the era of knowledge management, anyone can access and get the information he needs, utilizing internet-based searching system. Such demands of the times of knowledge management change rapidly in each field through knowledge management system and researches about knowledge management are actively processed in various academic branches. However, in our field of design, researches about those demands(knowledge management) still remain on the level of one-dimensional searching service for general data about design. Therefore, in this study, we developed building database of researches on form, color, aesthetical elements, preference image word, satisfaction etc. about CI/BI of home electronics goods, living goods, apparels, and food goods companies, also suggesting searching system through which you can obtain useful data and information helpful for designers to process CI/BI works of new product by using that database. Especially, in case of developing specific CI/BI, various search results through help of suggested system will supply a useful design concept. And more, cross table which is the result of analysis two-dimensional categorical data about existing design factors(such as form, color, aesthetical elements, preference image word, and satisfaction) will make contribution for designers to create a new design knowledge.

  • PDF

Effects of Computerized Cognitive Training Program Using Artificial Intelligence Motion Capture on Cognitive Function, Depression, and Quality of Life in Older Adults With Mild Cognitive Impairment During COVID-19: Pilot Study (인공지능 동작 인식을 활용한 전산화인지훈련이 코로나-19 기간 동안 경도 인지장애 고령자의 인지 기능, 우울, 삶의 질에 미치는 영향: 예비 연구)

  • Park, Ji Hyeun;Lee, Gyeong A;Lee, Jiyeon;Park, Young Uk;Park, Ji-Hyuk
    • Therapeutic Science for Rehabilitation
    • /
    • v.12 no.2
    • /
    • pp.85-98
    • /
    • 2023
  • Objective : We investigated the efficacy of an artificial intelligence computerized cognitive training program using motion capture to identify changes in cognition, depression, and quality of life in older adults with mild cognitive impairment. Methods : A total of seven older adults (experimental group = 4, control group = 3) participated in this study. During the COVID-19 period from October to December 2021, we used a program, "MOOVE Brain", that we had developed. The experimental group performed the program 30 minutes 3×/week for 1 month. We analyzed patients scores from the Korean version of the Mini-Mental State Examination-2, the Consortium to Establish a Registry for Alzheimer's Disease Assessment Packet for Daily Life Evaluation, the short form Geriatric Depression Scale, and Geriatric Quality of Life Scale. Results : We observed positive changes in the mean scores of the Stroop Color Test (attention), Stroop Color/Word Test (executive function), SGDS-K (depression), and GQOL (QoL). However, these changes did not reach statistical significance for each variable. Conclusion : The study results from "MOOVE Brain" can help address cognitive and psychosocial issues in isolated patients with MCI during the COVID-19 pandemic or those unable to access in-person medical services.

A Text Mining-based Intrusion Log Recommendation in Digital Forensics (디지털 포렌식에서 텍스트 마이닝 기반 침입 흔적 로그 추천)

  • Ko, Sujeong
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.2 no.6
    • /
    • pp.279-290
    • /
    • 2013
  • In digital forensics log files have been stored as a form of large data for the purpose of tracing users' past behaviors. It is difficult for investigators to manually analysis the large log data without clues. In this paper, we propose a text mining technique for extracting intrusion logs from a large log set to recommend reliable evidences to investigators. In the training stage, the proposed method extracts intrusion association words from a training log set by using Apriori algorithm after preprocessing and the probability of intrusion for association words are computed by combining support and confidence. Robinson's method of computing confidences for filtering spam mails is applied to extracting intrusion logs in the proposed method. As the results, the association word knowledge base is constructed by including the weights of the probability of intrusion for association words to improve the accuracy. In the test stage, the probability of intrusion logs and the probability of normal logs in a test log set are computed by Fisher's inverse chi-square classification algorithm based on the association word knowledge base respectively and intrusion logs are extracted from combining the results. Then, the intrusion logs are recommended to investigators. The proposed method uses a training method of clearly analyzing the meaning of data from an unstructured large log data. As the results, it complements the problem of reduction in accuracy caused by data ambiguity. In addition, the proposed method recommends intrusion logs by using Fisher's inverse chi-square classification algorithm. So, it reduces the rate of false positive(FP) and decreases in laborious effort to extract evidences manually.

Study on the User Experience Design for Emotional Marketing in an Transmedia Environment (트랜스미디어 환경에서의 감성마케팅을 위한 사용자 경험디자인에 대한 고찰)

  • Huh, Jin
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.9
    • /
    • pp.194-201
    • /
    • 2012
  • The expansion of media is in close connection with the expansion of awareness. The invention of characters enabled mankind to cross over time and space. Machines led to the development of body functions and electricity led to the expansion of space and time. Computers are the extension of the human brain and the advent of the internet led to the expansion of relationships. Even at this moment, media is unremittingly progressing like a spread of a mutant virus, and has resulted in fusion and complex phenomena such as convergence and hybrid media. Transmedia is a compound word formed by the word "Trans" which means traverse, transcend, penetrate or change, and the word "Media" and has the meaning "media which transcends media" which embraces all of modern day media. However, unlike other fusion or complex media, it is different in that it is not a combination of technologies but a combination of technology and emotion. Thus, transmedia should be recognized as a form of media that carries a significant meaning from the user experience aspect as it must simultaneously satisfy both "emotional awareness", which appeals to the human emotion, and "conscious awareness" of mankind, which arises out of the digital technology considered to be important in the smart-era society. This study first examines the concept of transmedia, and then examines the role of user experience design which triggers conscious thinking and strategies for emotional marketing. This study aims to be recognized as a matter for consideration with respect to the development stage for the establishment of a steady communication relationship between developers and designers, as well as communication with users.