• Title/Summary/Keyword: 텍스트 연구 (text research)

A Study on Word Cloud Techniques for Analysis of Unstructured Text Data (비정형 텍스트 테이터 분석을 위한 워드클라우드 기법에 관한 연구)

  • Lee, Won-Jo
    • The Journal of the Convergence on Culture Technology / v.6 no.4 / pp.715-720 / 2020
  • In big data analysis, text data is mostly unstructured and large in volume, and it has been difficult to analyze because analysis techniques were not well established. This study therefore examines the potential for commercialization of the big data word cloud technique, one of the text data analysis techniques, by verifying its usefulness and identifying the problems that arise when it is applied. The limitations and problems of the technique are derived through a visualization analysis of the "President UN Speech" using the word cloud technique in R. An improved model is then proposed to address these problems, offering an efficient method for practical application of the word cloud technique.
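
A word cloud sizes each word by how often it occurs, so the step underneath the visualization is a word-frequency count. The sketch below illustrates that step only. It is written in Python rather than the R program the paper uses, and the stop-word list, the `word_frequencies` helper, and the sample sentence are all hypothetical stand-ins (the sample is not the speech analyzed in the paper).

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "we"}


def word_frequencies(text, top_n=5):
    """Return the top_n most frequent non-stop-words as (word, count) pairs."""
    # Lowercase the text and keep runs of letters (and apostrophes) as words.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS)
    return counts.most_common(top_n)


# Made-up sample text, used only to exercise the function.
sample = "Peace is the goal. Peace and cooperation build trust, and trust sustains peace."
top_words = word_frequencies(sample, top_n=3)
print(top_words)
```

The resulting counts are what a word cloud renderer would map to font sizes. Problems of the kind the abstract alludes to, such as uninformative high-frequency words, would show up at this counting stage.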

A weighted method for evaluating software quality (가중치를 적용한 소프트웨어 품질 평가 방법)

  • Jung, Hye Jung
    • Journal of Digital Convergence / v.19 no.8 / pp.249-255 / 2021
  • This study proposes a method for determining weights for the eight quality characteristics suggested by international standards (functionality, reliability, usability, maintainability, portability, efficiency, security, and interoperability), focusing on software test reports. Currently, software quality evaluation applies the same weight to all eight quality characteristics and takes the arithmetic average of the test results. In this study, weights for the eight quality characteristics were derived from text analysis of test reports and applied to the test reports of two products. The average of the test reports computed with the weighted quality characteristics was confirmed to be more efficient.
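
The difference between the equal-weight arithmetic average and a weighted average can be shown with a small sketch. The scores and weights below are hypothetical values chosen for illustration; they are not taken from the paper, and the paper's actual procedure for deriving weights from text analysis is not reproduced here.

```python
def arithmetic_mean(scores):
    """Equal-weight average over all quality characteristics."""
    return sum(scores.values()) / len(scores)


def weighted_mean(scores, weights):
    """Average in which each characteristic contributes in proportion to its weight."""
    total_weight = sum(weights[name] for name in scores)
    return sum(scores[name] * weights[name] for name in scores) / total_weight


# Hypothetical test scores (0-100) for the eight quality characteristics.
scores = {
    "functionality": 90, "reliability": 80, "usability": 70, "maintainability": 60,
    "portability": 50, "efficiency": 80, "security": 90, "interoperability": 40,
}

# Hypothetical weights, e.g. how often each characteristic is mentioned in a test report.
weights = {name: 1 for name in scores}
weights["functionality"] = 4

print(arithmetic_mean(scores))
print(weighted_mean(scores, weights))
```

With these made-up numbers, the weighted average moves toward the functionality score because that characteristic carries the largest weight. When all weights are equal, the two averages coincide.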

Analysis of speech in game marketing video using text mining techniques (텍스트 마이닝 기법을 이용한 게임 마케팅 비디오에서의 스피치 분석)

  • Lee, Yeokyung;Kim, Jaejik
    • The Korean Journal of Applied Statistics / v.35 no.1 / pp.147-159 / 2022
  • Social media platforms are now widespread, and people use them closely in daily life. As a result, social influencers with large numbers of subscribers, views, and comments have a huge impact on society. Following this trend, many companies actively use influencers for marketing to promote their products and services. In this study, we extract the speech of influencers from game marketing videos and analyze it using various text mining techniques. In the analysis, we distinguish game videos that led to successful marketing from those that led to failed marketing, and we explore and compare the linguistic features of the influencers in the two groups.

A Study on Language Modeling for Korean Legal Text Processing (한국어 법률 텍스트 처리를 위한 언어 모델링 연구)

  • Ye-Jee Kang;Fei Li;Yeon-Ji Jang;Hye-Rin Kang;Seo-Yoon Park;Han-Saem Kim
    • Annual Conference on Human and Language Technology / 2022.10a / pp.300-304 / 2022
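  • This paper fine-tunes three different pre-trained models for Korean legal text processing and evaluates their performance. To evaluate performance, candidate judgment summaries were extracted for each target judgment summary, and the similarity between the summaries was computed. A qualitative evaluation was also conducted to determine whether the summaries extracted on the basis of similarity matched the intuitions of legal experts and general linguists. The legal experts rated the similarity between judgment summaries lower than the general linguists without legal expertise did. Because legal experts judge the similarity of legal texts by criteria different from those of the machine and the general linguists, the results confirm the need to develop Korean legal AI models based on expert consultation. As the final outcome of the study, a Korean legal AI framework is proposed.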

Transformer-based Text Summarization Using Pre-trained Language Model (사전학습 언어 모델을 활용한 트랜스포머 기반 텍스트 요약)

  • Song, Eui-Seok;Kim, Museong;Lee, Yu-Rin;Ahn, Hyunchul;Kim, Namgyu
    • Proceedings of the Korean Society of Computer Information Conference / 2021.07a / pp.395-398 / 2021
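  • As vast amounts of text circulate on the Internet, it has become harder to grasp the key content of information, and research on automatic text summarization has become active. Among the various techniques for automatic summarization, Transformer-based models in particular perform very well on abstractive summarization and achieve the state of the art (SOTA) in the field. However, because Transformer models consist of a very large number of parameters, those parameters cannot be trained sufficiently without enough data, which makes it difficult to generate high-quality summaries. To overcome this limitation, this study proposes a document summarization methodology that can generate high-quality summaries even when only a small amount of data is available. Specifically, the proposed methodology performs summarization by applying the embedding matrix of KoBERT, a Korean pre-trained language model, to a Transformer model. Its effectiveness was evaluated with ROUGE metrics in experiments on the Dacon Korean abstractive document summarization dataset.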

Simple Image Stenography Technology for Large Scale Text (대용량 텍스트를 위한 손실 없는 영상 은닉기술)

  • Rhee, Keun-Moo
    • Annual Conference of KIPS / 2008.05a / pp.1104-1107 / 2008
  • Image and document hiding (steganography) techniques are being studied and used for a variety of purposes across all types of digital data, including document images and audio. Motivated by the need for low-level security of text, this study implements a simple hiding technique that can carry a large amount of text. The text image was first embedded in a color image of 24-bit depth and then restored, and analysis of the result confirmed that the loss ratio of the text image is slight.

An Analysis of Research Trends in Computational Thinking using Text Mining Technique (텍스트 마이닝 기법을 활용한 컴퓨팅 사고력 연구 동향 분석)

  • Lee, Jaeho;Jang, Junhyung
    • Journal of The Korean Association of Information Education / v.23 no.6 / pp.543-550 / 2019
  • In 2006, Jeannette Wing defined computational thinking, and in 2013 SW education was introduced as a formal curriculum in the UK. This study collected research papers related to computational thinking, which has recently grown in importance, and analyzed them using text mining. In the first analysis, CONCOR analysis was conducted with the keyword "computational thinking". In the second, text mining was applied to the components of computational thinking in papers selected from representative domestic and foreign academic journals. The two analyses showed that, first, abstraction, algorithm, data processing, problem decomposition, and pattern recognition were the core of research on the components of computational thinking. Second, research on convergence education centered on computational thinking and the science and mathematics subjects was actively conducted. Third, research on computational thinking has been expanding since 2010. Research and development on the classification and definition of computational thinking and its components, and on applying them in educational settings, should continue steadily.

A Study of Communication Factor in Lunyu (『논어(論語)』의 커뮤니케이션 속성고(屬性考))

  • Lee, Bum-Soo
    • (The)Study of the Eastern Classic / no.36 / pp.85-104 / 2009
  • This study examines the communication factors in Lunyu, treated as a communication text, in terms of communicator, audience, message, communication factors, communication text, and interdisciplinary research. In many respects, it is generally accepted that Lunyu has been a general reference for Oriental culture. Lunyu considers ethics, logic, and practicability to be the qualifying requirements of a communicator, asserting that communicators should speak truthfully, as a "chuntzu"(君子) does, and should also put their words into practice. The attitude and method that Lunyu asks of the audience are that hearers should have sharp ears for language, selectively hear the right language, and use language suited to the situation. It also emphasizes that the hearer should take an active lead in transactional communication. In Lunyu, one property of the message is that language, which determines the rise and fall of a nation and is also the basis for judging other people, should comply with ethics and reason and should also be put into practice. In other words, a credible message, as the practice of language, is a practical requirement of ethics and the qualification of a "chuntzu"(君子, superior man) in ruling the nation or conducting one's life.

Skew Compensation and Text Extraction of The Traffic Sign in Natural Scenes (자연영상에서 교통 표지판의 기울기 보정 및 덱스트 추출)

  • Choi Gyu-Dam;Kim Sung-Dong;Choi Ki-Ho
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.3 no.2 s.5 / pp.19-28 / 2004
  • This paper shows how to compensate for the skew of a traffic sign in a natural image and extract its text. The research deals with the processing of the array image. The whole process comprises four steps. In the first part, we perform preprocessing and Canny edge extraction on the natural image. In the second part, we perform preprocessing and postprocessing for the Hough Transform in order to extract the skew angle. In the third part, we remove noise images and complex lines, and then extract the candidate region using the features of the text. In the last part, after performing local binarization in the extracted candidate region, we extract the text by using the differences in features between text and non-text to filter out the unnecessary non-text. In an experiment on 100 natural images that include traffic signs, the method achieved an 82.54 percent text extraction rate and a 79.69 percent extraction accuracy, which is more accurate than existing methods such as those using RLS (Run Length Smoothing) or the Fourier Transform. The method also achieved a 94.5 percent extraction rate for the skew angle, an improvement of 26 percent over using the Hough Transform alone. The research can be applied to providing location information for walking aid systems for the blind or to the operation of driverless vehicles.

A Cognitive Pragmatic Approach to Contextual Effects In Modern Korean Poetry (한국 현대시 텍스트의 맥락 효과에 관한 인지 화용론적 연구)

  • Hyonho Lee
    • Korean Journal of Cognitive Science / v.4 no.2 / pp.5-28 / 1994
  • In this thesis we attempt to analyze modern Korean poetic texts in the frameworks of text linguistics and cognitive pragmatics. Both frameworks describe and explain human verbal communication in terms of cognitive information-processing procedures. By utilizing the analytical devices provided by the seven standards of textuality, we can analyze any type of text, especially in terms of the cognitive operations underlying the production and reception processes. The cognitive pragmatic framework claims that human ostensive-inferential communication is regulated by the Principle of Relevance. We claim that the relevance-based framework of pragmatics provides evidence and rationale for the cognitive operations identified in the text linguistic framework. Poetic texts involve every kind of cognitive strategy and processing procedure underlying human verbal communication. So, if modern Korean poetic texts are satisfactorily analyzed by text linguistics and cognitive pragmatics, it means that both frameworks are very useful tools for analyzing texts and that all other text types, which are less complicated than poetic texts, can also be analyzed by these frameworks. Researchers of poetry, and poets, are sensitive to poetic effects. They feel more poeticity while reading poetic texts than ordinary readers do. However, these researchers or poets sometimes give different interpretations of a single poetic text. The interpretation of poetry cannot be arbitrary, because poets write poems with particular intentions and do not just throw them out to be interpreted at random. This thesis suggests that the poeticity felt by the reader can be described and accounted for in a scientific way. In other words, text linguistics and cognitive pragmatics enable researchers of poetry to become objective in interpreting poetic texts. It will be clearly shown that we have to see poetic texts from a cognitive perspective, since they are by-products of cognitive processing performed by discourse participants.