• Title/Summary/Keyword: 텍스트 연구

Search Result 3,492, Processing Time 0.024 seconds

Similar Contents Recommendation Model Based On Contents Meta Data Using Language Model (언어모델을 활용한 콘텐츠 메타 데이터 기반 유사 콘텐츠 추천 모델)

  • Donghwan Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.27-40
    • /
    • 2023
  • With the increase in the spread of smart devices and the impact of COVID-19, the consumption of media contents through smart devices has significantly increased. Along with this trend, the amount of media contents viewed through OTT platforms is increasing, that makes contents recommendations on these platforms more important. Previous contents-based recommendation researches have mostly utilized metadata that describes the characteristics of the contents, with a shortage of researches that utilize the contents' own descriptive metadata. In this paper, various text data including titles and synopses that describe the contents were used to recommend similar contents. KLUE-RoBERTa-large, a Korean language model with excellent performance, was used to train the model on the text data. A dataset of over 20,000 contents metadata including titles, synopses, composite genres, directors, actors, and hash tags information was used as training data. To enter the various text features into the language model, the features were concatenated using special tokens that indicate each feature. The test set was designed to promote the relative and objective nature of the model's similarity classification ability by using the three contents comparison method and applying multiple inspections to label the test set. Genres classification and hash tag classification prediction tasks were used to fine-tune the embeddings for the contents meta text data. As a result, the hash tag classification model showed an accuracy of over 90% based on the similarity test set, which was more than 9% better than the baseline language model. Through hash tag classification training, it was found that the language model's ability to classify similar contents was improved, which demonstrated the value of using a language model for the contents-based filtering.

Reality Strategies in Fantasy and Narrative Infections -Fiction Vampire and Movie The Grand Budapest Hotel (판타지의 리얼리티 전략과 서사적 감염 -소설 <흡혈귀>와 영화 <그랜드부다페스트 호텔>을 중심으로)

  • Choi, Sung-Min
    • Journal of Popular Narrative
    • /
    • v.25 no.4
    • /
    • pp.397-428
    • /
    • 2019
  • Fantasy emerges from the cracks and crevices of rational reality. Italo Calvino says, "Fantasy is possible when the reader stays at a certain distance without falling into the text." Fantasy does not form farthest from reality. It comes from the confusion between reality and fiction. In short, fantasy does not exist on the contrary of reality, but on the boundary of reality. Reality and fantasy are also structurally intertwined. We can't distinguish the reality from fantasy clearly. In fact, in this case, the reader or audience is confused about whether what I see is real or not. Todorov calls this case "hesitation." Hesitation is a key element of fantasy. Two texts that expressed "hesitation" are Kim Young-ha's short novel Vampire (1997) and Wes Anderson's film The Grand Budapest Hotel (2014). On the surface, these two texts seem to have nothing to do with narrative structural similarities. And both also arouse readers' and audiences' interest by letting confuse reality to fantasy. In Kim Young-ha's Vampire, we can look at the process of confusion of reality called "narrative infection" when a text is read to the reader. In the movie The Grand Budapest Hotel, we can find a strategy to make an unreal story feel like a fact in history. And we can also find a process in which the success stories of alienated characters become reality through 'solidarity' in the film. This paper is a study of how fantasy creates "reality", makes readers feel fantasy, and how it spreads through these two texts.

Between a Historical Subject and a Novel Subject -Reading The Song of sword based on the Logic of Choice, Transformation, and Exclusion (역사적 인간과 소설적 인간의 사이 -선택, 변형, 배제의 논리로 읽는 『칼의 노래』)

  • Kim, Won-Kyu
    • Journal of Popular Narrative
    • /
    • v.25 no.3
    • /
    • pp.103-141
    • /
    • 2019
  • The purpose of this paper is to examine the logic of choice, transformation, and exclusion in The Song of sword, comparing it with the historical records. This paper explains how a novel is 'produced'. Through this, it searches for the aspects in which The Song of sword changed into 'a narrative revealing the disillusionment of the novel's subject with the world'. In the logic of choice, it explores which time and space were chosen in the novel, and which character was chosen to prepare the content and formal framework of the novel. In the logic of transformation, it is confirmed that the meaning of 'individual' is highlighted in the novel, unlike the historical records, by transforming both the character of the enemy and the meaning of war. In the logic of exclusion, it studies the characteristics of the modern (novel's) subject in the novel by excluding the characteristics of the historical subject that existed in a particular time and space. This paper differs from previous studies in that it examines the way in which a novel is produced by comparing and analyzing The Song of sword based on the historical records. Through these analyses, we can see the unity of various heterogeneous elements, such as the historical reality, the writer's ideology and imagination, and the desire of the contemporary in the form of a novel. Also, by examining the elements of text that can not be sutured into a complete form, we can see the meaning of the novel's text as an unstable system.

Typography for Efficient Visual Flow of Text Focused on Hangul (텍스트의 효율적 시각흐름을 위한 타이포그래피-한글을 중심으로-)

  • 신경주;김지현
    • Archives of design research
    • /
    • v.11 no.3
    • /
    • pp.187-196
    • /
    • 1998
  • This study is intended to suggest the method of text arrangement in order to enhance visual perception which would help darify its communication. One hundred subjects without restriction of gender and profession participated in each experiment :their reading time was measured by the 0.01 second. The Analysis of Variance(two-way ANOVA without interaction) was performed for each experiment and the p value was 0.0001 which implies that there was a strong consistency among test results. Based on the first results, it is found that there is a consistent relationship between type size and text line length, and the following discovery was made ; the most effective ratio of type size to line length is approximately 1 :8. Judging from the Second and Third results, it seems that the vertical text arrangement is most efficient for reading regardless of text line length. So to make same rreading direction is more important than to narrow down the eye moving distance between column and column for efficient visual flow. This research supports the view that considering efficient eye movement on text, it is important to understand the mentioned variables that affect visual interpretation.

  • PDF

Hypertext Model Extension and Dynamic Server Allocation for Database Gateway in Web Database Systems (웹 데이타베이스에서 하이퍼텍스트 모델 확장 및 데이타베이스 게이트웨이의 동적 서버 할당)

  • Shin, Pan-Seop;Kim, Sung-Wan;Lim, Hae-Chull
    • Journal of KIISE:Databases
    • /
    • v.27 no.2
    • /
    • pp.227-237
    • /
    • 2000
  • A Web database System is a large-scaled multimedia application system that has multimedia processing facilities and cooperates with relational/Object-Oriented DBMS. Conventional hypertext modeling methods and DB gateway have limitations for Web database because of their restricted versatile presentation abilities and inefficient concurrency control caused by bottleneck in cooperation processing. Thus, we suggest a Dynamic Navigation Model & Virtual Graph Structure. The Dynamic Navigation Model supports implicit query processing and dynamic creation of navigation spaces, and introduce node-link creation rule considering navigation styles. We propose a mapping methodology between the suggested hypertext model and the relational data model, and suggest a dynamic allocation scheduling technique for query processing server based on weighted value. We show that the proposed technique enhances the retrieval performance of Web database systems in processing complex queries concurrently.

  • PDF

Ontology and Text Mining-based Advanced Historical People Finding Service (온톨로지와 텍스트 마이닝 기반 지능형 역사인물 검색 서비스)

  • Jeong, Do-Heon;Hwang, Myunggwon;Cho, Minhee;Jung, Hanmin;Yoon, Soyoung;Kim, Kyungsun;Kim, Pyung
    • Journal of Internet Computing and Services
    • /
    • v.13 no.5
    • /
    • pp.33-43
    • /
    • 2012
  • Semantic web is utilized to construct advanced information service by using semantic relationships between entities. Text mining can be applied to generate semantic relationships from unstructured data resources. In this study, ontology schema guideline, ontology instance generation, disambiguation of same name by text mining and advanced historical people finding service by reasoning have been proposed. Various relationships between historical event, organization, people, which are created by domain experts, are linked to literatures of National Institute of Korean History (NIKH). It improves the effectiveness of user access and proposes advanced people finding service based on relationships. In order to distinguish between people with the same name, we compares the structure and edge, nodes of personal social network. To provide additional information, external resources including thesaurus and web are linked to all of internal related resources as well.

Development of On-line Judge System based on Block Programming Environment (블록 프로그래밍 환경 기반 온라인 평가 시스템 개발)

  • Shim, Jaekwoun;Chae, Jeong Min
    • The Journal of Korean Association of Computer Education
    • /
    • v.21 no.4
    • /
    • pp.1-10
    • /
    • 2018
  • Block programming environment, which is represented by Scratch in elementary and middle school programming education, is suitable for learner's characteristics and cognitive level, and is recommended not only for beginners. Transference to the text programming environment after the block programming is essential for understanding the data processing process, understanding the accuracy and efficiency aspects of algorithms, and creating SW activity. In addition, it is presented step by step in the programming curriculum. In this study, developed WithBlock the online evaluation system for the purpose of transference from a block programming to a text programming environment. The developed system can solve the same algorithm problem in both block and text programming environment, and it can be used for elementary and secondary programming education by automatically scoring the written code and providing immediate feedback. In order to applicable to programming education in elementary and secondary surveyed the usability, learning possibility, interest and satisfaction of WithBlock. The results of the survey showed that it can be used for programming education.

Latent class model for mixed variables with applications to text data (혼합모드 잠재범주모형을 통한 텍스트 자료의 분석)

  • Shin, Hyun Soo;Seo, Byungtae
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.6
    • /
    • pp.837-849
    • /
    • 2019
  • Latent class models (LCM) are useful tools to draw hidden information from categorical data. This model can also be interpreted as a mixture model with multinomial component distributions. In some cases, however, an available dataset may contain both categorical and count or continuous data. For such cases, we can extend the LCM to a mixture model with both multinomial and other component distributions such as normal and Poisson distributions. In this paper, we consider a LCM for the data containing categorical and count data to analyze the Drug Review dataset which contains categorical responses and text review. From this data analysis, we show that we can obtain more specific hidden inforamtion than those from the LCM only with categorical responses.

Use of Text Processing Technologies in a Semantic Web Application (시맨틱 웹 응용 서비스에서의 텍스트 처리 기술 적용)

  • Jung, Han-Min;Kang, In-Su;Koo, Hee-Kwan;Lee, Seung-Woo;Kim, Pyung;Sung, Won-Kyung
    • Annual Conference on Human and Language Technology
    • /
    • 2006.10e
    • /
    • pp.189-196
    • /
    • 2006
  • 본 논문은 시맨틱 웹 응용 서비스를 구현함에 있어 필수적으로 요구되는 온톨로지 인스턴스 구축을 효율적으로 처리하는 데 있어 텍스트 처리 기술이 어떤 역할을 수행할 수 있는 가를 $OntoFrame-K^{(R)}$라는 시맨틱 웹 기반 정보 유통 체계에의 적용 사례를 통해 살펴본다. 본 논문에서 소개하는 텍스트 처리 기술은 개체 확인물 통한 개념 사례화, 주제 분야 할당을 통한 메타데이터 확장에, 그리고 인용 정보 추출 및 인용 관계 구축을 통한 객체 관계속성 구축에 적용된다. 개체 확인에서는 메타데이터 비교 잊 병합을 사용하였으며 이를 기반으로 한 수작업 구축을 통해 8,543명의 인력 URI를 확보하였다. 주제 및 분야 할당에서는 색인어와 분야분류명이 매핑된 시소러스 개념어의 매칭을 통해 색인어 별 TF (Term Frequency), 색인어와 매칭된 개념어 별 TF, 색인어와 매칭된 개념어 별 시소러스에서의 깊이, 색인어와 매칭된 개념어 별 개념 패싯, 색인어와 매칭된 각 개념어에 부착된 분야분류명 목록 등 할당을 위한 다양한 자질을 확보 적용하였다. 인용 정보 추출과 인용 관계 구축에서는 객체 URI와 인력 URI를 기반으로 하여 자동 추출된 인용 정보를 반영하는 방식으로 7,237개 문헌으로부터 총 135개의 인용 네트워크 그룹을 자동으로 확보하였다. 본 연구를 통해 제시된 텍스트 처리 기술의 활용 방안이 향후 시맨틱 웹 응용 서비스 및 인프라 구현에서 다각적으로 활용될 수 있기를 기대한다.

  • PDF

A Study on the Research Trends in Supply Chain Management in Korea using Network Text Analysis (공급사슬관리 국내연구동향 분석: 네트워크 분석을 활용하여)

  • Rha, Jin Sung
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.1
    • /
    • pp.41-53
    • /
    • 2020
  • Supply chain management (SCM) became a critical success factor for firms. As a result, researchers have carried out related research on SCM. This study aims to explore the research trends in SCM in Korea using network text analysis. We collected the information of 586 articles published in Korean journals using the RISS database, and analyzed the network generated by keywords proposed in the articles. The results showed that there are five research keyword clusters such as logistics, information systems, partnership, risk management, and sustainability.