• Title/Summary/Keyword: text research (텍스트 연구)

R&D Redundancy and Similarity Check System (클라우드 기반 R&D 연구 보고서 문서표절 및 유사도 검출 시스템)

  • Shin, Hyojoung;Park, Kiheung;Haing, Huhduck
    • Proceedings of the Korean Society of Computer Information Conference / 2016.01a / pp.31-32 / 2016
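  • As government support for R&D has grown, technology research is being actively pursued nationwide, but budget execution has exposed cases in which time and money are wasted on duplicated R&D projects. Solving this problem requires fundamental reform, such as preventing duplicate research topics during the selection of government R&D projects. This paper proposes a cloud-based system that detects plagiarism and similarity in R&D research reports, incorporating data analysis technologies such as text mining and big data analytics (Hadoop, Amazon Web Services). The system is offered as SaaS-style "on-demand software" and can be used with nothing more than web access.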
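
The abstract does not state which similarity measure the system uses. As a rough illustration of document similarity detection in general, the sketch below compares two texts by the Jaccard overlap of their word n-gram "shingles". The function names and the shingle size of 3 are assumptions made for this example, not details from the paper.

```python
def shingles(text, n=3):
    """Return the set of word n-grams (shingles) found in text."""
    words = text.lower().split()
    if len(words) < n:
        # Texts shorter than n words yield a single short shingle (or none).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard_similarity(doc_a, doc_b, n=3):
    """Jaccard similarity of two shingle sets: |A & B| / |A | B|."""
    a, b = shingles(doc_a, n), shingles(doc_b, n)
    if not a and not b:
        return 0.0  # Two empty documents: report no measurable overlap.
    return len(a & b) / len(a | b)
```

A score near 1.0 suggests heavily overlapping wording, while a score near 0.0 suggests little shared phrasing. A production system would likely add language-specific tokenization and a scalable index.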

Automatic Background Keyword of Movie Extraction Method from Media Reviews (미디어 리뷰를 이용한 영화 배경 키워드 자동 추출 기법)

  • Kim, Hyung W.;Cho, Joonmyun;Yoo, Jeongju
    • Annual Conference of KIPS / 2013.11a / pp.1149-1151 / 2013
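  • This study proposes a method for automatically extracting keywords that correspond to the (spatial/temporal) background of movie content. The proposed method consists of three stages: collecting review text for movie content from the web; preprocessing the collected reviews through morphological analysis and named entity recognition; and finally selecting the words that correspond to the background using statistical techniques. The accuracy of the automatically extracted background information was measured through user evaluation, and the automatically generated background information is expected to be useful in many ways, such as for searching and recommending movie content.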

Tourism Information Contents and Text Networking (Focused on Formal Website of Jeju and Chinese Personal Blogs) (온라인 관광정보의 내용 및 텍스트 네트워크 (제주 공식 웹사이트와 중국 개인블로그를 중심으로))

  • Zhang, Lin;Yun, Hee Jeong
    • The Journal of the Korea Contents Association / v.18 no.1 / pp.19-30 / 2018
  • The main purpose of this study is to analyze the contents and text network of online tourism information. Jeju Island, one of the representative tourist destinations in South Korea, is selected as the study site. The study collects content from both the official Jeju tourism website and personal blogs on Sina Weibo, one of the most popular social network services in China, and analyzes this online text using ROST Content Mining System, a Chinese big data mining system. The content analysis shows that the official Jeju website mainly includes nouns related to natural, geographical and physical resources, verbs related to the existence of resources, and adjectives related to the beauty, cleanness and convenience of resources. Personal blogs, by contrast, mainly include nouns related to the Korean Wave, food, local products, other destinations and shopping, verbs related to activities and feelings in Jeju, and adjectives related to the bloggers' experiences and feelings. Finally, the text network results show strong centrality and network structure in the online tourism information on the official website, but weak relationships in the personal blogs. These results may contribute to the development of demand-based marketing strategies for tourist destinations.

Topic change monitoring study based on Blue House national petition using a control chart (관리도를 활용한 국민청원 토픽 모니터링 연구)

  • Lee, Heeyeon;Choi, Jieun;Lee, Sungim;Son, Won
    • The Korean Journal of Applied Statistics / v.34 no.5 / pp.795-806 / 2021
  • As the volume of text data from online channels has grown, so has interest in research that summarizes and analyzes it. One of the fundamental analyses of text data is the extraction of latent topics. A researcher could read all the data and summarize the contents one by one, but this is impractical for large amounts of data. Blei and Lafferty (2007) and Blei et al. (2003) proposed topic modeling methods for extracting topics using a statistical model. Since text data is generally collected over time, it is worthwhile to monitor how topics change. In this study, we propose a topic index based on the results of the topic model. In addition, a control chart, a representative tool for statistical process control, is applied to monitor the topic index over time. As a practical example, we use text data collected from the Blue House National Petition boards between March 5, 2018, and March 5, 2020.
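
The abstract does not specify the topic index or the type of control chart used. As a minimal sketch of the general idea, the example below applies a basic Shewhart-style chart to a time series of topic index values: limits are estimated from a baseline period, and later observations outside the limits are flagged. The function names, the 3-sigma multiplier, and the use of a separate baseline period are assumptions made for this illustration.

```python
from statistics import mean, stdev


def control_limits(baseline, k=3.0):
    """Return (lower limit, center line, upper limit) from baseline values."""
    center = mean(baseline)
    spread = stdev(baseline)  # Sample standard deviation of the baseline.
    return center - k * spread, center, center + k * spread


def out_of_control(series, baseline, k=3.0):
    """Return indices in series whose values fall outside the control limits."""
    lower, _, upper = control_limits(baseline, k)
    return [i for i, x in enumerate(series) if x < lower or x > upper]
```

Here each value could be, for example, the weekly share of petitions assigned to one topic; a flagged index would then mark a week in which that topic's share shifted unusually.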

A Study on the Analysis of Park User Experiences in Phase 1 and 2 Korea's New Towns with Blog Text Data (블로그 텍스트 데이터를 활용한 1, 2기 신도시 공원의 이용자 경험 분석 연구)

  • Sim, Jooyoung;Lee, Minsoo;Choi, Hyeyoung
    • Journal of the Korean Institute of Landscape Architecture / v.52 no.3 / pp.89-102 / 2024
  • This study aims to examine the characteristics of the user experience of New Town neighborhood parks and explore issues that diversify the experience of the parks. In order to quantitatively analyze a large amount of park visitors' experiences, text-based Naver blog reviews were collected and analyzed. Among the Phase 1 and 2 New Towns, the park with the most user experience postings in each city was selected as the target of analysis. Blog text data was collected from May 20, 2003, to May 31, 2022, and the analysis targeted Ilsan Lake Park, Bundang Yuldong Park, Gwanggyo Lake Park, and Dongtan Lake Park. First, the findings revealed that all four parks were used for everyday relaxation and recreation. Second, the analysis underscores the parks' diverse user groups. Third, the programs for parks nearby were also related to park usage. Fourth, the words within the top 20 rankings represented distinctive park elements or content/programs specific to each park. Lastly, the network analysis delineated four overarching types of park users, and the networks of these four user types differed from park to park. This study has two implications. First, in addition to naturalistic characteristics, differentiating each park's unique facilities and programs greatly improves public awareness and enriches the individual park experience. Second, if the context surrounding a park is analyzed using spatial information in addition to text analysis, the interpretation of text analysis results could become more accurate. The results of this study can be used in the planning and design of parks and green spaces in the Phase 3 New Towns currently in progress.

Building a Philosophy Ontology based on Content of Texts and its Application to Learning (텍스트 내용 기반의 철학 온톨로지 구축 및 교육에의 응용)

  • Chung, Hyun-Sook;Choi, Byung-Il
    • Journal of The Korean Association of Information Education / v.9 no.2 / pp.257-270 / 2005
  • Researchers in the humanities, including philosophy, acquire knowledge by understanding texts. They spend a lot of time and effort retrieving, reading and understanding the many texts relevant to their research fields using metadata-based text retrieval systems. In this paper, we develop a philosophy ontology that enables researchers to retrieve knowledge from the content of philosophical texts. Our philosophy ontology includes concepts and their hierarchical and associative relationships, as defined by philosophy researchers. We propose a methodology for constructing a text-based ontology that comprises three phases and fourteen steps. This methodology may be used to construct other ontologies for learning. We also present a case study in which our philosophy ontology is applied to acquire and exchange knowledge of philosophy between a professor and students during philosophy classes.

Web Accessibility Evaluation of Internet Shopping Malls and Development of Alternative Text Rate Improvement Tool (인터넷 쇼핑몰 웹접근성 평가 및 대체 텍스트율 향상 방안 구현)

  • Lim, Kyeng Gyu;Lee, Goo Yeon;Kim, Hwa Jong
    • Journal of Digital Contents Society / v.19 no.3 / pp.537-546 / 2018
  • In this paper, we study how to improve the web accessibility of Korean Internet shopping mall websites. First, we analyze the Korean web accessibility criteria, and then evaluate the web accessibility level of major Internet shopping mall websites in Korea. Based on this evaluation, we propose and implement an alternative text enhancement method using Excel VBA to increase the rate of alternative text and thereby improve web accessibility. With the proposed method, even people who are not web programming specialists can check and modify the alternative text of images included in web pages, which can help improve the web accessibility compliance rate.
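
The paper's tool is implemented in Excel VBA and its exact definition of the alternative text rate is not given in the abstract. As a hedged illustration of the kind of check involved, the sketch below uses Python's standard `html.parser` module to compute the share of `<img>` tags that carry a non-empty `alt` attribute. The class and function names are hypothetical, and treating an empty `alt=""` as missing is a simplifying assumption (accessibility guidelines permit empty alt text for purely decorative images).

```python
from html.parser import HTMLParser


class AltTextCounter(HTMLParser):
    """Count <img> tags and how many of them have non-empty alt text."""

    def __init__(self):
        super().__init__()
        self.total = 0
        self.with_alt = 0

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        self.total += 1
        alt = dict(attrs).get("alt")
        # Assumption: a missing, valueless, or blank alt counts as absent.
        if alt and alt.strip():
            self.with_alt += 1


def alt_text_rate(html):
    """Return the fraction of images with alt text (1.0 if no images)."""
    counter = AltTextCounter()
    counter.feed(html)
    if counter.total == 0:
        return 1.0  # No images, so nothing is missing alt text.
    return counter.with_alt / counter.total
```

Running `alt_text_rate` over the product pages of a site would give a per-page figure that can be tracked before and after alternative text is added.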

Study on Impact of GUI Design Elements of Mobile Phone on Brand Preference With focus on senior citizens (모바일 폰의 GUI 디자인의 구성요소가 브랜드 선호도에 미치는 영향 실버세대를 중심으로)

  • Kim, Young Seok
    • Science of Emotion and Sensibility / v.16 no.4 / pp.545-556 / 2013
  • This study defines the GUI design elements of mobile phones as color, text, layout, graphic icon, and video, in order to explore the relationship between these elements and brand preference among senior citizens. To this end, a model and hypotheses were established and tested through multiple regression analysis. The findings are as follows. First, when statistical significance was examined for each GUI design element, color, text, layout, graphic icon, and video were all statistically significant at the given significance level, indicating that all of these elements affect brand preference. Second, the relative influence of the GUI design elements on brand preference was, in descending order: text, color, video, graphic icon, layout. This indicates that boosting senior citizens' brand preference for mobile phones requires considering 'text' and 'color' before any other element. In addition, as the influence of 'text' and 'color' becomes greater, brand preference also becomes higher.

Relationship between Images and Text in the Visual Paradox -Focusing on Case Studies of Volkswagen Ads- (시각적 패러독스에서 이미지와 텍스트의 상관관계 -폭스바겐 광고 사례의 분석을 중심으로-)

  • Kim, Jin-Gon;Park, Young-Won
    • The Journal of the Korea Contents Association / v.12 no.1 / pp.176-184 / 2012
  • People are exposed to various media. Since the Digital Revolution, the quantity of media has expanded at a rapid pace. Because of this expansion, advertising needs to make an effort to induce a reaction from audiences, and rhetorical devices are used for this purpose. Among rhetorical devices, this study focuses on the visual paradox, because it is an effective representational device that induces audience reaction through deliberate contradiction and ambiguity. The study defines the visual paradox based on the definition and classification of paradox in logic. It also attempts to reveal the relationship between images and text in signification through metalanguage, because this relationship is important to the visual paradox in advertising. Cases of Volkswagen ads are analyzed to validate the research process. Finally, the study finds that images and text interact to create a new meaning.

A Content Analysis of Journal Articles Using the Language Network Analysis Methods (언어 네트워크 분석 방법을 활용한 학술논문의 내용분석)

  • Lee, Soo-Sang
    • Journal of the Korean Society for Information Management / v.31 no.4 / pp.49-68 / 2014
  • The purpose of this study is to perform a content analysis of research articles in Korea that use the language network analysis method, and to identify the essential features of that method. Six analytical categories are used for the content analysis: types of language text, methods of keyword selection, methods of forming co-occurrence relations, methods of constructing networks, network analytic tools, and indexes. The content analysis revealed the following features. The major types of language text are research articles and interview texts. Keywords were selected from words extracted from the text content. Co-occurrence relations between keywords were formed using co-occurrence counts. The constructed networks are multiple-type networks rather than single-type ones. Network analytic tools such as NetMiner, UCINET/NetDraw, NodeXL, and Pajek are used. The major analytic indexes include density, centralities, sub-networks, etc. These features can serve as a basis for the language network analysis method.
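
The steps named in the abstract (keyword selection, co-occurrence counting, network construction, and indexes such as density and centrality) can be sketched with the standard library alone. The example below is an illustrative assumption, not the procedure of any reviewed article or of tools such as NetMiner or Pajek: it treats each text unit as a list of keywords, counts how often each keyword pair appears together, and computes density and degree centrality on the resulting undirected network. All function names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(docs):
    """Count, for each keyword pair, the number of text units containing both."""
    counts = Counter()
    for keywords in docs:
        # Sort and deduplicate so each pair has one canonical (a, b) form.
        for a, b in combinations(sorted(set(keywords)), 2):
            counts[(a, b)] += 1
    return counts


def density(counts):
    """Share of possible edges that exist, over keywords with at least one link."""
    nodes = {word for pair in counts for word in pair}
    n = len(nodes)
    if n < 2:
        return 0.0
    return len(counts) / (n * (n - 1) / 2)


def degree_centrality(counts):
    """Number of distinct neighbors of each keyword, divided by (n - 1)."""
    degree = Counter()
    for a, b in counts:
        degree[a] += 1
        degree[b] += 1
    n = len(degree)
    return {word: d / (n - 1) for word, d in degree.items()} if n > 1 else {}
```

Note that keywords appearing only alone in a text unit never enter the network in this sketch, so density and centrality are computed over connected keywords only.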