• Title/Summary/Keyword: 키워드분석

Search Result 1,720, Processing Time 0.031 seconds

A Corpus Analysis of British-American Children's Adventure Novels: Treasure Island (영미 아동 모험 소설에 관한 코퍼스 분석 연구: 『보물섬』을 중심으로)

  • Choi, Eunsaem;Jung, Chae Kwan
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.1
    • /
    • pp.333-342
    • /
    • 2021
  • In this study, we analyzed the vocabulary, lemmas, keywords, and n-grams in 『Treasure Island』 to identify certain linguistic features of this British-American children's adventure novel. The current study found that, contrary to the popular claim that frequently-used words are important and essential to a story, the set of frequently-used words in 『Treasure Island』 were mostly function words and proper nouns that were not directly related to the plot found in 『Treasure Island』. We also ascertained that a list of keywords using a statistical method making use of a corpus program was not good enough to surmise the story of 『Treasure Island』. However, we managed to extract 30 keywords through the first quantitative keyword analysis and then a second qualitative keyword analysis. We also carried out a series of n-gram analyses and were able to discover lexical bundles that were preferred and frequently used by the author of 『Treasure Island』. We hope that the results of this study will help spread this knowledge among British-American children's literature as well as to further put forward corpus stylistic theory.

Web Site Keyword Selection Method by Considering Semantic Similarity Based on Word2Vec (Word2Vec 기반의 의미적 유사도를 고려한 웹사이트 키워드 선택 기법)

  • Lee, Donghun;Kim, Kwanho
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.2
    • /
    • pp.83-96
    • /
    • 2018
  • Extracting keywords representing documents is very important because it can be used for automated services such as document search, classification, recommendation system as well as quickly transmitting document information. However, when extracting keywords based on the frequency of words appearing in a web site documents and graph algorithms based on the co-occurrence of words, the problem of containing various words that are not related to the topic potentially in the web page structure, There is a difficulty in extracting the semantic keyword due to the limit of the performance of the Korean tokenizer. In this paper, we propose a method to select candidate keywords based on semantic similarity, and solve the problem that semantic keyword can not be extracted and the accuracy of Korean tokenizer analysis is poor. Finally, we use the technique of extracting final semantic keywords through filtering process to remove inconsistent keywords. Experimental results through real web pages of small business show that the performance of the proposed method is improved by 34.52% over the statistical similarity based keyword selection technique. Therefore, it is confirmed that the performance of extracting keywords from documents is improved by considering semantic similarity between words and removing inconsistent keywords.

Analysis of ICT Education Trends using Keyword Occurrence Frequency Analysis and CONCOR Technique (키워드 출현 빈도 분석과 CONCOR 기법을 이용한 ICT 교육 동향 분석)

  • Youngseok Lee
    • Journal of Industrial Convergence
    • /
    • v.21 no.1
    • /
    • pp.187-192
    • /
    • 2023
  • In this study, trends in ICT education were investigated by analyzing the frequency of appearance of keywords related to machine learning and using conversion of iteration correction(CONCOR) techniques. A total of 304 papers from 2018 to the present published in registered sites were searched on Google Scalar using "ICT education" as the keyword, and 60 papers pertaining to ICT education were selected based on a systematic literature review. Subsequently, keywords were extracted based on the title and summary of the paper. For word frequency and indicator data, 49 keywords with high appearance frequency were extracted by analyzing frequency, via the term frequency-inverse document frequency technique in natural language processing, and words with simultaneous appearance frequency. The relationship degree was verified by analyzing the connection structure and centrality of the connection degree between words, and a cluster composed of words with similarity was derived via CONCOR analysis. First, "education," "research," "result," "utilization," and "analysis" were analyzed as main keywords. Second, by analyzing an N-GRAM network graph with "education" as the keyword, "curriculum" and "utilization" were shown to exhibit the highest correlation level. Third, by conducting a cluster analysis with "education" as the keyword, five groups were formed: "curriculum," "programming," "student," "improvement," and "information." These results indicate that practical research necessary for ICT education can be conducted by analyzing ICT education trends and identifying trends.

Global Research Trends on Geospatial Information by Keyword Network Analysis (키워드 네트워크 분석을 이용한 지리공간정보의 글로벌 연구 동향 분석)

  • Kim, Byeongsun;Jeong, Minwoo;Jeon, Sangeum;Shin, Dongbin
    • Spatial Information Research
    • /
    • v.23 no.1
    • /
    • pp.69-77
    • /
    • 2015
  • The aim of this study is to examine the research trends of global scientific production of Geospatial Information (GI) papers from 1998 to 2013 by using keyword network analysis. This study constructed keyword network model through papers and keywords related to GI research retrieved from the Web of Science DB and performed keyword network analysis such as Degree Centrality, Betweenness Centrality, and Closeness Centrality. The results show that GI has been steadily applied to various fields, and also the research trends of GI techniques could be quantitatively characterized through keyword network analysis. This study result can be applied to establish the policies and the national R&D planning of geospatial information.

Keyword Extraction for Korean Language Q&A (국어정보 질의응답을 위한 키워드 추출)

  • Jong, Jong-Seok;Lee, Su-In;Lee, Hyun-A
    • Annual Conference on Human and Language Technology
    • /
    • 2015.10a
    • /
    • pp.213-215
    • /
    • 2015
  • 국립국어원 온라인가나다에서 제공되는 질의응답 문서를 이용한 국어정보에 대한 Q&A시스템은 언어 자체에 대한 질문과 답변의 특성으로 조사나 어미로 끝나는 표현이 주어로 등장하는 등의 특이한 문장이 자주 나타난다. 이러한 이유로 형태소 분석을 거쳐 명사를 키워드로 추출하는 일반적인 키워드 추출 방식은 좋은 성능을 얻기 어렵다. 본 논문에서는 국어정보 질의응답 문서의 특징에 맞는 키워드 추출 방법을 제안한다. 제안하는 방식에서는 문장 단위로 분할된 결과에서 연결어미로 문장을 추가로 분할한 뒤에 조사 앞에 나타나는 단어열을 키워드로 추출한다. 덧붙여 다자비교형 질의에서의 키워드 추출을 위해 편집거리를 이용한 키워드 추출 방법을 제안한다.

  • PDF

A Study on Personalized Mobile Web News Contents Creation using Keyword Analysis (키워드 분석을 이용한 개인화 모바일 웹 뉴스 컨텐츠 생성에 관한 연구)

  • Han, Seugn-Hyun;Lim, Young-Hwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.3
    • /
    • pp.277-285
    • /
    • 2007
  • This research proposes a personalized mobile web contents creation method that uses web news channel contents-based analysis. It promptly acquires data through the RSS and RSS-linked web pages which have been supplied by the existing web sites for a news search. And then It applies a personalization method using analysis in contents filtering and generation. The proposed method will make creating mobile web contents easier while lowering wireless contents production costs. Moreover, It can be improved a user satisfaction for contents filtering and access with using analysis that fits in with a matter of user's specific interest.

  • PDF

Keyword Network Analysis of Trends in Research on Climate Change Education (키워드 네트워크 분석을 활용한 기후변화 교육 관련 연구동향 분석)

  • Kim, Soon Shik;Lee, Sang Gyun
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.13 no.3
    • /
    • pp.226-237
    • /
    • 2020
  • The purpose of the research is to analyze research trends related to climate change education by network analysis based on keywords extracted from the research title. For this purpose, 62 papers were selected from Korean Citation Index(KCI) journals published from 2011 to 2020 using such keywords as "climate change" and "climate change education" in the Research Information Sharing Service. The analysis procedure consisted of selection of analysis papers, keyword extraction and purification, and keyword network analysis and visualization. Textom, Ucinet 6.0, and NetDraw were used to analyze the frequency, degree centrality, and betweenness centrality. The results of the research showed that, first, Early 'Energy and Climate Change Education' had the highest frequency of papers examining climate change education. Second, the keywords/phrases that appeared most frequently in research on climate change education were "program" "energy," "analysis," "elementary school," "elementary school," "elementary school students," "development," and "impact." Third, the analysis of the centrality of betweenness centrality showed that the index of 'program', 'primary students' and 'primary schools' were the highest, and the largest group was 'development and effect of teaching and learning programs'. Based on these results, it was concluded that future research on climate change education needs to be examined in further detail and expanded into more specific areas.

A Study on the Library Marketing Research Trends through Keyword Network Analysis: Comparative Analysis of Korea and Other Countries (키워드 네트워크 분석을 통한 도서관마케팅 연구 경향 분석 - 우리나라와 국외연구의 비교분석 -)

  • Lee, Seongsin
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.50 no.3
    • /
    • pp.383-402
    • /
    • 2016
  • The purpose of this study is to study library marketing research trends in Korea and other countries through the analysis of author keyword network of peer-reviewed journal articles. The author keyword was collected from four major LIS journals in Korea and Scopus academic database for other countries'. The data was analyzed using NetMiner4 software. The results of the study were as follows: 1) In Korea, lots of library marketing studies focused on public libraries. However, there was a range of library marketing researches focused on academic libraries in other countries, 2) In Korea, there was not a variety of subjects of library marketing studies and the studies were mainly led by a few scholars, 3) In other countries, many scholars paid attention to digital library marketing through social media and/or web, and 4) there little library marketing studies focused on school libraries both in Korea and other countries.

Contextual Advertisement System based on Document Clustering (문서 클러스터링을 이용한 문맥 광고 시스템)

  • Lee, Dong-Kwang;Kang, In-Ho;An, Dong-Un
    • The KIPS Transactions:PartB
    • /
    • v.15B no.1
    • /
    • pp.73-80
    • /
    • 2008
  • In this paper, an advertisement-keyword finding method using document clustering is proposed to solve problems by ambiguous words and incorrect identification of main keywords. News articles that have similar contents and the same advertisement-keywords are clustered to construct the contextual information of advertisement-keywords. In addition to news articles, the web page and summary of a product are also used to construct the contextual information. The given document is classified as one of the news article clusters, and then cluster-relevant advertisement-keywords are used to identify keywords in the document. We could achieve 21% precision improvement by our proposed method.

Extracting User-Specific Advertising Keywords Based on Textual Data Mining from KakaoTalk (카카오톡에서의 텍스트 데이터 마이닝 기반의 사용자별 적합 광고 키워드 도출 )

  • Yerim Jeon;Dayeong So;Jimin Lee;Eunjin (Jinny) Jo;Jihoon Moon
    • Annual Conference of KIPS
    • /
    • 2023.05a
    • /
    • pp.368-369
    • /
    • 2023
  • 대화 데이터 기반 광고 추천은 광고 마케팅에서 고객 맞춤형 광고 제공, 마케팅 효과 극대화 등을 위한 중요한 기술로 주목받고 있다. 본 논문에서는 모바일 인스턴스 메신저인 카카오톡 대화창에서 발생한 텍스트 데이터를 기반으로 대화 내용을 분석하여 대화 주제별 적절한 광고 키워드를 제안한다. 이를 위해 주제별 대화 내용을 미용, 식음료, 상거래로 세분하고 KoNLPy 의 Okt 를 이용하여 텍스트 전처리를 수행하고 키워드별로 빈도수를 뽑아 워드 클라우드를 제시한다. 또한, 잠재 디리클레 할당(Latent Dirichlet Allocation, LDA)을 기반으로 대화 주제를 세분화한 뒤 라벨링을 통해 주제별 대화 키워드를 분석한다. 실험 결과, 대화 주제를 온라인 쇼핑, 헤어, 뷰티 관리, 음식으로 나눌 수 있었으며, 토픽별 상위 키워드를 Word2Vec 을 통해 특정 단어와 유사한 키워드를 도출하여 적절한 광고 키워드를 제시할 수 있었다.