• 제목/요약/키워드: High Frequency Word

검색결과 196건 처리시간 0.038초

Analysis on Types of Golf Tourism After COVID-19 by using Big Data

  • Hyun Seok Kim;Munyeong Yun;Gi-Hwan Ryu
    • International Journal of Advanced Culture Technology
    • /
    • 제12권1호
    • /
    • pp.270-275
    • /
    • 2024
  • Introduction. In this study, purpose is to analize the types of golf tourism, inbound or outbound, by using big data and see how movement of industry is being changed and what changes have been made during and after Covid-19 in golf industry. Method Using Textom, a big data analysis tool, "golf tourism" and "Covid-19" were selected as keywords, and search frequency information of Naver and Daum was collected for a year from 1 st January, 2023 to 31st December, 2023, and data preprocessing was conducted based on this. For the suitability of the study and more accurate data, data not related to "golf tourism" was removed through the refining process, and similar keywords were grouped into the same keyword to perform analysis. As a result of the word refining process, top 36 keywords with the highest relevance and search frequency were selected and applied to this study. The top 36 keywords derived through word purification were subjected to TF-IDF analysis, visualization analysis using Ucinet6 and NetDraw programs, network analysis between keywords, and cluster analysis between each keyword through Concor analysis. Results By using big data analysis, it was found out option of oversea golf tourism is affecting on inbound golf travel. "Golf", "Tourism", "Vietnam", "Thailand" showed high frequencies, which proves that oversea golf tour is now the re-coming trends.

워드임베딩을 이용한 온라인 비디오 강의의 고빈도 단어와 키워드 간의 유사도 비교 연구 (A Study on Comparative Analysis with High-Frequency Word and Keyword using Word Embedding)

  • 조재춘;임희석
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2017년도 춘계학술발표대회
    • /
    • pp.385-386
    • /
    • 2017
  • 무료 온라인 교육환경은 교육의 기회를 제공함으로써, 지속적으로 관심이 높아지고 있으며 배움에 대한 노력에 중시하고 있다. 따라서 본 연구는 배움에 대한 노력을 온라인에서 자동으로 판단할 수 있는 최소학습 판단 시스템을 제안해 왔다. 최소학습을 판단하기 위해 온라인 비디오 강의에서 고빈도 단어를 추출하여 단어게임을 통해 판단하는데, 이때 고빈도 단어가 최소학습을 판단하기 위한 키워드로 사용할 수 있는지에 대한 검증 실험이 요구되었다. 따라서 본 논문은 워드임베딩을 이용하여 고빈도 단어와 키워드간의 유사도를 비교하여 고빈도 단어에 대한 검증 실험을 실시하였다. 실험 결과, 고빈도 단어가 온라인 비디오 강의를 대표할 수 있는 키워드로 사용될 수 있는 긍정적인 결과를 보였고 최소 학습을 판단하기 위한 요소(Feature)로 충분히 사용가능함을 보였다.

새로운 Ternary CAM을 이용한 고속 허프만 디코더 설계 (A high speed huffman decoder using new ternary CAM)

  • 이광진;김상훈;이주석;박노경;차균현
    • 한국통신학회논문지
    • /
    • 제21권7호
    • /
    • pp.1716-1725
    • /
    • 1996
  • In this paper, the huffman decoder which is a part of the decoder in JPEG standard format is designed by using a new Ternary CAM. First, the 256 word * 16 bit-size new bit-word all parallel Ternary CAM system is designed and verified using SPICE and CADENCE Verilog-XL, and then the verified novel Ternary CAM is applied to the new huffman decoder architecture of JPEG. So the performnce of the designed CAM cell and it's block is verified. The new Ternary CAM has various applications because it has search data mask and storing data mask function, which enable bit-wise search and don't care state storing. When the CAM is used for huffman look-up table in huffman decoder, the CAM is partitioned according to the decoding symbol frequency. The scheme of partitioning CAM for huffman table overcomes the drawbacks of all-parallel CAM with much power and load. So operation speed and power consumption are improved.

  • PDF

The Syllable Frequency Effect in Semantic Categorization Tasks in Korean

  • Kim, Ji-Hye;Kwon, You-An;Nam, Ki-Chun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제5권10호
    • /
    • pp.1879-1890
    • /
    • 2011
  • Previous studies of syllable frequency effects have proposed that inhibitory effects due to high first syllable frequency were the products of competitions between activated lexical candidates within a lexical level. However, these studies have primarily used lexical decision tasks to examine the nature of syllable frequency effects. This study investigates whether a syllable frequency effect can arise in semantic categorization tasks and whether phonologically or orthographically defined syllables interact with semantically related variables such as morphological family size. If the syllable frequency effect was created by activations and competitions on a lexical level, it is highly possible that the effect was related to semantic categorization tasks. To test this hypothesis, we conducted two experiments. In Experiment 1, morphological family size and phonological syllable frequency were factorially manipulated. In Experiment 2, morphological family size and orthographic syllable frequency were factorially manipulated. The results demonstrate that morphemes have no relationship with phonological syllables but do with orthographic syllables. This suggests that phonological syllables and orthographic syllables have different roles in the syllable frequency effect on visual word recognition process.

An evaluation of Korean students' pronunciation of an English passage by a speech recognition application and two human raters

  • Yang, Byunggon
    • 말소리와 음성과학
    • /
    • 제12권4호
    • /
    • pp.19-25
    • /
    • 2020
  • This study examined thirty-one Korean students' pronunciation of an English passage using a speech recognition application, Speechnotes, and two Canadian raters' evaluations of their speech according to the International English Language Testing System (IELTS) band criteria to assess the possibility of using the application as a teaching aid for pronunciation education. The results showed that the grand average percentage of correctly recognized words was 77.7%. From the moderate recognition rate, the pronunciation level of the participants was construed as intermediate and higher. The recognition rate varied depending on the composition of the content words and the function words in each given sentence. Frequency counts of unrecognized words by group level and word type revealed the typical pronunciation problems of the participants, including fricatives and nasals. The IELTS bands chosen by the two native raters for the rainbow passage had a moderately high correlation with each other. A moderate correlation was reported between the number of correctly recognized content words and the raters' bands, while an almost a negligible correlation was found between the function words and the raters' bands. From these results, the author concludes that the speech recognition application could constitute a partial aid for diagnosing each individual's or the group's pronunciation problems, but further studies are still needed to match human raters.

종속격 정보를 적용한 동사 의미 중의성 해소 (Verb Sense Disambiguation using Subordinating Case Information)

  • 박요셉;신준철;옥철영;박혁로
    • 정보처리학회논문지B
    • /
    • 제18B권4호
    • /
    • pp.241-248
    • /
    • 2011
  • 동형이의어는 여러 가지 의미를 가진 단어를 의미한다. 문장의 의미를 이해하기 위해서는 필수적으로 문장에 포함된 동형이의어의 의미를 결정해야 한다. 기존의 단어 의미 중의성 연구들은 공기 빈도를 기반으로 해결하였다. 하지만, 동사의 경우에는 정확도 향상을 위해서 격 정보가 중요하다. 왜냐하면, 동사 동형이의어의 의미는 행위의 주체나 객체에 따라 결정되어서 종속격(목적격, 부사격, 보격) 정보가 필요하며, 동사동형이의어 의미마다 서로 다른 격 정보가 필요하기 때문이다. 본 논문에서는 한국어 격 정보를 적용한 동사 의미 중의성 해소를 제안한다. 격정보는 표준국어대사전에 명시된 조사 정보를 이용하였다. 실험은 고빈도 동형이의어 12개를 대상으로 하였으며, 실험결과 정확도가 기존의 97.3%에서 98.7%로 1.34% 향상되었다. 이는 원래의 오류율을 2.7%에서 1.3%으로 절반정도 줄였다.

Adjusting Weights of Single-word and Multi-word Terms for Keyphrase Extraction from Article Text

  • Kang, In-Su
    • 한국컴퓨터정보학회논문지
    • /
    • 제26권8호
    • /
    • pp.47-54
    • /
    • 2021
  • 핵심구 추출은 문서의 내용을 대표하는 주제 용어를 자동 추출하는 작업이다. 비지도 방식 핵심구 추출에서는 문서 텍스트로부터 핵심구 후보 용어가 되는 단어나 구를 추출하고 후보 용어에 부여된 중요도에 기반하여 최종 핵심구들이 선택된다. 본 논문에서는 비지도 방식 핵심구 후보 용어 중요도 계산에서 단어 유형 후보 용어와 구 유형 후보 용어의 중요도를 조정하는 방법을 제안한다. 이를 위해 핵심구 추출 대상 문서 텍스트로부터 후보 용어 집합의 타입-토큰 비율과 고빈도 대표 용어의 정보량을 단어 유형과 구 유형으로 구분하여 수집한 후 중요도 조정에 활용한다. 실험에서는 영어로 작성된 full-text 논문을 대상으로 구축된 4개 서로 다른 핵심구 추출 평가집합들을 사용하여 성능 평가를 수행하였고, 제안된 중요도 조정 방법은 3개 평가집합들에서 베이스 라인 및 비교 방법들보다 높은 성능을 보였다.

한국어 음소결합확률 계산기 개발연구 (A Study of Development for Korean Phonotactic Probability Calculator)

  • 이찬종;이현복;최훈영
    • 한국음향학회지
    • /
    • 제28권3호
    • /
    • pp.239-244
    • /
    • 2009
  • 본 연구는 현대 한국어 단어의 말소리가 결합하는 음소결합확률 (Phonotactic Probability)을 예측하는 계산기 엔진 개발에 관한 연구이다. 한국어 음소결합확률계산기 (이하, KPPC)는 첫째로, 한국어의 주어진 단어에서의 음소와 그 음소의 음소결합의 빈도를 예측하여 말소리가 단어내의 특정위치에서 특정 분절음이 나타나는 빈도 값, 두 음소간의 결합의 빈도값, 그리고 세 음소간의 결합의 빈도 출현률을 예측하여 계산한다. 둘째로 한국어의 주어진 단어에서 말소리 하나만 다르면서 실제로 존재할 수 있는 근접밀도 (neighborhood density)의 값을 계산한다. University of Kansas에서 개발된 음소결합계산기는 영어 20,000단어의 D/B를 대상으로 위치별 분절음빈도와 두 음소간의 음소결합률 빈도를 컴퓨터가 읽을 수 있는 발음기호를 통해서만 가능하다. 본 연구에서는 분절음빈도와 두 음소간의 빈도뿐만 아니라 세 음소간의 결합률 빈도와 근접밀도율을 예측할 수 있고 입력할 때 발음기호뿐만 아니라 단어를 입력하면 확률값을 얻을 수 있다. 이 엔진은 67,284단어의 한국어 표준발음을 D/B로 구축하여 고빈도 음소결합확률, 저빈도 음소결합확률, 고빈도 근접밀도, 저빈도 근접밀도의 값을 예측할 수 있다.

연결망 분석을 활용한 우리나라 금연연구 동향분석 (A Social Network Analysis of Research Key Words Related Smoke Cessation in South Korea)

  • 안은성
    • 보건행정학회지
    • /
    • 제29권2호
    • /
    • pp.138-145
    • /
    • 2019
  • Background: The purpose of this study is supposed to figure out the keyword network from 2009 to 2018 with social network analysis and provide the research data that can help the Korea government's policy making on smoking cessation. Methods: First, frequency analysis on the keyword was performed. After, in this study, I applied three classic centrality measures (degree centrality, betweenness centrality, and eigenvector centrality) with R 3.5.1. Moreover, I visualized the results as the word cloud and keyword network. Results: As a result of network analysis, 'smoking' and 'smoking cessation' were key words with high frequency, high degree centrality, and betweenness centrality. As a result of looking at trends in keyword, many study had been done on the keyword 'secondhand smoke' and 'adolescent' from 2009 to 2013, and 'cigarette graphic warning' and 'electronic cigarette' from 2014 to 2018. Conclusion: This study contributes to understand trends on smoking cessation study and seek further study with the keyword network analysis.

텍스트 마이닝 처리로 품질경영학회지 연구동향 분석 (Analysis of Research Trends in Journal of Korean Society for Quality Management by Text Mining Processing)

  • 이상복
    • 품질경영학회지
    • /
    • 제47권3호
    • /
    • pp.597-613
    • /
    • 2019
  • Purpose: The purpose of this study is to analyze the trend of quality research by analyzing the entire JKSQM(Journal of the Korean Society for Quality Management). Methods: This study is to analyze the frequency of words used in the abstract of the all JKSQM by applying the text mining processing. We use wordcrowd among text mining techniques. Results: 22 words of high frequency were presented in the abstract of the paper published in the JKSQM for 42 years. The frequency of words was shown on a 10 year basis, and the four important words were plotted on a change graph for each Vol. Frequent words of each Vol. are added in the appendix. Conclusion: The main research results are as follows. First, there has been no significant change in research trends over the last 40 years. Second, the early SQC words have been widely used, and since 1990, many words such as service-oriented words have been used, indicating a change in the times. Third, the use of the words of the 4th industrial revolution since 2010 is weak. In the above analysis, the trend of quality research in Korea is within the quality category and can be considered conservative. Now, it is expected that everything will be changed in the period of the 4th Industrial Revolution, and it is time to study the direction of quality in Korea.