• Title/Summary/Keyword: co-occurrence frequency

Search Result 143, Processing Time 0.023 seconds

Text Mining of Wood Science Research Published in Korean and Japanese Journals

  • Eun-Suk JANG
    • Journal of the Korean Wood Science and Technology
    • /
    • v.51 no.6
    • /
    • pp.458-469
    • /
    • 2023
  • Text mining techniques provide valuable insights into research information across various fields. In this study, text mining was used to identify research trends in wood science from 2012 to 2022, with a focus on representative journals published in Korea and Japan. Abstracts from Journal of the Korean Wood Science and Technology (JKWST, 785 articles) and Journal of Wood Science (JWS, 812 articles) obtained from the SCOPUS database were analyzed in terms of the word frequency (specifically, term frequency-inverse document frequency) and co-occurrence network analysis. Both journals showed a significant occurrence of words related to the physical and mechanical properties of wood. Furthermore, words related to wood species native to each country and their respective timber industries frequently appeared in both journals. CLT was a common keyword in engineering wood materials in Korea and Japan. In addition, the keywords "MDF," "MUF," and "GFRP" were ranked in the top 50 in Korea. Research on wood anatomy was inferred to be more active in Japan than in Korea. Co-occurrence network analysis showed that words related to the physical and structural characteristics of wood were organically related to wood materials.

A Study on Multi-frequency Keyword Visualization based on Co-occurrence (다중빈도 키워드 가시화에 관한 연구)

  • Lee, HyunChang;Shin, SeongYoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.103-104
    • /
    • 2018
  • Recently, interest in data analysis has increased as the importance of big data becomes more important. Particularly, as social media data and academic research communities become more active and important, analysis becomes more important. In this study, co-word analysis was conducted through altmetrics articles collected from 2012 to 2017. In this way, the co-occurrence network map is derived from the keyword and the emphasized keyword is extracted.

  • PDF

A Study on Multi-frequency Keyword Visualization based on Co-occurrence (다중빈도 키워드 가시화에 관한 연구)

  • Lee, HyunChang;Shin, SeongYoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.424-425
    • /
    • 2018
  • Recently, interest in data analysis has increased as the importance of big data becomes more important. Particularly, as social media data and academic research communities become more active and important, analysis becomes more important. In this study, co-word analysis was conducted through altmetrics articles collected from 2012 to 2017. In this way, the co-occurrence network map is derived from the keyword and the emphasized keyword is extracted.

  • PDF

Texture Analysis and Classification Using Wavelet Extension and Gray Level Co-occurrence Matrix for Defect Detection in Small Dimension Images

  • Agani, Nazori;Al-Attas, Syed Abd Rahman;Salleh, Sheikh Hussain Sheikh
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.2059-2064
    • /
    • 2004
  • Texture analysis is an important role for automatic visual insfection. This paper presents an application of wavelet extension and Gray level co-occurrence matrix (GLCM) for detection of defect encountered in textured images. Texture characteristic in low quality images is not to easy task to perform caused by noise, low frequency and small dimension. In order to solve this problem, we have developed a procedure called wavelet image extension. Wavelet extension procedure is used to determine the frequency bands carrying the most information about the texture by decomposing images into multiple frequency bands and to form an image approximation with higher resolution. Thus, wavelet extension procedure offers the ability to robust feature extraction in images. Then the features are extracted from the co-occurrence matrices computed from the sub-bands which performed by partitioning the texture image into sub-window. In the detection part, Mahalanobis distance classifier is used to decide whether the test image is defective or non defective.

  • PDF

A Temperature Analysis Study of Korea and its Neighbor Regions for Temperature Specification (온도규격 설정을 위한 한반도 및 주변권역 온도분석 연구)

  • Kim, In-Soo;Kang, Chee-Woo
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.18 no.1
    • /
    • pp.55-65
    • /
    • 2015
  • This paper describes the results of a temperature analysis of Korea and its neighbor regions for temperature specification of weapon systems to be deployed in Korea, and introduces the concept of a standard deviation frequency of occurrence which represents a frequency of yearly occurrence. On the basis of this analysis, reasonable operational temperatures for the Korea weapon systems are recommended, and the regional frequency of yearly occurrence of temperatures worse than recommended operational temperatures in each country regions are presented.

A Study on Keyword Information Characteristics of Product Names for Online Sales of Women's Jeans Using Text Mining (텍스트마이닝을 활용한 온라인 판매 여성 청바지 상품명에 나타난 키워드의 정보 특성 분석)

  • Yeo Sun Kang
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.47 no.1
    • /
    • pp.35-51
    • /
    • 2023
  • This study used text mining to extract 2,842 keywords from 7,397 product names and organized them into categories in order to analyze the characteristics of keywords appearing in the product names of jeans after 2020. The item category included denim and Chungbaji [청바지], and Ilja [일자], while the silhouette category included wide and bootcut. In addition, high-waist and banding comprised the making sector, and the materials category consisted of napping, spandex, and soft blue. Denim surpassed the others in frequency, co-occurrence frequency, and centrality, and co-appeared with various other keywords. Also, the co-appearance of item and silhouette was prominent, and there were many keyword combinations that showed characteristics related to (a) high waist; (b) hemline detail; (c) rubber band; and (d) partial tearing. Furthermore, idiom expressions such as 'slim fit' and 'back tearing', which were not highlighted in the co-occurrence frequency, were additionally confirmed through correlation. Therefore, the product name analysis effectively identified the detailed characteristics of the silhouette and the making of jeans preferred by consumers.

Comparative Analysis of Job Satisfaction Factors, Using LDA Topic Modeling by Industries : The Case Study of Job Planet Reviews (토픽모델링 기법을 활용한 산업별 직무만족요인 비교 조사 : 잡플래닛 리뷰를 중심으로)

  • Kim, Dongwook;Kang, Juyoung;Lim, Jay Ick
    • Journal of Information Technology Services
    • /
    • v.15 no.3
    • /
    • pp.157-171
    • /
    • 2016
  • As unemployment rates and concerns about turnover keep growing, the need for information is also increasing. In these situations, the job reviews which share information about the company catch people's attention because they are usually created by people who worked at the company. The development of SNS and mobile environments has led to an increase in the web services that provide job reviews. For example, Jobplanet is a job review service in Korea, and Glassdoor.com offers a similar service in the US. Despite this attention, however, research utilizing job reviews is insufficient. This paper asks whether there are differences in ratios of job satisfaction factors by industry, using LDA topic modeling and co-occurrence analysis to explore the differences. Through the results of LDA, we find that the ratios of job satisfaction factors are similar by industry. At the same time, the results of co-occurrence analysis show that the co-occurrence frequency of some job satisfaction factors appears high: pay and welfare, balance of work and life, company culture. We expect that the result of this research will be helpful in comparative analysis of job satisfaction factors by industry. Furthermore, in this paper we suggest how to use the job review data in organizational behavior research.

Empirical Comparison of Word Similarity Measures Based on Co-Occurrence, Context, and a Vector Space Model

  • Kadowaki, Natsuki;Kishida, Kazuaki
    • Journal of Information Science Theory and Practice
    • /
    • v.8 no.2
    • /
    • pp.6-17
    • /
    • 2020
  • Word similarity is often measured to enhance system performance in the information retrieval field and other related areas. This paper reports on an experimental comparison of values for word similarity measures that were computed based on 50 intentionally selected words from a Reuters corpus. There were three targets, including (1) co-occurrence-based similarity measures (for which a co-occurrence frequency is counted as the number of documents or sentences), (2) context-based distributional similarity measures obtained from a latent Dirichlet allocation (LDA), nonnegative matrix factorization (NMF), and Word2Vec algorithm, and (3) similarity measures computed from the tf-idf weights of each word according to a vector space model (VSM). Here, a Pearson correlation coefficient for a pair of VSM-based similarity measures and co-occurrence-based similarity measures according to the number of documents was highest. Group-average agglomerative hierarchical clustering was also applied to similarity matrices computed by individual measures. An evaluation of the cluster sets according to an answer set revealed that VSM- and LDA-based similarity measures performed best.

Research on Ways to Revitalize Traditional Markets by Exploring Research Trends (연구동향 탐색을 통한 전통시장 활성화 방안 연구)

  • Choon-Ho LEE;Hoe-Chang YANG
    • The Journal of Economics, Marketing and Management
    • /
    • v.11 no.4
    • /
    • pp.53-63
    • /
    • 2023
  • Purpose: The purpose of this study is to examine the research trends in the papers published by Korean researchers related to traditional markets, to check what topics have been studied, and to make various suggestions for research directions and effective ways to revitalize traditional markets. Research design, data and methodology: To this end, this study conducted word frequency analysis, co-occurrence frequency analysis, BERTopic, LDA, dynamic topic modeling and OLS regression analysis using Python 3.7 on the English abstracts of a total of 502 papers extracted through ScienceON. Results: As a result of word frequency analysis and co-occurrence frequency analysis, it was found that studies related to traditional markets have been conducted not only on factors related to customers, but also on traditional market merchants and government policies, and the degree of service, quality, and satisfaction perceived by customers using traditional markets. Through BERTopic and LDA, three topics such as 'Traditional market safety management' were identified, and among them, it was found that 'Traditional market safety management' is relatively less attention by researchers. Conclusions: The results of this study suggest that future research on the revitalization of traditional markets should be conducted from a specific consulting perspective along with the establishment of various data, a causal model study from various perspectives such as the characteristics of merchants as well as consumers, and an integrated and convergent approach to policy formulation by the government and local governments.