• Title/Summary/Keyword: Text Mining for Korean

A Study on the User Perception in Fashion Design through Social Media Text-Mining (소셜미디어 텍스트마이닝을 통한 패션디자인 사용자 인식 조사)

  • An, Hyosun;Park, Minjung
    • Journal of the Korean Society of Clothing and Textiles / v.41 no.6 / pp.1060-1070 / 2017
  • This study seeks methods to analyze users' perception of fashion designs shown on social media using text-mining analysis. 'Men's stripe shirts' were selected as the subject, and related texts were collected mainly from blogs. Texts from 13,648 posts from November 1st, 2015 to October 31st, 2016 were analyzed by applying the LDA algorithm and content analysis. As a result, the wearing status per season and subjects of men's stripe shirts were derived. Across the entire period, the main topics discussed by users were pattern, customized suits, brands, coordination and purchase information. In terms of seasons, spring showed the sharing of information on coordinating daily looks or boyfriend looks, and during the winter season the information shared was about shirts suitable for special occasions such as job interviews and stripe shirts that match suits. The results showed that text-mining analysis is capable of analyzing context and providing a user-centered index that responds to demands newly mentioned by users amid rapid changes in fashion design trends.

A Study on Keyword Information Characteristics of Product Names for Online Sales of Women's Jeans Using Text Mining (텍스트마이닝을 활용한 온라인 판매 여성 청바지 상품명에 나타난 키워드의 정보 특성 분석)

  • Yeo Sun Kang
    • Journal of the Korean Society of Clothing and Textiles / v.47 no.1 / pp.35-51 / 2023
  • This study used text mining to extract 2,842 keywords from 7,397 product names and organized them into categories in order to analyze the characteristics of keywords appearing in the product names of jeans after 2020. The item category included denim, Chungbaji [청바지], and Ilja [일자], while the silhouette category included wide and bootcut. In addition, high-waist and banding comprised the making category, and the materials category consisted of napping, spandex, and soft blue. Denim surpassed the others in frequency, co-occurrence frequency, and centrality, and co-appeared with various other keywords. Also, the co-appearance of item and silhouette was prominent, and there were many keyword combinations that showed characteristics related to (a) high waist; (b) hemline detail; (c) rubber band; and (d) partial tearing. Furthermore, idiomatic expressions such as 'slim fit' and 'back tearing', which were not highlighted in the co-occurrence frequency, were additionally confirmed through correlation. Therefore, the product name analysis effectively identified the detailed characteristics of the silhouette and the making of jeans preferred by consumers.
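
The frequency and co-occurrence counts mentioned in this abstract can be illustrated with a minimal sketch. The token lists below are hypothetical examples, not data from the study, and the function name `keyword_stats` is an assumption made for illustration; the abstract does not state which tool the author used.

```python
from collections import Counter
from itertools import combinations

# Hypothetical, pre-tokenized product names (not data from the study).
product_names = [
    ["denim", "wide", "high-waist"],
    ["denim", "bootcut", "spandex"],
    ["denim", "wide", "banding"],
]

def keyword_stats(names):
    """Return (keyword frequency, pairwise co-occurrence frequency)."""
    freq = Counter()
    cooc = Counter()
    for tokens in names:
        unique = sorted(set(tokens))          # count each keyword once per name
        freq.update(unique)
        cooc.update(combinations(unique, 2))  # unordered pairs, alphabetical order
    return freq, cooc

freq, cooc = keyword_stats(product_names)
print(freq.most_common(1))
print(cooc[("denim", "wide")])
```

Pairs are stored in alphabetical order so that ("denim", "wide") and ("wide", "denim") are counted as the same combination.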

Keywords Analysis of Clothing Materials in Consumer Reviews Using Big Data Text Mining (빅데이터 텍스트 마이닝을 활용한 소비자 리뷰에서의 의류 소재 키워드 분석)

  • Gaeun Kang;Jiwon Park;Shinjung Yoo
    • Journal of the Korean Society of Clothing and Textiles / v.48 no.4 / pp.729-743 / 2024
  • This research explores consumer preferences for materials in different clothing product categories, using web-crawling and text mining techniques. Specifically, the study focuses on the material-related terms found in consumer reviews across three distinct product categories: functional clothing, formal shirts, and knit sweaters. Top-selling products within each category were identified on the Naver Shopping website based on the volume of reviews, and the four most-reviewed products were selected. Six hundred reviews per product were analyzed using the Textom big-data analysis software to determine the frequency of material-related mentions and word associations. The analysis utilized two comparative metrics: product category and usage duration. Our findings reveal notable variations in the material preferences mentioned by consumers across different product categories. The study suggests a need to re-evaluate existing standardized review criteria to better reflect consumer interests specific to each product category. Additionally, an increase in material-related terms in reviews over one month indicates the potential importance of extending the duration of product reviews to enhance the accuracy of information that reflects longer-term consumer experiences with material quality.

Analysis of research trends on mobile health intervention for Korean patients with chronic disease using text mining (텍스트마이닝을 이용한 국내 만성질환자 대상 모바일 헬스 중재연구 동향 분석)

  • Son, Youn-Jung;Lee, Soo-Kyoung
    • Journal of Digital Convergence / v.17 no.4 / pp.211-217 / 2019
  • With the widespread use of mobile health interventions among Korean patients with chronic disease, research trends in mobile health intervention for chronic care need to be identified using text mining techniques. This secondary data analysis investigated the characteristics and main research topics of intervention studies published from 2005 to 2018, covering a total of 20 peer-reviewed articles. Microsoft Excel and Text Analyzer were used for data analysis. Mobile health interventions were mainly applied to hypertension, diabetes, stroke, and coronary artery disease. The most common type of intervention was the development of a mobile application. In recent studies, 'feasibility', 'mobile health', and 'outcome measure' appeared frequently. Future larger studies are needed to identify the relationships among key terms and the effectiveness of mobile health intervention using social network analysis.

A Study on the Archival Information Services of Economic Policy Using Text Mining Methods: Focusing on Economic Policy Directions (텍스트 마이닝을 활용한 경제정책기록서비스 연구: 경제정책방향을 중심으로)

  • Yeon, Jihyun;Kim, Sungwon
    • Journal of Korean Society of Archives and Records Management / v.22 no.2 / pp.117-133 / 2022
  • Archival content that is listed arbitrarily makes it difficult for users to efficiently access the records of major economic policies, especially given that they use it without understanding the required period and context. Applying text mining techniques to 30 years of economic policy directions from 1991 to 2021, this paper derives the economy-related keywords that the government mainly dealt with and how they changed. It collects and preprocesses the background, main content, and body text of major economic policies and conducts text frequency, term frequency-inverse document frequency (TF-IDF), network, and time series analyses. Based on these analyses, the most frequent words, in order, were "job (일자리)," "competitive (경쟁력)," and "restructuring (구조조정)." In addition, the relative ratio of "job (일자리)," "real estate (부동산)," and "corporation (기업)" by year was analyzed chronologically, and the major keywords mentioned by each government were presented. Based on the results, this study presents implications for developing and broadening the area of archival information services related to economic policies.
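
As a rough illustration of the TF-IDF analysis named in this abstract, the sketch below uses one common unsmoothed weighting, tf = count / document length and idf = ln(N / df). The abstract does not specify the exact variant, so this weighting, the toy documents, and the function name `tf_idf` are all assumptions.

```python
import math
from collections import Counter

# Hypothetical tokenized policy documents keyed by year (not the study's data).
docs = {
    "1991": ["job", "restructuring", "competitiveness"],
    "2005": ["job", "real_estate", "corporation"],
    "2021": ["job", "real_estate", "job"],
}

def tf_idf(docs):
    """Score each term per document as (count / doc length) * ln(N / df)."""
    n_docs = len(docs)
    df = Counter()
    for tokens in docs.values():
        df.update(set(tokens))  # document frequency: one count per document
    scores = {}
    for name, tokens in docs.items():
        counts = Counter(tokens)
        scores[name] = {
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
    return scores

scores = tf_idf(docs)
print(scores["2021"])
```

Under this weighting, a term that occurs in every document (here "job") scores zero, which is why raw frequency and TF-IDF rankings can differ.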

The Impact of Transforming Unstructured Data into Structured Data on a Churn Prediction Model for Loan Customers

  • Jung, Hoon;Lee, Bong Gyou
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.12 / pp.4706-4724 / 2020
  • In addition to various structured data, such as company size, loan balance, and savings accounts, this study analyzed the voice of customer (VOC), which is text data containing contact history and counseling details. To analyze unstructured data, the term frequency-inverse document frequency (TF-IDF) analysis, semantic network analysis, sentiment analysis, and a convolutional neural network (CNN) were implemented. A performance comparison of the models revealed that the predictive model using the CNN provided the best performance with regard to predictive power, followed by the model using the TF-IDF, and then the model using semantic network analysis. In particular, a character-level CNN and a word-level CNN were developed separately, and the character-level CNN exhibited better performance, according to an analysis for the Korean language. Moreover, a systematic selection model for optimal text mining techniques was proposed, suggesting which analytical technique is appropriate for analyzing text data depending on the context. This study also provides evidence that the results of previous studies, indicating that individual customers leave when their loyalty and switching cost are low, are also applicable to corporate customers and suggests that VOC data indicating customers' needs are very effective for predicting their behavior.

Discovering the anti-cancer phytochemical rutin against breast cancer through the methodical platform based on traditional medicinal knowledge

  • Jungwhoi Lee;Jungsul Lee;WooGwang Sim;Jae-Hoon Kim;Chulhee Choi;Jongwook Jeon
    • BMB Reports / v.56 no.11 / pp.594-599 / 2023
  • A number of therapeutic drugs have been developed from functional chemicals found in plants. Knowledge of plants used for medicinal purposes has historically been transmitted by word of mouth or through literature. The aim of the present study is to provide a systematic platform for the development of lead compounds against breast cancer based on a traditional medical text. To verify our systematic approach, an integrated process consisting of text mining of traditional medical texts, 3-D virtual docking screening, and in vitro and in vivo experimental validation was demonstrated. Our text analysis system identified rutin as a specific phytochemical traditionally used for cancer treatment. 3-D virtual screening predicted that rutin could block EGFR signaling. Thus, we validated significant anti-cancer effects of rutin against breast cancer cells through blockade of the EGFR signaling pathway in vitro. We also demonstrated in vivo anti-cancer effects of rutin using in vivo breast cancer recurrence models. In summary, our approach may be suitable for discovering new phytochemical lead compounds designed to block malignant neoplasms, including breast cancer.

Analysis of the Contents of Hanbok in the 「Home Life and Safety」 section of the High School Technical Family Textbook: Content Analysis and Text Mining Techniques are utilized (고등학교 기술·가정 교과서 「가정생활과 안전」 영역의 한복 내용 분석)

  • Shim, Joon Young;Baek, Min Kyung
    • Human Ecology Research / v.59 no.2 / pp.261-273 / 2021
  • Traditional costume carries not just the meaning of clothing but also a cultural function, and it encompasses the associated emotions. As the interest of youths has increased recently, the importance of traditional costume education has been growing. Therefore, this study aims to analyze the contents of Hanbok in the 2015 revised high school technology and home economics textbooks using content analysis and text mining techniques. The results were as follows. First, the textbooks covered the symbolic meaning, characteristics, and beauty of Hanbok as practiced in daily life, its value as found through its excellence, and its modernization. Second, most of the illustrations related to traditional costumes were presented in various ways, but they fell short in quantity and quality. Third, apart from the types of clothing, the words used to explain traditional costumes included culture, excellence, tradition, modernity, harmony, and succession. Therefore, the results and discussions derived from this study are expected to help textbooks be efficiently selected and used in schools, along with a correct understanding of traditional culture in the process of selecting traditional culture contents and illustrations.

Analysis of the Yearbook from the Korea Meteorological Administration using a text-mining algorithm (텍스트 마이닝 알고리즘을 이용한 기상청 기상연감 자료 분석)

  • Sun, Hyunseok;Lim, Changwon;Lee, YungSeop
    • The Korean Journal of Applied Statistics / v.30 no.4 / pp.603-613 / 2017
  • Many people have recently posted about personal interests on social media. The development of the Internet and computer technology has enabled documents to be stored in digital form, which has resulted in an explosion in the amount of textual data generated; consequently, there is an increased demand for technology that creates valuable information from a large number of documents. Text mining techniques are often used because text-based data is mostly unstructured and therefore not suitable for the direct application of statistical analysis or data mining techniques. This study analyzed the Meteorological Yearbook data of the Korea Meteorological Administration (KMA) with a text mining technique. First, a term dictionary was constructed through preprocessing and a term-document matrix was generated. This term dictionary was then used to calculate the annual frequency of each term and to observe the change in relative frequency for frequently appearing words. We also used regression analysis to identify terms with increasing and decreasing trends. In this way, we analyzed trends in the Meteorological Yearbook of the KMA with respect to weather-related news, weather status, and the work that the KMA focused on. This study aims to provide useful information that can help analyze and improve meteorological services and inform meteorological policy.
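
The pipeline described in this abstract (annual relative frequency per term, followed by regression to find rising and falling terms) can be sketched as follows. The abstract does not say which regression model was used, so the ordinary least-squares slope here is an assumption, as are the toy yearbook tokens and the function names.

```python
from collections import Counter

# Hypothetical tokenized yearbooks keyed by year (not the KMA data).
yearbooks = {
    2014: ["typhoon", "forecast", "forecast", "radar"],
    2015: ["typhoon", "forecast", "radar", "radar"],
    2016: ["forecast", "radar", "radar", "radar"],
}

def relative_frequency(tokens_by_year, term):
    """Share of each year's tokens that equal `term`, in year order."""
    return {
        year: Counter(tokens)[term] / len(tokens)
        for year, tokens in sorted(tokens_by_year.items())
    }

def ols_slope(xs, ys):
    """Slope of the least-squares line through the points (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

def trend(tokens_by_year, term):
    """Positive slope suggests a rising term, negative a falling one."""
    rel = relative_frequency(tokens_by_year, term)
    return ols_slope(list(rel.keys()), list(rel.values()))

for term in ("radar", "typhoon"):
    print(term, trend(yearbooks, term))
```

Using relative rather than raw frequency keeps yearbooks of different lengths comparable before the slope is fitted.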