• 제목/요약/키워드: TextMining

Search Result 1,563, Processing Time 0.03 seconds

A study on NLP Text Preprocessing for digital forensic investigation (디지털 포렌식 조사를 위한 NLP의 텍스트 전처리 연구)

  • Lee, Sung-won;Kim, Dohyun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.189-191
    • /
    • 2022
  • In modern society, messenger services are necessary to communication with others, and criminals are no exception. In representative cases of Burning Sun Gate(2018) and NthRoom(2019), messenger data analysis was used as a smoking gun to solve these criminal cases. Therefore messenger text analytics is critical for the resolution of crimes in a modern environment. also, it takes a lot of time to analyze messenger data in the digital forensic investigation process, so researchers in text mining need to be more effective to respond with the current situation In this paper, we study various natural language preprocessing(NLP) methods according to the characteristics of instant messages to effectively proceed with NLP analysis on instant messengers.

  • PDF

Discovering the anti-cancer phytochemical rutin against breast cancer through the methodical platform based on traditional medicinal knowledge

  • Jungwhoi Lee;Jungsul Lee;WooGwang Sim;Jae-Hoon Kim;Chulhee Choi;Jongwook Jeon
    • BMB Reports
    • /
    • v.56 no.11
    • /
    • pp.594-599
    • /
    • 2023
  • A number of therapeutic drugs have been developed from functional chemicals found in plants. Knowledge of plants used for medicinal purposes has historically been transmitted by word of mouth or through literature. The aim of the present study is to provide a systemic platform for the development of lead compounds against breast cancer based on a traditional medical text. To verify our systematic approach, integrating processes consisted of text mining of traditional medical texts, 3-D virtual docking screening, and in vitro and in vivo experimental validations were demonstrated. Our text analysis system identified rutin as a specific phytochemical traditionally used for cancer treatment. 3-D virtual screening predicted that rutin could block EGFR signaling. Thus, we validated significant anti-cancer effects of rutin against breast cancer cells through blockade of EGFR signaling pathway in vitro. We also demonstrated in vivo anti-cancer effects of rutin using the breast cancer recurrence in vivo models. In summary, our innovative approach might be proper for discovering new phytochemical lead compounds designing for blockade of malignant neoplasm including breast cancer.

  • PDF

Webdrama Analysis and Recommendation using Text Mining and Opinion Mining Technique of Social Media (소셜미디어 빅데이터의 텍스트 마이닝과 오피니언 마이닝 기법을 활용한 웹드라마 분석과 제안)

  • Oh, Se-Jong;Kim, Kenneth Chi Ho
    • Cartoon and Animation Studies
    • /
    • s.44
    • /
    • pp.285-306
    • /
    • 2016
  • With the increase use of smartphones, users can consume contents such as webtoon, webnovel and TV drama directly provided by the producers. In this Direct-to-Consumer era, webdrama services from the portal websites are increasing rapidly. Webdramas such as , , and can be analyzed in real time using responses such as unique users, likes, and comments. The analyses used in this research were Social Media Big Data Mining Method and Opinion Mining Method. Specific key words from webdrama can be extracted and viewers positive, neutral or negative emotion can be predicted from the words. The analyses of popular webdramas showed that the established K-Pop Idol member appearance and servicing portal site greatly influence the views, traffics, comments, and likes. Also, 'Mobile TV' proved the effectiveness as another platform other than television. Mobile targeted contents and robust business models still to be developed and identified. Overcoming these few tasks, Korea will be proven to be a webdrama content powerhouse.

Design And Implementation of a Speech Recognition Interview Model based-on Opinion Mining Algorithm (오피니언 마이닝 알고리즘 기반 음성인식 인터뷰 모델의 설계 및 구현)

  • Kim, Kyu-Ho;Kim, Hee-Min;Lee, Ki-Young;Lim, Myung-Jae;Kim, Jeong-Lae
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.1
    • /
    • pp.225-230
    • /
    • 2012
  • The opinion mining is that to use the existing data mining technology also uploaded blog to web, to use product comment, the opinion mining can extract the author's opinion therefore it not judge text's subject, only judge subject's emotion. In this paper, published opinion mining algorithms and the text using speech recognition API for non-voice data to judge the emotions suggested. The system is open and the Subject associated with Google Voice Recognition API sunwihwa algorithm, the algorithm determines the polarity through improved design, based on this interview, speech recognition, which implements the model.

Safety Culture: A Retrospective Analysis of Occupational Health and Safety Mining Reports

  • Tetzlaff, Emily J.;Goggins, Katie A.;Pegoraro, Ann L.;Dorman, Sandra C.;Pakalnis, Vic;Eger, Tammy R.
    • Safety and Health at Work
    • /
    • v.12 no.2
    • /
    • pp.201-208
    • /
    • 2021
  • Background: In the mining industry, various methods of accident analysis have utilized official accident investigations to try and establish broader causation mechanisms. An emerging area of interest is identifying the extent to which cultural influences, such as safety culture, are acting as drivers in the reoccurrence of accidents. Thus, the overall objective of this study was to analyze occupational health and safety (OHS) reports in mining to investigate if/how safety culture has historically been framed in the mining industry, as it relates to accident causation. Methods: Using a computer-assisted qualitative data analysis software, 34 definitions of safety culture were analyzed to highlight key terms. Based on word count and contextual relevance, 26 key terms were captured. Ten OHS reports were then analyzed via an inductive thematic analysis, using the key terms. This analysis provided a concept map representing the 50-year data set and facilitated the use of text framing to highlight safety culture in the selected OHS mining reports. Results: Overall, 954 references and six themes, safety culture, attitude, competence, belief, patterns, and norms, were identified in the data set. Of the 26 key terms originally identified, 24 of them were captured within the text. The results made evident two distinct frames in which to interpret the data: the role of the individual and the role of the organization, in safety culture. Conclusion: Unless efforts are made to understand and alter cultural drivers and share these findings within and across industries, the same accidents are likely to continue to occur.

Analysis on the Trend of The Journal of Information Systems Using TLS Mining (TLS 마이닝을 이용한 '정보시스템연구' 동향 분석)

  • Yun, Ji Hye;Oh, Chang Gyu;Lee, Jong Hwa
    • The Journal of Information Systems
    • /
    • v.31 no.1
    • /
    • pp.289-304
    • /
    • 2022
  • Purpose The development of the network and mobile industries has induced companies to invest in information systems, leading a new industrial revolution. The Journal of Information Systems, which developed the information system field into a theoretical and practical study in the 1990s, retains a 30-year history of information systems. This study aims to identify academic values and research trends of JIS by analyzing the trends. Design/methodology/approach This study aims to analyze the trend of JIS by compounding various methods, named as TLS mining analysis. TLS mining analysis consists of a series of analysis including Term Frequency-Inverse Document Frequency (TF-IDF) weight model, Latent Dirichlet Allocation (LDA) topic modeling, and a text mining with Semantic Network Analysis. Firstly, keywords are extracted from the research data using the TF-IDF weight model, and after that, topic modeling is performed using the Latent Dirichlet Allocation (LDA) algorithm to identify issue keywords. Findings The current study used the summery service of the published research paper provided by Korea Citation Index to analyze JIS. 714 papers that were published from 2002 to 2012 were divided into two periods: 2002-2011 and 2012-2021. In the first period (2002-2011), the research trend in the information system field had focused on E-business strategies as most of the companies adopted online business models. In the second period (2012-2021), data-based information technology and new industrial revolution technologies such as artificial intelligence, SNS, and mobile had been the main research issues in the information system field. In addition, keywords for improving the JIS citation index were presented.