• Title/Summary/Keyword: 뉴럴 임베딩 모델

Search Result 3, Processing Time 0.015 seconds

Automatic Classification and Vocabulary Analysis of Political Bias in News Articles by Using Subword Tokenization (부분 단어 토큰화 기법을 이용한 뉴스 기사 정치적 편향성 자동 분류 및 어휘 분석)

  • Cho, Dan Bi;Lee, Hyun Young;Jung, Won Sup;Kang, Seung Shik
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2021
  • In the political field of news articles, there are polarized and biased characteristics such as conservative and liberal, which is called political bias. We constructed keyword-based dataset to classify bias of news articles. Most embedding researches represent a sentence with sequence of morphemes. In our work, we expect that the number of unknown tokens will be reduced if the sentences are constituted by subwords that are segmented by the language model. We propose a document embedding model with subword tokenization and apply this model to SVM and feedforward neural network structure to classify the political bias. As a result of comparing the performance of the document embedding model with morphological analysis, the document embedding model with subwords showed the highest accuracy at 78.22%. It was confirmed that the number of unknown tokens was reduced by subword tokenization. Using the best performance embedding model in our bias classification task, we extract the keywords based on politicians. The bias of keywords was verified by the average similarity with the vector of politicians from each political tendency.

Improved Sensor Filtering Method for Sensor Registry System (센서 레지스트리 시스템을 위한 개선된 센서 필터링 기법)

  • Chen, Haotian;Jung, Hyunjun;Lee, Sukhoon;On, Byung-Won;Jeong, Dongwon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.1
    • /
    • pp.7-14
    • /
    • 2022
  • Sensor Registry System (SRS) has been devised for maintaining semantic interoperability of data on heterogeneous sensor networks. SRS measures the connectability of the mobile device to ambient sensors based on positions and only provides metadata of sensors that may be successfully connected. The step of identifying the ambient sensors which can be successfully connected is called sensor filtering. Improving the performance of sensor filtering is one of the core issues of SRS research. In reality, GPS sometimes shows the wrong position and thus leads to failed sensor filtering. Therefore, this paper proposes a new sensor filtering strategy using geographical embedding and neural network-based path prediction. This paper also evaluates the service provision rate with the Monte Carlo approach. The empirical study shows that the proposed method can compensate for position abnormalities and is an effective model for sensor filtering in SRS.

Antibiotics-Resistant Bacteria Infection Prediction Based on Deep Learning (딥러닝 기반 항생제 내성균 감염 예측)

  • Oh, Sung-Woo;Lee, Hankil;Shin, Ji-Yeon;Lee, Jung-Hoon
    • The Journal of Society for e-Business Studies
    • /
    • v.24 no.1
    • /
    • pp.105-120
    • /
    • 2019
  • The World Health Organization (WHO) and other government agencies aroundthe world have warned against antibiotic-resistant bacteria due to abuse of antibiotics and are strengthening their care and monitoring to prevent infection. However, it is highly necessary to develop an expeditious and accurate prediction and estimating method for preemptive measures. Because it takes several days to cultivate the infecting bacteria to identify the infection, quarantine and contact are not effective to prevent spread of infection. In this study, the disease diagnosis and antibiotic prescriptions included in Electronic Health Records were embedded through neural embedding model and matrix factorization, and deep learning based classification predictive model was proposed. The f1-score of the deep learning model increased from 0.525 to 0.617when embedding information on disease and antibiotics, which are the main causes of antibiotic resistance, added to the patient's basic information and hospital use information. And deep learning model outperformed the traditional machine hospital use information. And deep learning model outperformed the traditional machine learning models.As a result of analyzing the characteristics of antibiotic resistant patients, resistant patients were more likely to use antibiotics in J01 than nonresistant patients who were diagnosed with the same diseases and were prescribed 6.3 times more than DDD.