• Title/Abstract/Keywords: mBERT

Search results: 64 items (processing time: 0.022 s)

A Study on Identifying Personal Information on Conversational Text Data (대화형 텍스트 데이터 내 개인정보 식별에 대한 연구)

  • 차도현;권보근;윤희창;이구협;주종화
    • Korea Information Processing Society: Conference Proceedings / Korea Information Processing Society 2022 Fall Conference / pp.11-13 / 2022
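  • With the three data laws (데이터 3법) leading the way, companies now need to de-identify data that contains personal information before they can use it. Existing approaches fall short: identifying personal information in unstructured text with regular expressions has clear limits because the data is so varied, and the conventional Named Entity Recognition (NER) task cannot handle ambiguous expressions or determine whose personal information appears in a two-person conversation. To overcome these limitations, we propose training a BERT language model with speaker information and labeling each eojeol (space-delimited word unit) with two tags, in an attempt to identify personal information accurately.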
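
The two-tags-per-word idea in the abstract above can be illustrated with a minimal sketch. The abstract does not say what the two tags are, so everything here is an assumption for illustration only: the `TaggedWord` class, the BIO-style `entity_tag`, the `SELF`/`OTHER` values of `owner_tag`, and the speaker IDs `A` and `B` are all hypothetical, and no BERT model is involved.

```python
from dataclasses import dataclass


@dataclass
class TaggedWord:
    speaker: str     # who uttered the word (hypothetical speaker IDs "A"/"B")
    word: str        # one space-delimited word unit
    entity_tag: str  # hypothetical BIO tag for the kind of personal information
    owner_tag: str   # hypothetical tag: "SELF", "OTHER", or "O" (no owner)


def resolve_owner(item, speakers=("A", "B")):
    """Return the speaker ID that the personal information belongs to, or None."""
    if item.entity_tag == "O":
        return None
    if item.owner_tag == "SELF":
        return item.speaker
    # In a two-person dialogue, "OTHER" can only mean the other participant.
    return speakers[1] if item.speaker == speakers[0] else speakers[0]


# Assumed example: A mentions B's e-mail address, B states their own phone number.
dialogue = [
    TaggedWord("A", "kim@example.com", "B-EMAIL", "OTHER"),
    TaggedWord("B", "010-0000-0000", "B-PHONE", "SELF"),
    TaggedWord("A", "thanks", "O", "O"),
]

owners = [resolve_owner(w) for w in dialogue]
```

With a single entity tag per word, the first two items could be recognized as personal information but not attributed to a person; the second tag is what lets both be attributed to speaker `B` in this sketch.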

Aspect-based Sentiment Analysis of Product Reviews using Multi-agent Deep Reinforcement Learning

  • M. Sivakumar;Srinivasulu Reddy Uyyala
    • Asia Pacific Journal of Information Systems / Vol. 32, No. 2 / pp.226-248 / 2022
  • Existing models for sentiment analysis of product reviews learn from past data and label new data based on that training, but they never use the new data when making a decision. The proposed Aspect-based multi-agent Deep Reinforcement learning Sentiment Analysis (ADRSA) model learns from its very first data without the help of any training dataset and labels each sentence with an aspect category and a sentiment polarity. It keeps learning from new data and updates its knowledge to improve its intelligence, so its decisions change over time as new data arrives. As a result, the accuracy of sentiment analysis using deep reinforcement learning improved over supervised and unsupervised learning methods. Hence, the sentiments of premium customers on a particular site can be shared with other customers effectively. A dynamic environment with a strong knowledge base helps the system remember sentences, and using the State-Action-Reward-State-Action (SARSA) algorithm with the Bidirectional Encoder Representations from Transformers (BERT) model improved the accuracy of the proposed system compared with state-of-the-art methods.
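
For readers unfamiliar with SARSA, the standard tabular update rule is Q(s, a) ← Q(s, a) + α [r + γ Q(s', a') − Q(s, a)]. The sketch below shows only that textbook rule. It is not the ADRSA implementation: the state and action names, the reward, and the values of `alpha` and `gamma` are assumptions, and the abstract does not describe how BERT features or multiple agents are wired in.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.1, gamma=0.9):
    """Apply one on-policy SARSA update to the Q-table `q` in place."""
    # The target uses the action actually chosen in the next state (on-policy).
    td_target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical episode step: labeling a sentence as positive earned reward 1.0.
q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
value = sarsa_update(q, "sentence_0", "label_positive", 1.0,
                     "sentence_1", "label_negative")
```

Because the update uses the next action the agent really takes, rather than the best available one, the learned values track the policy being followed, which fits a system that keeps adjusting its decisions as new data arrives.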

Structural monitoring of wind turbines using wireless sensor networks

  • Swartz, R. Andrew;Lynch, Jerome P.;Zerbst, Stephan;Sweetman, Bert;Rolfes, Raimund
    • Smart Structures and Systems / Vol. 6, No. 3 / pp.183-196 / 2010
  • Monitoring and economical design of alternative energy generators such as wind turbines is becoming increasingly critical; however, acquisition of the dynamic output data can be a time-consuming and costly process. In recent years, low-cost wireless sensors have emerged as an enabling technology for structural monitoring applications. In this study, wireless sensor networks are installed in three operational turbines in order to demonstrate their efficacy in this unique operational environment. The objectives of the first installation are to verify that vibrational (acceleration) data can be collected and transmitted within a turbine tower and that it is comparable to data collected using a traditional tethered system. In the second instrumentation, the wireless network includes strain gauges at the base of the structure. Also, data is collected regarding the performance of the wireless communication channels within the tower. In both turbines, collected wireless sensor data is used for off-line, output-only modal analysis of the ambiently (wind) excited turbine towers. The final installation is on a turbine with embedded braking capabilities within the nacelle to generate an "impulse-like" load at the top of the tower. This ability to apply such a load improves the modal analysis results obtained in cases where ambient excitation fails to be sufficiently broad-band or white. The improved loading allows for computation of true mode shapes, a necessary precursor to many conditional monitoring techniques.

An Application of RASA Technology to Design an AI Virtual Assistant: A Case of Learning Finance and Banking Terms in Vietnamese

  • PHAM, Thi My Ni;PHAM, Thi Ngoc Thao;NGUYEN, Ha Phuong Truc;LY, Bao Tuyen;NGUYEN, Truc Linh;LE, Hoanh Su
    • The Journal of Asian Finance, Economics and Business / Vol. 9, No. 5 / pp.273-283 / 2022
  • Banking and finance is a broad term that covers a variety of smaller, more specialized subjects such as corporate finance, tax finance, and insurance finance. A virtual assistant that helps users search for information about banking and finance terms could be an extremely beneficial tool. In this study, we explored the process of searching for information, seeking opportunities, and developing a virtual assistant in the first stages of learning and understanding Vietnamese, to increase effectiveness and save time, which is also an innovative business practice in the Vietnam use case. We built the FIBA2020 dataset and proposed a pipeline that uses Natural Language Processing (NLP), including Natural Language Understanding (NLU) algorithms, to build chatbot applications. The open-source framework RASA is used to implement the system. We aim to improve model performance by replacing parts of RASA's default tokenizers with Vietnamese tokenizers and experimenting with various language models. The best accuracy we achieved was 86.48% under the ideal condition and 70.04% under the worst condition. Finally, we put our findings into practice by creating an Android virtual assistant application using the model trained with the Whitespace tokenizer and the pre-trained language model m-BERT.