• Title/Summary/Keyword: 언어지능

Search Result 797, Processing Time 0.029 seconds

Analysis of the Status of Natural Language Processing Technology Based on Deep Learning (딥러닝 중심의 자연어 처리 기술 현황 분석)

  • Park, Sang-Un
    • The Journal of Bigdata
    • /
    • v.6 no.1
    • /
    • pp.63-81
    • /
    • 2021
  • The performance of natural language processing is rapidly improving due to the recent development and application of machine learning and deep learning technologies, and as a result, the field of application is expanding. In particular, as the demand for analysis on unstructured text data increases, interest in NLP(Natural Language Processing) is also increasing. However, due to the complexity and difficulty of the natural language preprocessing process and machine learning and deep learning theories, there are still high barriers to the use of natural language processing. In this paper, for an overall understanding of NLP, by examining the main fields of NLP that are currently being actively researched and the current state of major technologies centered on machine learning and deep learning, We want to provide a foundation to understand and utilize NLP more easily. Therefore, we investigated the change of NLP in AI(artificial intelligence) through the changes of the taxonomy of AI technology. The main areas of NLP which consists of language model, text classification, text generation, document summarization, question answering and machine translation were explained with state of the art deep learning models. In addition, major deep learning models utilized in NLP were explained, and data sets and evaluation measures for performance evaluation were summarized. We hope researchers who want to utilize NLP for various purposes in their field be able to understand the overall technical status and the main technologies of NLP through this paper.

Automatic Classification and Vocabulary Analysis of Political Bias in News Articles by Using Subword Tokenization (부분 단어 토큰화 기법을 이용한 뉴스 기사 정치적 편향성 자동 분류 및 어휘 분석)

  • Cho, Dan Bi;Lee, Hyun Young;Jung, Won Sup;Kang, Seung Shik
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2021
  • In the political field of news articles, there are polarized and biased characteristics such as conservative and liberal, which is called political bias. We constructed keyword-based dataset to classify bias of news articles. Most embedding researches represent a sentence with sequence of morphemes. In our work, we expect that the number of unknown tokens will be reduced if the sentences are constituted by subwords that are segmented by the language model. We propose a document embedding model with subword tokenization and apply this model to SVM and feedforward neural network structure to classify the political bias. As a result of comparing the performance of the document embedding model with morphological analysis, the document embedding model with subwords showed the highest accuracy at 78.22%. It was confirmed that the number of unknown tokens was reduced by subword tokenization. Using the best performance embedding model in our bias classification task, we extract the keywords based on politicians. The bias of keywords was verified by the average similarity with the vector of politicians from each political tendency.

Multifaceted Evaluation Methodology for AI Interview Candidates - Integration of Facial Recognition, Voice Analysis, and Natural Language Processing (AI면접 대상자에 대한 다면적 평가방법론 -얼굴인식, 음성분석, 자연어처리 영역의 융합)

  • Hyunwook Ji;Sangjin Lee;Seongmin Mun;Jaeyeol Lee;Dongeun Lee;kyusang Lim
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2024.01a
    • /
    • pp.55-58
    • /
    • 2024
  • 최근 각 기업의 AI 면접시스템 도입이 증가하고 있으며, AI 면접에 대한 실효성 논란 또한 많은 상황이다. 본 논문에서는 AI 면접 과정에서 지원자를 평가하는 방식을 시각, 음성, 자연어처리 3영역에서 구현함으로써, 면접 지원자를 다방면으로 분석 방법론의 적절성에 대해 평가하고자 한다. 첫째, 시각적 측면에서, 면접 지원자의 감정을 인식하기 위해, 합성곱 신경망(CNN) 기법을 활용해, 지원자 얼굴에서 6가지 감정을 인식했으며, 지원자가 카메라를 응시하고 있는지를 시계열로 도출하였다. 이를 통해 지원자가 면접에 임하는 태도와 특히 얼굴에서 드러나는 감정을 분석하는 데 주력했다. 둘째, 시각적 효과만으로 면접자의 태도를 파악하는 데 한계가 있기 때문에, 지원자 음성을 주파수로 환산해 특성을 추출하고, Bidirectional LSTM을 활용해 훈련해 지원자 음성에 따른 6가지 감정을 추출했다. 셋째, 지원자의 발언 내용과 관련해 맥락적 의미를 파악해 지원자의 상태를 파악하기 위해, 음성을 STT(Speech-to-Text) 기법을 이용하여 텍스트로 변환하고, 사용 단어의 빈도를 분석하여 지원자의 언어 습관을 파악했다. 이와 함께, 지원자의 발언 내용에 대한 감정 분석을 위해 KoBERT 모델을 적용했으며, 지원자의 성격, 태도, 직무에 대한 이해도를 파악하기 위해 객관적인 평가지표를 제작하여 적용했다. 논문의 분석 결과 AI 면접의 다면적 평가시스템의 적절성과 관련해, 시각화 부분에서는 상당 부분 정확도가 객관적으로 입증되었다고 판단된다. 음성에서 감정분석 분야는 면접자가 제한된 시간에 모든 유형의 감정을 드러내지 않고, 또 유사한 톤의 말이 진행되다 보니 특정 감정을 나타내는 주파수가 다소 집중되는 현상이 나타났다. 마지막으로 자연어처리 영역은 면접자의 발언에서 나오는 말투, 특정 단어의 빈도수를 넘어, 전체적인 맥락과 느낌을 이해할 수 있는 자연어처리 분석모델의 필요성이 더욱 커졌음을 판단했다.

  • PDF

Analysis of Users' Sentiments and Needs for ChatGPT through Social Media on Reddit (Reddit 소셜미디어를 활용한 ChatGPT에 대한 사용자의 감정 및 요구 분석)

  • Hye-In Na;Byeong-Hee Lee
    • Journal of Internet Computing and Services
    • /
    • v.25 no.2
    • /
    • pp.79-92
    • /
    • 2024
  • ChatGPT, as a representative chatbot leveraging generative artificial intelligence technology, is used valuable not only in scientific and technological domains but also across diverse sectors such as society, economy, industry, and culture. This study conducts an explorative analysis of user sentiments and needs for ChatGPT by examining global social media discourse on Reddit. We collected 10,796 comments on Reddit from December 2022 to August 2023 and then employed keyword analysis, sentiment analysis, and need-mining-based topic modeling to derive insights. The analysis reveals several key findings. The most frequently mentioned term in ChatGPT-related comments is "time," indicative of users' emphasis on prompt responses, time efficiency, and enhanced productivity. Users express sentiments of trust and anticipation in ChatGPT, yet simultaneously articulate concerns and frustrations regarding its societal impact, including fears and anger. In addition, the topic modeling analysis identifies 14 topics, shedding light on potential user needs. Notably, users exhibit a keen interest in the educational applications of ChatGPT and its societal implications. Moreover, our investigation uncovers various user-driven topics related to ChatGPT, encompassing language models, jobs, information retrieval, healthcare applications, services, gaming, regulations, energy, and ethical concerns. In conclusion, this analysis provides insights into user perspectives, emphasizing the significance of understanding and addressing user needs. The identified application directions offer valuable guidance for enhancing existing products and services or planning the development of new service platforms.

Case Report on NTBC Treatment of Type 1 Tyrosinemia Diagnosed through Newborn Screening (신생아 선별검사를 통해 진단된 1형 타이로신혈증의 NTBC 치료 사례 보고)

  • Ji Eun Jeong;Hwa Young Kim;Jung Min Ko
    • Journal of The Korean Society of Inherited Metabolic disease
    • /
    • v.23 no.2
    • /
    • pp.39-44
    • /
    • 2023
  • Hereditary tyrosinemia type 1 (HT-1) is a metabolic disorder caused by biallelic pathogenic variants in the fumarylacetoacetate hydrolase (FAH) gene, which impairs the function of the FAH enzyme, resulting in the accumulation of tyrosine's toxic metabolites in hepatocytes and renal tubular cells. As a consequence, individuals with HT-1 exhibit symptomatic manifestations. Rapid diagnosis and treatment of HT-1 can prevent short-term death and long-term complications. A 15-day-old boy presented to the outpatient department with elevated levels of tyrosine on his newborn screening tests conducted at the age of 3 and 10 days, respectively. Further blood tests revealed increased levels of alpha-fetoprotein and amino acids including tyrosine and threonine. Urine organic acid tests indicated a significant elevation in tyrosine metabolites, as well as the presence of succinylacetone (SA), which led to the diagnosis of HT-1. Two pathogenic and likely pathogenic variants of FAH compatible with HT-1 were also detected. He began a tyrosine-restricted diet at one month old and received nitisinone (NTBC) at two months old. With continued treatment, the patient's initially elevated AFP level, detection of SA in the urine, and mild hepatomegaly showed improvement. During four years and seven months of treatment, there were no exceptional complications apart from an increase in tyrosine levels and a delay in speech. We report a case of tyrosinemia type 1 detected through newborn screening, treated with dietary restriction and NTBC, with a good prognosis.

  • PDF

Analysis of the scholastic capability of ChatGPT utilizing the Korean College Scholastic Ability Test (대학입시 수능시험을 평가 도구로 적용한 ChatGPT의 학업 능력 분석)

  • WEN HUILIN;Kim Jinhyuk;Han Kyonghee;Kim Shiho
    • Journal of Platform Technology
    • /
    • v.11 no.5
    • /
    • pp.72-83
    • /
    • 2023
  • ChatGPT, commercial launch in late 2022, has shown successful results in various professional exams, including US Bar Exam and the United States Medical Licensing Exam (USMLE), demonstrating its ability to pass qualifying exams in professional domains. However, further experimentation and analysis are required to assess ChatGPT's scholastic capability, such as logical inference and problem-solving skills. This study evaluated ChatGPT's scholastic performance utilizing the Korean College Scholastic Ability Test (KCSAT) subjects, including Korean, English, and Mathematics. The experimental results revealed that ChatGPT achieved a relatively high accuracy rate of 69% in the English exam but relatively lower rates of 34% and 19% in the Korean Language and Mathematics domains, respectively. Through analyzing the results of the Korean language exam, English exams, and TOPIK II, we evaluated ChatGPT's strengths and weaknesses in comprehension and logical inference abilities. Although ChatGPT, as a generative language model, can understand and respond to general Korean, English, and Mathematics problems, it is considered weak in tasks involving higher-level logical inference and complex mathematical problem-solving. This study might provide simple yet accurate and effective evaluation criteria for generative artificial intelligence performance assessment through the analysis of KCSAT scores.

  • PDF

Exploring the Effects of Passive Haptic Factors When Interacting with a Virtual Pet in Immersive VR Environment (몰입형 VR 환경에서 가상 반려동물과 상호작용에 관한 패시브 햅틱 요소의 영향 분석)

  • Donggeun KIM;Dongsik Jo
    • Journal of the Korea Computer Graphics Society
    • /
    • v.30 no.3
    • /
    • pp.125-132
    • /
    • 2024
  • Recently, with immersive virtual reality(IVR) technologies, various services such as education, training, entertainment, industry, healthcare and remote collaboration have been applied. In particular, researches are actively being studied to visualize and interact with virtual humans, research on virtual pets in IVR is also emerging. For interaction with the virtual pet, similar to real-world interaction scenarios, the most important thing is to provide physical contact such as haptic and non-verbal interaction(e.g., gesture). This paper investigates the effects on factors (e.g., shape and texture) of passive haptic feedbacks using mapping physical props corresponding to the virtual pet. Experimental results show significant differences in terms of immersion, co-presence, realism, and friendliness depending on the levels of texture elements when interacting with virtual pets by passive haptic feedback. Additionally, as the main findings of this study by statistical interaction between two variables, we found that there was Uncanny valley effect in terms of friendliness. With our results, we will expect to be able to provide guidelines for creating interactive contents with the virtual pet in immersive VR environments.

THE STUDY ON RELATIONSHIP BETWEEN PSYCHOPATHOLOGY AND NEUROLOGICAL FACTORS IN CHRONIC EPILEPTIC CHILDREN (경련 질환 환아의 정신병리와 신경학적 요인과의 관계에 대한 연구)

  • Kim, Bung-Nyun;Cho, Soo-Churl;Hwang, Yong-Seung
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.7 no.1
    • /
    • pp.92-109
    • /
    • 1996
  • The objectives of the present study were to provide comprehensive assessment of the impact of epilepsy on the psychological well-being of children with epilepsy and to identify the neurological factors associated with the psychopathology. The participant patients were recruited from the population of children and adolescent aged 7 to 16 attending the OPD of department of pediatric neurology in Seoul National University Hospital in Korea. We exclude mental retardation, pervasive developmental disorder and brain organic pathology. As control group, formal students were chosen and their sex, age, achievement, socioeconomic status were matched to patients. The first author interviewed the children and their family members and obtained the developmental history and family information. We used the following 10 scales for assessing psychological and behavioral problems in patients and their family member. The scales were standardized and their validity and reliability were confirmed before. Parent rating scales : Yale children's inventory, Disruptive behavior disorder scale, Parent's attitude to epilepsy questionnaire, Family environment scale, Symptom check-list-90 revision, Children behavior check-list. Children's self rating scales : Children's depression inventory, Spielberger's state-trait anxiety anxiety, Piers-Harris self-concept inventory and Self-administered Dependency questionnaire for Mother. The result showed the risk factors associated depression were early onset, complex partial seizure, lateralized temporal focal abnormality on EEG, Drug polypharmacy, high seizure frequency and sick factors associated anxiety were old age of patient, lateralized temporal focal abnormality EEG, Drug polypharmacy, high seizure frequency. Also the result of this present study indicated that risk factors associated oppositional defiant disorder, conduct disorder and attention deficit hyperactivity disorder were young age, male, early onset, lateral temporal EEG abnormality and high seizure frequency. According to these results, common risk factors associated psychological and behavioral problems were lateralized EEG temporal abnormality, high seizure frequency in neurological factors.

  • PDF

Comparison of Deep Learning Frameworks: About Theano, Tensorflow, and Cognitive Toolkit (딥러닝 프레임워크의 비교: 티아노, 텐서플로, CNTK를 중심으로)

  • Chung, Yeojin;Ahn, SungMahn;Yang, Jiheon;Lee, Jaejoon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.1-17
    • /
    • 2017
  • The deep learning framework is software designed to help develop deep learning models. Some of its important functions include "automatic differentiation" and "utilization of GPU". The list of popular deep learning framework includes Caffe (BVLC) and Theano (University of Montreal). And recently, Microsoft's deep learning framework, Microsoft Cognitive Toolkit, was released as open-source license, following Google's Tensorflow a year earlier. The early deep learning frameworks have been developed mainly for research at universities. Beginning with the inception of Tensorflow, however, it seems that companies such as Microsoft and Facebook have started to join the competition of framework development. Given the trend, Google and other companies are expected to continue investing in the deep learning framework to bring forward the initiative in the artificial intelligence business. From this point of view, we think it is a good time to compare some of deep learning frameworks. So we compare three deep learning frameworks which can be used as a Python library. Those are Google's Tensorflow, Microsoft's CNTK, and Theano which is sort of a predecessor of the preceding two. The most common and important function of deep learning frameworks is the ability to perform automatic differentiation. Basically all the mathematical expressions of deep learning models can be represented as computational graphs, which consist of nodes and edges. Partial derivatives on each edge of a computational graph can then be obtained. With the partial derivatives, we can let software compute differentiation of any node with respect to any variable by utilizing chain rule of Calculus. First of all, the convenience of coding is in the order of CNTK, Tensorflow, and Theano. The criterion is simply based on the lengths of the codes and the learning curve and the ease of coding are not the main concern. According to the criteria, Theano was the most difficult to implement with, and CNTK and Tensorflow were somewhat easier. With Tensorflow, we need to define weight variables and biases explicitly. The reason that CNTK and Tensorflow are easier to implement with is that those frameworks provide us with more abstraction than Theano. We, however, need to mention that low-level coding is not always bad. It gives us flexibility of coding. With the low-level coding such as in Theano, we can implement and test any new deep learning models or any new search methods that we can think of. The assessment of the execution speed of each framework is that there is not meaningful difference. According to the experiment, execution speeds of Theano and Tensorflow are very similar, although the experiment was limited to a CNN model. In the case of CNTK, the experimental environment was not maintained as the same. The code written in CNTK has to be run in PC environment without GPU where codes execute as much as 50 times slower than with GPU. But we concluded that the difference of execution speed was within the range of variation caused by the different hardware setup. In this study, we compared three types of deep learning framework: Theano, Tensorflow, and CNTK. According to Wikipedia, there are 12 available deep learning frameworks. And 15 different attributes differentiate each framework. Some of the important attributes would include interface language (Python, C ++, Java, etc.) and the availability of libraries on various deep learning models such as CNN, RNN, DBN, and etc. And if a user implements a large scale deep learning model, it will also be important to support multiple GPU or multiple servers. Also, if you are learning the deep learning model, it would also be important if there are enough examples and references.

A COMPARATIVE STUDY ON AUDITORY ATTENTION AND PHONEME DIFFERENTIAL ABILITY AMONG CHILDREN WITH READING DISABILITY AND WITH ATTENTION DEFICIT/HYPERACTIVITY (읽기 장애와 주의력 결핍/과잉 운동 장애아동의 주의력 과제와 음소 변별 과제 수행 비교 - 청각 과제를 중심으로 -)

  • Lee, Kyung-Hee;Shin, Min-Sup;Kim, Boong-Nyun;Cho, Soo-Churl
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.14 no.2
    • /
    • pp.197-208
    • /
    • 2003
  • Objective:In this study, we hypothesized that deficit in processing rapid linguistic stimuli is at the heart of Reading Disability(RD) and deficit in response inhibition is at the heart of Attention Deficit/Hyperactivity(ADHD). We conducted experiments to identify the core cognitive characteristics of children either with RD or with ADHD or with both, using attentional tasks and phoneme differential tests. Method:In the study 1, 28 children with ADHD, 16 children with RD+ADHD were individually administered visual/auditory performance tests. Then, the differences of performance on attentional tasks between two groups were compared while IQs of two groups were controlled. In the study 2, 13 children with RD+ADHD/RD, 13 children with ADHD, and 13 normal children were administered computerized phoneme differential tests. Result:Visual attentional tasks did not distinguish an ADHD group from a RD+ADHD group. With auditory attentional tasks, however, the comorbid group showed significantly more difficulties, causing a large variance in reaction time. RD, RD+ADHD, and ADHD groups showed more errors in phoneme differential tests than a normal control group, and each group showed distinctive performance patterns. Discussion:An ADHD group had difficulty in response inhibition and sustained attention, and children who also had RD along with ADHD magnified the auditory attentional difficulties. Even though children with RD had more trouble with responding correctly to target stimuli, their responses were not significantly different from those of children with ADHD.

  • PDF