• Title/Summary/Keyword: Sentence Analysis

Search Result 497, Processing Time 0.03 seconds

Fake News Detection Using CNN-based Sentiment Change Patterns (CNN 기반 감성 변화 패턴을 이용한 가짜뉴스 탐지)

  • Tae Won Lee;Ji Su Park;Jin Gon Shon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.4
    • /
    • pp.179-188
    • /
    • 2023
  • Recently, fake news disguises the form of news content and appears whenever important events occur, causing social confusion. Accordingly, artificial intelligence technology is used as a research to detect fake news. Fake news detection approaches such as automatically recognizing and blocking fake news through natural language processing or detecting social media influencer accounts that spread false information by combining with network causal inference could be implemented through deep learning. However, fake news detection is classified as a difficult problem to solve among many natural language processing fields. Due to the variety of forms and expressions of fake news, the difficulty of feature extraction is high, and there are various limitations, such as that one feature may have different meanings depending on the category to which the news belongs. In this paper, emotional change patterns are presented as an additional identification criterion for detecting fake news. We propose a model with improved performance by applying a convolutional neural network to a fake news data set to perform analysis based on content characteristics and additionally analyze emotional change patterns. Sentimental polarity is calculated for the sentences constituting the news and the result value dependent on the sentence order can be obtained by applying long-term and short-term memory. This is defined as a pattern of emotional change and combined with the content characteristics of news to be used as an independent variable in the proposed model for fake news detection. We train the proposed model and comparison model by deep learning and conduct an experiment using a fake news data set to confirm that emotion change patterns can improve fake news detection performance.

Aspects of Korean rhythm realization by second language learners: Focusing on Chinese learners of Korean (제 2언어 학습자의 한국어 리듬 실현양상 -중국인 한국어 학습자를 중심으로-)

  • Youngsook Yune
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.27-35
    • /
    • 2023
  • This study aimed to investigate the effect of Chinese on the production of Korean rhythm. Korean and Chinese are typologically classified into different rhythmic categories; because of this, the phonological properties of Korean and Chinese are similar and different at the same time. As a result, Chinese can exert both positive and negative influences on the realization of Korean rhythm. To investigate the influence of the rhythm of the native language of L2 learners on their target language, we conducted an acoustic analysis using acoustic metrics like of the speech of 5 Korean native speakers and 10 advanced Chinese Korean learners. The analyzed material is a short paragraph of five sentences containing a variety of syllable structures. The results showed that KS and CS rhythms are similar in %V, VarcoV, and nPVI_S. However, CS, unlike KS, showed characteristics closer to those of a stress-timed language in the values of %V and VarcoV. There was also a significant difference in nPVI_V values. These results demonstrate a negative influence of the native language in the realization of Korean rhythm. This can be attributed to the fact that all vowels in Chinese sentence are not pronounced with the same emphasis due to neutral tone. In this sense, this study allowed us to observe influences of L1 on L2 production of rhythm.

A Foundational Study on Developing a Structural Model for AI-based Sentencing Prediciton Based on Violent Crime Judgment (인공지능기술 적용을 위한 강력범죄 판결문 기반 양형 예측 구조모델 개발 기초 연구)

  • Woongil Park;Eunbi Cho;Jeong-Hyeon Chang;Joo-chang Kim
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.91-98
    • /
    • 2024
  • With the advancement of ICT (Information and Communication Technology), searching for judgments through the internet has become increasingly convenient. However, predicting sentencing based on judgments remains a challenging task for individuals. This is because sentencing involves a complex process of applying aggravating and mitigating factors within the framework of legal provisions, and it often depends on the subjective judgment of the judge. Therefore, this research aimed to develop a model for predicting sentencing using artificial intelligence by focusing on structuring the data from judgments, making it suitable for AI applications. Through theoretical and statistical analysis of previous studies, we identified variables with high explanatory power for predicting sentencing. Additionally, by analyzing 50 legal judgments related to serious crimes that are publicly available, we presented a framework for extracting essential information from judgments. This framework encompasses basic case information, sentencing details, reasons for sentencing, the reasons for the determination of the sentence, as well as information about offenders, victims, and accomplices evident within the specific content of the judgments. This research is expected to contribute to the development of artificial intelligence technologies in the field of law in the future.

Characteristics of scenario text reading fluency in middle school students with poor reading skills (중학교 읽기부진 학생의 시나리오 글 읽기 유창성 특성)

  • Jihye Park;Cheoljae Seong
    • Phonetics and Speech Sciences
    • /
    • v.16 no.3
    • /
    • pp.39-48
    • /
    • 2024
  • Reading fluency refers to the ability to read sentences or paragraphs accurately, quickly, and with appropriate prosodic expression. Most reading fluency assessments exclude expressive ability because it is difficult to objectively measure. Therefore, in this study, we examined all elements of reading fluency by analyzing prosodic characteristics of reading scenario texts to maximize expressive reading. The subjects were 30 male students in the first and second grades of middle school (15 normal and 15 poor readers). To analyze the accuracy aspect, error types at the syllable level were analyzed for each group, and related acoustic variables were measured and examined in terms of prosodic aspects. The reading accuracy analysis showed that the poor reading group had a higher error rate than the normal. In terms of error types, the normal group showed the order of 'substitution>omission>correction>insertion>repetition', whereas the poor reading group was in the order of 'correction>substitution>repetition/insertion>omission'. For the speech tempo, the dyslexic students were slower than the typical students for all sentence types. The prosodic variables also showed a high frequency of accentual phrases (AP) and intonation phrases (IP) in sentences along with a wide intensity range.

An Analysis of the High School 'Common Science' Contents and Textbooks (고등학교 ‘공통과학’의 교과내용 및 교과서 분석)

  • Lee, Gwang-Ho;Choi, Jong-Bum;Park, Moon-Kook;Cho, Kyu-Seong
    • Journal of the Korean earth science society
    • /
    • v.18 no.6
    • /
    • pp.453-463
    • /
    • 1997
  • The contents of high school 'Common science' textbooks was analyzed qualitatively and quantitatively. Seven common science textbooks were selected and its contents, structure, inquiry, activities, appendix and its characteristics were investigated, and analyzed using the Goal Clusters of Project Synthesis and Romey's indices of text evaluation were calculated. The contents of each unit are not much different among textbooks because they are written according to the curriculum ordinance and textbook guidelines of the Ministry of Education. The textbooks was consist of $471{\sim}519$ pages. It was distribute similarly among the chapter of 'materials', 'forces', lives' and 'earth'. The chapter of 'energy' and 'environment' was treat significantly. The contents and structure of common science is a mere physical consolidation. I make an alternative plan that a topic form. Inquiry activities used in the textbooks are 11 type, however most of that is interpretation of data, experiment, survey and discussion. Ninety six percents of the experiment, belong to the 1st level, four percents of that belong to the 2nd level of the Schwab's inquiry level and there are no activities of the 3rd level. Little attention is given to Goal Cluster I, II, IV in the common science textbooks currently employed. Its content should be broadened to include all Goal Clusters of Project Synthesis. Homey's indices representing the degrees of student involvement. are $0.57{\sim}1.14$ for sentence analysis, $0.60{\sim}1.67$ for figure and diagram analysis, $0.67{\sim}1.50$ for analysis of questions at chapter ends, respectively, student activity per page investigated being $0.6{\sim}0.9$. But chapter summaries cease to repeats the conclusions of the chapter also it be rather formally and inattentively written.

  • PDF

The characteristics of sentence reading intonations in North Korean defectors based on pitch range and an auditory-perceptual rating scale (북한이탈주민의 문장 읽기 억양 특성-음도범위와 청지각적 평가를 중심으로)

  • Kim, Damee;Kim, Shinhee;Kim, Jiseong;An, Eunsol;Cho, Yongyun;Yang, Yoonhee;Yim, Dongsun
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.9-21
    • /
    • 2019
  • This study aimed to compare the prosodic characteristics of North Korean defectors and South Koreans in three types of sentences (declarative, interrogative, and negative) in two reading tasks (short and dialogue) through acoustic analysis and auditory-perceptual evaluation. In addition, this study examined the relationship between the auditory-perceptual evaluation scores and self-assessment questionnaires on intonation for North Korean defectors. The participants were 15 North Korean defectors and 15 Korean speakers with standard Seoul accents. For statistical analysis, three-way mixed ANOVA and multivariate analysis were performed within the three types of sentences in the reading tasks through acoustic analysis and the Mann-Whitney U Test for auditory-perceptual evaluation. Pearson's product-moment correlation coefficients were also used to identify the correlations between the results of the self-assessment questionnaire on intonation and the auditory-perceptual evaluation. The North Korean defectors were found to have a significantly lower pitch range and auditory-perceptual evaluation score than South Koreans in reading tasks. Moreover, there was a significant correlation between their auditory-perceptual evaluations and self-assessment questionnaires on intonation. The study findings suggest that North Korean defectors, who face many challenges with intonation, showed a tendency to think that their intonation differed from the standard Korean intonation and showed better auditory evaluation results for interrogative sentences.

Multi-Dimensional Analysis Method of Product Reviews for Market Insight (마켓 인사이트를 위한 상품 리뷰의 다차원 분석 방안)

  • Park, Jeong Hyun;Lee, Seo Ho;Lim, Gyu Jin;Yeo, Un Yeong;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.57-78
    • /
    • 2020
  • With the development of the Internet, consumers have had an opportunity to check product information easily through E-Commerce. Product reviews used in the process of purchasing goods are based on user experience, allowing consumers to engage as producers of information as well as refer to information. This can be a way to increase the efficiency of purchasing decisions from the perspective of consumers, and from the seller's point of view, it can help develop products and strengthen their competitiveness. However, it takes a lot of time and effort to understand the overall assessment and assessment dimensions of the products that I think are important in reading the vast amount of product reviews offered by E-Commerce for the products consumers want to compare. This is because product reviews are unstructured information and it is difficult to read sentiment of reviews and assessment dimension immediately. For example, consumers who want to purchase a laptop would like to check the assessment of comparative products at each dimension, such as performance, weight, delivery, speed, and design. Therefore, in this paper, we would like to propose a method to automatically generate multi-dimensional product assessment scores in product reviews that we would like to compare. The methods presented in this study consist largely of two phases. One is the pre-preparation phase and the second is the individual product scoring phase. In the pre-preparation phase, a dimensioned classification model and a sentiment analysis model are created based on a review of the large category product group review. By combining word embedding and association analysis, the dimensioned classification model complements the limitation that word embedding methods for finding relevance between dimensions and words in existing studies see only the distance of words in sentences. Sentiment analysis models generate CNN models by organizing learning data tagged with positives and negatives on a phrase unit for accurate polarity detection. Through this, the individual product scoring phase applies the models pre-prepared for the phrase unit review. Multi-dimensional assessment scores can be obtained by aggregating them by assessment dimension according to the proportion of reviews organized like this, which are grouped among those that are judged to describe a specific dimension for each phrase. In the experiment of this paper, approximately 260,000 reviews of the large category product group are collected to form a dimensioned classification model and a sentiment analysis model. In addition, reviews of the laptops of S and L companies selling at E-Commerce are collected and used as experimental data, respectively. The dimensioned classification model classified individual product reviews broken down into phrases into six assessment dimensions and combined the existing word embedding method with an association analysis indicating frequency between words and dimensions. As a result of combining word embedding and association analysis, the accuracy of the model increased by 13.7%. The sentiment analysis models could be seen to closely analyze the assessment when they were taught in a phrase unit rather than in sentences. As a result, it was confirmed that the accuracy was 29.4% higher than the sentence-based model. Through this study, both sellers and consumers can expect efficient decision making in purchasing and product development, given that they can make multi-dimensional comparisons of products. In addition, text reviews, which are unstructured data, were transformed into objective values such as frequency and morpheme, and they were analysed together using word embedding and association analysis to improve the objectivity aspects of more precise multi-dimensional analysis and research. This will be an attractive analysis model in terms of not only enabling more effective service deployment during the evolving E-Commerce market and fierce competition, but also satisfying both customers.

An Analysis and Evaluation of Cyber Home Study Contents for Self-directed Learning - Focused on the Earth Science Content of the Science Basic Course for the 7th grade - (사이버가정학습의 자율학습용 콘텐츠 분석 및 평가 - 중학교 1학년 과학 기본과정 지구과학영역을 중심으로 -)

  • Na, Jae-Joon;Son, Cheon-Jae;Kook, Dong-Sik
    • Journal of the Korean earth science society
    • /
    • v.31 no.4
    • /
    • pp.392-402
    • /
    • 2010
  • The purpose of this study is to analyze and evaluate the self-directed learning contents of Earth science area in the basic course of the 7th grade. For this purpose, we applied the 'Cyber Home Study Content Quality Control Tool' presented in 'Elementary Secondary Education e-Learning Quality Management Guidelines (Ver.2.0)' of Korea Education & Research Information Service (2008). The results of contents analysis are as follow: First, it was presented that the study guide introduced the contents which should be studied for one class, properly. And it was not analyzed that the diagnosis assesment was not completed in the initiative study; Second, it was possible to study choosing the contents fitting the learner's level of learning in the main study, it was comprised of about 15 minutes. Third, it was performed without feedback for incorrect answers in the learning assessment, just the number of wrong questions. And the learning arrangement present the important contents learned in that class, summarizing and arranging again. The results of content evaluation are as follows: First, a big difference was not showed against the needs analysis, instructional design, interaction in each class. And the evaluation of the ethics was not included a word or sentence not suitable. The evaluation of copyright, it was analyzed that Work within the content display in compliance with international copyright Second, the evaluation of instructional design presented mainly the description of a simple picture based, the visible resources like flash card were poor. And in the evaluation of Supporting System, it was presented that the contents were installed so that it was freely available for learners. But it was analyzed that there was no memo-function learners were able to jot down something during the studying contents. And in the evaluation for evaluation, the clear valuation basis about the described content was not presented. So there were slightly differences for each class. Third, in the evaluation and analysis for learning content, it was presented that there were some big differences for each class because it was not composed of the latest information, not corrected and complementary.

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.

Analysis of User Behavior for the Revitalization of Small Parks near Stations by the Location Types in Influential Subway Area (역세권내 역 인접 소공원의 유형별 이용행태분석을 통한 활성화 방안 연구)

  • Lee, Joo-Hee;Park, Jin-A.
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.36 no.3
    • /
    • pp.9-20
    • /
    • 2008
  • The government is planning to link a small park with the soon to be ready subway line 9 as a part of Seoul's policy, "The standard or plan for making a water-friendly space by water use" (2007). However, the specified concepts and instructions of the small parks need further work. Therefore, the policy is expected to expand to neighboring small parks near the subway station, but these are not supported by research or data which suggests the needs or actual user behavior and utilization of small parks. our country added the specified concept of small parks and theme parks to the classification of urban parks in the Urban Park Act Revision (2005.3.31), but the concept of small parks is not clearly settled in the law in the scopes of its function, scale, promotion nor particularly defined plans for small park projects. This study examines as small park near a subway station. The characteristics of there region and users vary from the characteristics of the station and region. In the "directions for concrete standards under the types of urban parks and green zones" (2007.2) the types of small parks are classified by "regional characteristics" and "user characteristics". Therefore, this study classifies the subject of neighboring small parks near subway stations as the neighborhood and small urban parks according to the Urban Park Act. The study was paralleled with observation and questionnaires on the analysis of the neighborhood and small urban parks. The actual conditions of park utilization and user behavioral characteristics were deducted by observation, while the questionnaire determined actual user utilization, importance and satisfaction level as well as the small park environment. This study largely focused on three aspects: park facility, design of this sentence isn't even complete. The second aspect isn't finished and the third isn't here.