• Title/Summary/Keyword: Sentence-level

Search Result 204, Processing Time 0.031 seconds

Discourse Characteristics in Healthy Elderly: Effects of Aging, Gender and Educational Level (노년층의 담화 산출 특성: 노화, 성별, 교육정도에 따른 차이)

  • Choi, Hyun-Joo
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.135-143
    • /
    • 2012
  • Discourse is regarded as an important component of communication assessment, but studies about the discourse characteristics of the elderly are scant. The purpose of this study was to confirm the effects of aging, gender, and educational level on discourse in elderly people with normal cognitive function. Forty normal elderly and forty young people participated in this study. A picture description task (Boston Cookie-Theft picture) was used to examine discourse function. The description task was analyzed for both productivity (total number of sentences, total number of syllables, and syllables per sentence) and semantics (CIU ratio). The results were as follows: 1) Only CIU ratio differed significantly according to age. 2) In the total number of syllables and syllables per sentence, females demonstrate a higher number than males. 3) The CIU ratio differed significantly according to educational level. These results suggest that impairment of communicative function is an aspect of cognitive impairment that can be related to aging. Also, discourse performance in the elderly is associated with their gender and educational level.

A Study on the Establishment of Comparison System between the Statement of Military Reports and Related Laws (군(軍) 보고서 등장 문장과 관련 법령 간 비교 시스템 구축 방안 연구)

  • Jung, Jiin;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.109-125
    • /
    • 2020
  • The Ministry of National Defense is pushing for the Defense Acquisition Program to build strong defense capabilities, and it spends more than 10 trillion won annually on defense improvement. As the Defense Acquisition Program is directly related to the security of the nation as well as the lives and property of the people, it must be carried out very transparently and efficiently by experts. However, the excessive diversification of laws and regulations related to the Defense Acquisition Program has made it challenging for many working-level officials to carry out the Defense Acquisition Program smoothly. It is even known that many people realize that there are related regulations that they were unaware of until they push ahead with their work. In addition, the statutory statements related to the Defense Acquisition Program have the tendency to cause serious issues even if only a single expression is wrong within the sentence. Despite this, efforts to establish a sentence comparison system to correct this issue in real time have been minimal. Therefore, this paper tries to propose a "Comparison System between the Statement of Military Reports and Related Laws" implementation plan that uses the Siamese Network-based artificial neural network, a model in the field of natural language processing (NLP), to observe the similarity between sentences that are likely to appear in the Defense Acquisition Program related documents and those from related statutory provisions to determine and classify the risk of illegality and to make users aware of the consequences. Various artificial neural network models (Bi-LSTM, Self-Attention, D_Bi-LSTM) were studied using 3,442 pairs of "Original Sentence"(described in actual statutes) and "Edited Sentence"(edited sentences derived from "Original Sentence"). Among many Defense Acquisition Program related statutes, DEFENSE ACQUISITION PROGRAM ACT, ENFORCEMENT RULE OF THE DEFENSE ACQUISITION PROGRAM ACT, and ENFORCEMENT DECREE OF THE DEFENSE ACQUISITION PROGRAM ACT were selected. Furthermore, "Original Sentence" has the 83 provisions that actually appear in the Act. "Original Sentence" has the main 83 clauses most accessible to working-level officials in their work. "Edited Sentence" is comprised of 30 to 50 similar sentences that are likely to appear modified in the county report for each clause("Original Sentence"). During the creation of the edited sentences, the original sentences were modified using 12 certain rules, and these sentences were produced in proportion to the number of such rules, as it was the case for the original sentences. After conducting 1 : 1 sentence similarity performance evaluation experiments, it was possible to classify each "Edited Sentence" as legal or illegal with considerable accuracy. In addition, the "Edited Sentence" dataset used to train the neural network models contains a variety of actual statutory statements("Original Sentence"), which are characterized by the 12 rules. On the other hand, the models are not able to effectively classify other sentences, which appear in actual military reports, when only the "Original Sentence" and "Edited Sentence" dataset have been fed to them. The dataset is not ample enough for the model to recognize other incoming new sentences. Hence, the performance of the model was reassessed by writing an additional 120 new sentences that have better resemblance to those in the actual military report and still have association with the original sentences. Thereafter, we were able to check that the models' performances surpassed a certain level even when they were trained merely with "Original Sentence" and "Edited Sentence" data. If sufficient model learning is achieved through the improvement and expansion of the full set of learning data with the addition of the actual report appearance sentences, the models will be able to better classify other sentences coming from military reports as legal or illegal. Based on the experimental results, this study confirms the possibility and value of building "Real-Time Automated Comparison System Between Military Documents and Related Laws". The research conducted in this experiment can verify which specific clause, of several that appear in related law clause is most similar to the sentence that appears in the Defense Acquisition Program-related military reports. This helps determine whether the contents in the military report sentences are at the risk of illegality when they are compared with those in the law clauses.

Development of automated scoring system for English writing (영작문 자동 채점 시스템 개발 연구)

  • Jin, Kyung-Ae
    • English Language & Literature Teaching
    • /
    • v.13 no.1
    • /
    • pp.235-259
    • /
    • 2007
  • The purpose of the present study is to develop a prototype automated scoring system for English writing. The system was developed for scoring writings of Korean middle school students. In order to develop the automated scoring system, following procedures have been applied. First, review and analysis of established automated essay scoring systems in other countries have been accomplished. By doing so, we could get the guidance for development of a new sentence-level automated scoring system for Korean EFL students. Second, knowledge base such as lexicon, grammar and WordNet for natural language processing and error corpus of English writing of Korean middle school students were established. Error corpus was established through the paper and pencil test with 589 third year middle school students. This study provided suggestions for the successful introduction of an automated scoring system in Korea. The automated scoring system developed in this study should be continuously upgraded to improve the accuracy of the scoring system. Also, it is suggested to develop an automated scoring system being able to carry out evaluation of English essay, not only sentence-level evaluation. The system needs to be upgraded for the improved precision, but, it was a successful introduction of an sentence-level automated scoring system for English writing in Korea.

  • PDF

Korean Lip-Reading: Data Construction and Sentence-Level Lip-Reading (한국어 립리딩: 데이터 구축 및 문장수준 립리딩)

  • Sunyoung Cho;Soosung Yoon
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.2
    • /
    • pp.167-176
    • /
    • 2024
  • Lip-reading is the task of inferring the speaker's utterance from silent video based on learning of lip movements. It is very challenging due to the inherent ambiguities present in the lip movement such as different characters that produce the same lip appearances. Recent advances in deep learning models such as Transformer and Temporal Convolutional Network have led to improve the performance of lip-reading. However, most previous works deal with English lip-reading which has limitations in directly applying to Korean lip-reading, and moreover, there is no a large scale Korean lip-reading dataset. In this paper, we introduce the first large-scale Korean lip-reading dataset with more than 120 k utterances collected from TV broadcasts containing news, documentary and drama. We also present a preprocessing method which uniformly extracts a facial region of interest and propose a transformer-based model based on grapheme unit for sentence-level Korean lip-reading. We demonstrate that our dataset and model are appropriate for Korean lip-reading through statistics of the dataset and experimental results.

The Study on the Influence that the Understanding Degree about the Sentence Stated Math. Problems Reach the Extension of the Problem Solving Capacity. - Focusing on the Unit of Equation and Inequality in Middle School - (문장제에 대한 이해정도가 문제해결력 신장에 미치는 영향에 대한 연구 -중학교 방정식과 부등식 단원을 중심으로-)

  • 지재근;오세열
    • Journal of the Korean School Mathematics Society
    • /
    • v.3 no.1
    • /
    • pp.189-200
    • /
    • 2000
  • The purpose of this thesis is that the students understand the sentence stated math problems closely related to the real life and adapted the right solving strategies try to find the solution to a problem. The following research problem were proposed. 1. How repeated thinking lessons develop the understanding of problems and influence the usage of correct problem solving strategies and extensions of problem solving. 2. There are how much differences of achievement for each type of sentence stated problems by using comparative analysis of upper class, intermediate class, and lower class for each level between the experimental and comparative classes. In order to conduct this research the classes were divided into three different level - upper class, intermediate class and lower class. Each level include an experimental class and a comparative class. The two classes (experimental class and comparative class) of the same level were tested on the basis of class division record with the experimental class repeated learning papers for two weeks were used to guide the fixed thinking algorism for each sentence stated math problems. Eight common problems were chosen from a variety of textbooks : number calculation problems, velocity-distance-time problems, the density of a mixture, benefit problems, distribution problems, problems about working, ratio problems, the length of a figure problems. After conducting this research experiment The differences in achievement level between the experimental class and comparative class, were compared and analyzed through achievement tests made from the achievement test papers with seven problems, which were worth seventy points (total score). The conclusions of this thesis are as follows: Firstly, leaning activities through the usage of repeated learning papers for each level class produce an even development of achievement level especially in the case of the upper class learners, they have particular differences (between experimental class and comparative class) compared to the intermediate level and lower classes. Secondly, according to the analysis about achievement development each problems, learners easily accept the strategies of solution through the formula setting up to the problem of velocity -distance-time, and to the density of the mixture they adapted the picture drawing strategies interestingly, However each situation requires a variety of appropriate solution strategies. Teachers will have to employ other interesting solution strategies which relate to real life.

  • PDF

The Role of Pitch Range Reset in Korean Sentence Processing

  • Kong, Eun-Jong
    • Phonetics and Speech Sciences
    • /
    • v.2 no.1
    • /
    • pp.33-39
    • /
    • 2010
  • This study investigates the effect of pitch range reset in Korean listeners' processing of syntactically ambiguous participle structures. Unlike Japanese and English,in Korean, the downtrend or the reset of pitch range does not consistently differentiate Accentual Phrases (AP), a lower level of phrasing, from Intonational Phrases (IP), a higher level of phrasing. Therefore, we explore Korean listeners' comprehension patterns for syntactically ambiguous speech strings varying in 1) the relative height of F0 peaks across prosodic units, and 2) the types of prosodic phrasing, to see whether pitch range reset informs the recovery of syntactic structure even though it is not reflected in the intonational hierarchy in Korean. The results show that the hierarchical level of prosodic phrasing affects the parsing pattern of syntactic ambiguity. The pitch range reset also cued the location of syntactic boundaries, but this effect was confined to phrases across AP.

  • PDF

A Design for Korean Phrase Structure Grammar(KPSG) in ALE (ALE를 이용한 한국어 문법의 설계)

  • Choi, Woon-Ho;Chang, Suk-Jin
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.217-221
    • /
    • 1998
  • 본 논문에서는 한국어의 전산처리를 위한 문법 모형 개발의 일부분으로 HPSG에 기반한 문법 모형의 개발을 시도한다. 문법 모형의 개발에는 ALE(Attribute Logic Engine)를 이용하며, 보문 구조와 보조 용언 구문을 분석하기 위한 사전구조 및 문법 규칙을 제시한다. 그리고 문의 종류 (Sentence Type:ST)와 문계(Sentence Level: SL), 시제, 존대 등을 분석해서 표상하기 위한 유형 계층 및 어휘부, 문법 규칙, 문법 원리 등을 제시한다.

  • PDF

Sentiment Analysis using Robust Parallel Tri-LSTM Sentence Embedding in Out-of-Vocabulary Word (Out-of-Vocabulary 단어에 강건한 병렬 Tri-LSTM 문장 임베딩을 이용한 감정분석)

  • Lee, Hyun Young;Kang, Seung Shik
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.16-24
    • /
    • 2021
  • The exiting word embedding methodology such as word2vec represents words, which only occur in the raw training corpus, as a fixed-length vector into a continuous vector space, so when mapping the words incorporated in the raw training corpus into a fixed-length vector in morphologically rich language, out-of-vocabulary (OOV) problem often happens. Even for sentence embedding, when representing the meaning of a sentence as a fixed-length vector by synthesizing word vectors constituting a sentence, OOV words make it challenging to meaningfully represent a sentence into a fixed-length vector. In particular, since the agglutinative language, the Korean has a morphological characteristic to integrate lexical morpheme and grammatical morpheme, handling OOV words is an important factor in improving performance. In this paper, we propose parallel Tri-LSTM sentence embedding that is robust to the OOV problem by extending utilizing the morphological information of words into sentence-level. As a result of the sentiment analysis task with corpus in Korean, we empirically found that the character unit is better than the morpheme unit as an embedding unit for Korean sentence embedding. We achieved 86.17% accuracy on the sentiment analysis task with the parallel bidirectional Tri-LSTM sentence encoder.

Eojeol-Block Bidirectional Algorithm for Automatic Word Spacing of Hangul Sentences (한글 문장의 자동 띄어쓰기를 위한 어절 블록 양방향 알고리즘)

  • Kang, Seung-Shik
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.4
    • /
    • pp.441-447
    • /
    • 2000
  • Automatic word spacing is needed to solve the automatic indexing problem of the non-spaced documents and the space-insertion problem of the character recognition system at the end of a line. We propose a word spacing algorithm that automatically finds out word spacing positions. It is based on the recognition of Eojeol components by using the sentence partition and bidirectional longest-match algorithm. The sentence partition utilizes an extraction of Eojeol-block where the Eojeol boundary is relatively clear, and a Korean morphological analyzer is applied bidirectionally to the recognition of Eojeol components. We tested the algorithm on two sentence groups of about 4,500 Eojeols. The space-level recall ratio was 97.3% and the Eojeol-level recall ratio was 93.2%.

  • PDF

A Study of Speech Control Tags Based on Semantic Information of a Text (텍스트의 의미 정보에 기반을 둔 음성컨트롤 태그에 관한 연구)

  • Chang, Moon-Soo;Chung, Kyeong-Chae;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.187-200
    • /
    • 2006
  • The speech synthesis technology is widely used and its application area is also being broadened to an automatic response service, a learning system for handicapped person, etc. However, the sound quality of the speech synthesizer has not yet reached to the satisfactory level of users. To make a synthesized speech, the existing synthesizer generates rhythms only by the interval information such as space and comma or by several punctuation marks such as a question mark and an exclamation mark so that it is not easy to generate natural rhythms of people even though it is based on mass speech database. To make up for the problem, there is a way to select rhythms after processing language from a higher level information. This paper proposes a method for generating tags for controling rhythms by analyzing the meaning of sentence with speech situation information. We use the Systemic Functional Grammar (SFG) [4] which analyzes the meaning of sentence with speech situation information considering the sentence prior to the given one, the situation of a conversation, the relationship among people in the conversation, etc. In this study, we generate Semantic Speech Control Tag (SSCT) by the result of SFG's meaning analysis and the voice wave analysis.

  • PDF