• Title/Summary/Keyword: complex sentence development

Search Result 11, Processing Time 0.023 seconds

Complex Sentence Development of Korean-Chinese Bilingual Children (한국어-중국어 이중 언어 아동의 한국어 발달 : 복문발달을 중심으로)

  • Lee, Kwee-Ok;Lee, Hae-Ryoun
    • Korean Journal of Child Studies
    • /
    • v.29 no.5
    • /
    • pp.1-12
    • /
    • 2008
  • This study investigated the development of complex sentences in the early utterances of Korean-Chinese children. The subjects were 47(20 2-year-old, 15 3-year-old, and 12 4-year-old) Korean-Chinese children living in China. Each child's spontaneous natural speech during interaction with his/her caregiver was videotaped for about 30 minutes and analyzed for Korean complex sentences using Kim's(2000) categories and Korean Computerized Language Analysis 2.0(2000). Results showed that older children were higher in Mean Length of Utterance and in number and frequency of word types than younger children. The language development of bilingual children was delayed compared with monolingual children but the developmental sequence between bilingual and monolingual children was similar.

  • PDF

Development of the comprehension of complex sentences in Korean Children (아동의 복문(複文) 이해의 발달 - 시간 절부사어의 '전'과 '후'를 중심으로 -)

  • Park, Hee Sook;Choi, Kyoung Sook
    • Korean Journal of Child Studies
    • /
    • v.19 no.2
    • /
    • pp.185-200
    • /
    • 1998
  • This research examined the development in Korean children of the comprehension of complex sentences. The relative difficulty in comprehension of the temporal conjunctions "before" and "after" was investigated. The order of mention, contextual support, and syntactic appearance was controlled. The role of cognitive strategies and developmental changes in the comprehension of these conjunction was included in this study. Subjects were 90 preschool children between 3 and 5 years of age. The task was a sentence-picture matching problem having 3 types of sentences combining temporally with "before" or "after". The results were that developmental changes in comprehension of the temporal conjunctions "before' and "after" in Korean children depended on the order of mention, contextual support, and such syntactic factors as the position of the subject of the sentence. The importance of the consistency in the occurrence of events and the order of mention in the acquisition of complex sentences among Korean children is similar to the acquisition of complex sentences in other languages.

  • PDF

A Study on Syntactic Development in Spontaneous Speech (자발화에 나타난 구문구조 발달 양상)

  • Chang, Jin-A;Kim, Su-Jin;Shin, Ji-Young;Yi, Bong-Won
    • MALSORI
    • /
    • v.68
    • /
    • pp.17-32
    • /
    • 2008
  • The purpose of the present study is to investigate syntactic development of Korean by analysing the spontaneous speech data. Thirty children(3, 5, and 7-year-old and 10 per each age group) and 10 adults are employed as subjects for this study. Speech data were recorded and transcribed in orthography. Transcribed data are analysed syntactically: sentence(simple vs complex) patterns and clause patterns(4 basic types according to the predicate) etc. The results are as follows: 1) simple sentences show higher frequency for the upper age groups, 2) complex sentences with conjunctive and embedded clauses show higher frequency for the upper age groups.

  • PDF

Application of sinusoidal model to perception of electrical hearing in cochlear implants (인공와우 전기 청각 인지에 대한 정현파 모델 적용에 관한 연구)

  • Lee, Sungmin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.1
    • /
    • pp.52-57
    • /
    • 2022
  • Speech consists of the sum of complex sine-waves. This study investigated the perception of electrical hearing by applying the sinusoidal model to cochlear implant simulation. Fourteen adults with normal hearing participated in this study. The sentence recognition tests were implemented using the sentence lists processed by the sinusoidal model which extracts 2, 4, 6, 8 sine-wave components and sentence lists processed by the same sinusoidal model along with cochlear implant simulation (8 channel vocoders). The results showed lower speech recognition for the sentence lists processed by the sinusoidal model and cochlear implant simulation compared to those by the sinusoidal model alone. Notably, the lower the number of sine-wave components (2), the larger the difference was. This study provides the perceptual pattern of sine-wave speech for electrical hearing by cochlear implant listeners, and basic data for development of speech processing algorithms in cochlear implants.

The Development of Preschoolers′ Narrative Competence (3, 4, 5세 유아의 이야기 구성능력 발달)

  • 한유진;유안진
    • Journal of the Korean Home Economics Association
    • /
    • v.39 no.7
    • /
    • pp.71-84
    • /
    • 2001
  • The purpose of this study was to investigate the development of preschoolers'narrative competence. The subject were 60 preschoolers aged 3 through 5 years who were enrolled in the day care center All the subjects were asked to produce a new story. All the story children toad were recorded on audiotape. The data were analyzed Qualitatively and quantitatively using content analysis and the statistical package for Social Science 9.0. The main results of this study were as follows. 1) Significant age difference was observed in preschooler's narrative structure. Older children produced structurally more complex stories containing setting, character, initiating event, attempt and consequence than younger children. 2) Significant age difference was observed in preschooler's narrative length. Older children used significantly more words and sentence when they produced stories than younger children.3) Preschooler'narrative structure was significantly correlated with narrative length.

  • PDF

Multilayer Knowledge Representation of Customer's Opinion in Reviews (리뷰에서의 고객의견의 다층적 지식표현)

  • Vo, Anh-Dung;Nguyen, Quang-Phuoc;Ock, Cheol-Young
    • Annual Conference on Human and Language Technology
    • /
    • 2018.10a
    • /
    • pp.652-657
    • /
    • 2018
  • With the rapid development of e-commerce, many customers can now express their opinion on various kinds of product at discussion groups, merchant sites, social networks, etc. Discerning a consensus opinion about a product sold online is difficult due to more and more reviews become available on the internet. Opinion Mining, also known as Sentiment analysis, is the task of automatically detecting and understanding the sentimental expressions about a product from customer textual reviews. Recently, researchers have proposed various approaches for evaluation in sentiment mining by applying several techniques for document, sentence and aspect level. Aspect-based sentiment analysis is getting widely interesting of researchers; however, more complex algorithms are needed to address this issue precisely with larger corpora. This paper introduces an approach of knowledge representation for the task of analyzing product aspect rating. We focus on how to form the nature of sentiment representation from textual opinion by utilizing the representation learning methods which include word embedding and compositional vector models. Our experiment is performed on a dataset of reviews from electronic domain and the obtained result show that the proposed system achieved outstanding methods in previous studies.

  • PDF

The Effect of Overseas Language Training on the Development of Foreign Language Accuracy (해외어학연수의 외국어 정확성 향상에 대한 효과)

  • Cha, Mi-Yang
    • Journal of Industrial Convergence
    • /
    • v.18 no.4
    • /
    • pp.93-99
    • /
    • 2020
  • The Journal of Industrial Management Society in Republic of Korea. In order to explore the effect of overseas language training on the development of foreign language accuracy, this study investigates the errors in English compositions produced by 27 Korean university students who received overseas language training for 15 weeks. For data collection, students were made to take two tests, a pretest and a posttest, a semester apart. The differences in composition elements and errors between the two tests were examined and statistical analyses were performed. Results showed that while the average length of the compositions and sentences increased, the number of sentences decreased in the posttest. Also, more errors were found in the posttest where the students tried to construct more complex sentence structures. The students' ability to generate sentences were found to have improved, while their competence in using grammatical elements accurately within sentences did not see great improvement. This implies that overseas language training was not effective for aiding the development of one's grammatical accuracy of a foreign language over a 15-week period for the students.

A Foundational Study on Developing a Structural Model for AI-based Sentencing Prediciton Based on Violent Crime Judgment (인공지능기술 적용을 위한 강력범죄 판결문 기반 양형 예측 구조모델 개발 기초 연구)

  • Woongil Park;Eunbi Cho;Jeong-Hyeon Chang;Joo-chang Kim
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.91-98
    • /
    • 2024
  • With the advancement of ICT (Information and Communication Technology), searching for judgments through the internet has become increasingly convenient. However, predicting sentencing based on judgments remains a challenging task for individuals. This is because sentencing involves a complex process of applying aggravating and mitigating factors within the framework of legal provisions, and it often depends on the subjective judgment of the judge. Therefore, this research aimed to develop a model for predicting sentencing using artificial intelligence by focusing on structuring the data from judgments, making it suitable for AI applications. Through theoretical and statistical analysis of previous studies, we identified variables with high explanatory power for predicting sentencing. Additionally, by analyzing 50 legal judgments related to serious crimes that are publicly available, we presented a framework for extracting essential information from judgments. This framework encompasses basic case information, sentencing details, reasons for sentencing, the reasons for the determination of the sentence, as well as information about offenders, victims, and accomplices evident within the specific content of the judgments. This research is expected to contribute to the development of artificial intelligence technologies in the field of law in the future.

Knowledge Extraction Methodology and Framework from Wikipedia Articles for Construction of Knowledge-Base (지식베이스 구축을 위한 한국어 위키피디아의 학습 기반 지식추출 방법론 및 플랫폼 연구)

  • Kim, JaeHun;Lee, Myungjin
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.43-61
    • /
    • 2019
  • Development of technologies in artificial intelligence has been rapidly increasing with the Fourth Industrial Revolution, and researches related to AI have been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. These researches have been focused on solving cognitive problems such as learning and problem solving related to human intelligence from the 1950s. The field of artificial intelligence has achieved more technological advance than ever, due to recent interest in technology and research on various algorithms. The knowledge-based system is a sub-domain of artificial intelligence, and it aims to enable artificial intelligence agents to make decisions by using machine-readable and processible knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it is used with statistical artificial intelligence such as machine learning. Recently, the purpose of the knowledge base is to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. These knowledge bases are used for intelligent processing in various fields of artificial intelligence such as question answering system of the smart speaker. However, building a useful knowledge base is a time-consuming task and still requires a lot of effort of the experts. In recent years, many kinds of research and technologies of knowledge based artificial intelligence use DBpedia that is one of the biggest knowledge base aiming to extract structured content from the various information of Wikipedia. DBpedia contains various information extracted from Wikipedia such as a title, categories, and links, but the most useful knowledge is from infobox of Wikipedia that presents a summary of some unifying aspect created by users. These knowledge are created by the mapping rule between infobox structures and DBpedia ontology schema defined in DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of accuracy of knowledge by using the method of generating knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages contain infobox in Korean Wikipedia, DBpedia has limitations in term of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. In order to demonstrate the appropriateness of this method, we explain a knowledge extraction model according to the DBpedia ontology schema by learning Wikipedia infoboxes. Our knowledge extraction model consists of three steps, document classification as ontology classes, proper sentence classification to extract triples, and value selection and transformation into RDF triple structure. The structure of Wikipedia infobox are defined as infobox templates that provide standardized information across related articles, and DBpedia ontology schema can be mapped these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories which means ontology classes. After determining the classification of the input document, we classify the appropriate sentence according to attributes belonging to the classification. Finally, we extract knowledge from sentences that are classified as appropriate, and we convert knowledge into a form of triples. In order to train models, we generated training data set from Wikipedia dump using a method to add BIO tags to sentences, so we trained about 200 classes and about 2,500 relations for extracting knowledge. Furthermore, we evaluated comparative experiments of CRF and Bi-LSTM-CRF for the knowledge extraction process. Through this proposed process, it is possible to utilize structured knowledge by extracting knowledge according to the ontology schema from text documents. In addition, this methodology can significantly reduce the effort of the experts to construct instances according to the ontology schema.

Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality (지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법)

  • Choi, Sukjae;Lee, Jungwon;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.119-138
    • /
    • 2017
  • Recently, SNS has become an important channel for marketing as well as personal communication. However, cybercrime has also evolved with the development of information and communication technology, and illegal advertising is distributed to SNS in large quantity. As a result, personal information is lost and even monetary damages occur more frequently. In this study, we propose a method to analyze which sentences and documents, which have been sent to the SNS, are related to financial fraud. First of all, as a conceptual framework, we developed a matrix of conceptual characteristics of cybercriminality on SNS and emergency management. We also suggested emergency management process which consists of Pre-Cybercriminality (e.g. risk identification) and Post-Cybercriminality steps. Among those we focused on risk identification in this paper. The main process consists of data collection, preprocessing and analysis. First, we selected two words 'daechul(loan)' and 'sachae(private loan)' as seed words and collected data with this word from SNS such as twitter. The collected data are given to the two researchers to decide whether they are related to the cybercriminality, particularly financial fraud, or not. Then we selected some of them as keywords if the vocabularies are related to the nominals and symbols. With the selected keywords, we searched and collected data from web materials such as twitter, news, blog, and more than 820,000 articles collected. The collected articles were refined through preprocessing and made into learning data. The preprocessing process is divided into performing morphological analysis step, removing stop words step, and selecting valid part-of-speech step. In the morphological analysis step, a complex sentence is transformed into some morpheme units to enable mechanical analysis. In the removing stop words step, non-lexical elements such as numbers, punctuation marks, and double spaces are removed from the text. In the step of selecting valid part-of-speech, only two kinds of nouns and symbols are considered. Since nouns could refer to things, the intent of message is expressed better than the other part-of-speech. Moreover, the more illegal the text is, the more frequently symbols are used. The selected data is given 'legal' or 'illegal'. To make the selected data as learning data through the preprocessing process, it is necessary to classify whether each data is legitimate or not. The processed data is then converted into Corpus type and Document-Term Matrix. Finally, the two types of 'legal' and 'illegal' files were mixed and randomly divided into learning data set and test data set. In this study, we set the learning data as 70% and the test data as 30%. SVM was used as the discrimination algorithm. Since SVM requires gamma and cost values as the main parameters, we set gamma as 0.5 and cost as 10, based on the optimal value function. The cost is set higher than general cases. To show the feasibility of the idea proposed in this paper, we compared the proposed method with MLE (Maximum Likelihood Estimation), Term Frequency, and Collective Intelligence method. Overall accuracy and was used as the metric. As a result, the overall accuracy of the proposed method was 92.41% of illegal loan advertisement and 77.75% of illegal visit sales, which is apparently superior to that of the Term Frequency, MLE, etc. Hence, the result suggests that the proposed method is valid and usable practically. In this paper, we propose a framework for crisis management caused by abnormalities of unstructured data sources such as SNS. We hope this study will contribute to the academia by identifying what to consider when applying the SVM-like discrimination algorithm to text analysis. Moreover, the study will also contribute to the practitioners in the field of brand management and opinion mining.