• Title/Summary/Keyword: Sentence simplification

Search Result 7, Processing Time 0.024 seconds

Sentence ion : Sentence Revision with Concept ion (문장추상화 : 개념추상화를 도입한 문장교열)

  • Kim, Gon;Yang, Jaegun;Bae, Jaehak;Lee, Jonghyuk
    • The KIPS Transactions:PartB
    • /
    • v.11B no.5
    • /
    • pp.563-572
    • /
    • 2004
  • Sentence ion is a simplification of a sentence preserving its communicative function. It accomplishes sentence revision and concept ion simultaneously. Sentence revision is a method that resolves the discrepancy between human's thoughts and its expressed semantic in sentences. Concept ion is an expression of general ideas acquired from the common elements of concepts. Sentence ion selects the main constituents of given sentences and describes the upper concepts of them with detecting their semantic information. This enables sen fence revision and concept ion simultaneously. In this paper, a syntactic parser LGPI+ and an ontology OfN are utilized for sentence ion. Sentence abstracter SABOT makes use of LGPI+ and OfN. SABOT processes the result of parsing and selects the candidate words for sentence ion. This paper computes the sentence recall of the main sentences and the topic hit ratio of the selected sentences with the text understanding system using sentence ion. The sources are 58 paragraphs in 23 stories. As a result of it, the sentence recall is about .54 ~ 72% and the topic hit ratio is about 76 ~ 86%. This paper verified that sentence ion enables sentence revision that can select the topic sentences of a given text efficiently and concept ion that can improve the depth of text understanding.

A Corpus-based Study of Translation Universals in English Translations of Korean Newspaper Texts (한국 신문의 영어 번역에 나타난 번역 보편소의 코퍼스 기반 분석)

  • Goh, Gwang-Yoon;Lee, Younghee (Cheri)
    • Cross-Cultural Studies
    • /
    • v.45
    • /
    • pp.109-143
    • /
    • 2016
  • This article examines distinctive linguistic shifts of translational English in an effort to verify the validity of the translation universals hypotheses, including simplification, explicitation, normalization and leveling-out, which have been most heavily explored to date. A large-scale study involving comparable corpora of translated and non-translated English newspaper texts has been carried out to typify particular linguistic attributes inherent in translated texts. The main findings are as follows. First, by employing the parameters of STTR, top-to-bottom frequency words, and mean values of sentence lengths, the translational instances of simplification have been detected across the translated English newspaper corpora. In contrast, the portion of function words produced contrary results, which in turn suggests that this feature might not constitute an effective test of the hypothesis. Second, it was found that the use of connectives was more salient in original English newspaper texts than translated English texts, being incompatible with the explicitation hypothesis. Third, as an indicator of translational normalization, lexical bundles were found to be more pervasive in translated texts than in non-translated texts, which is expected from and therefore support the normalization hypothesis. Finally, the standard deviations of both STTR and mean sentence lengths turned out to be higher in translated texts, indicating that the translated English newspaper texts were less leveled out within the same corpus group, which is opposed to what the leveling-out hypothesis postulates. Overall, the results suggest that not all four hypotheses may qualify for the label translation universals, or at least that some translational predictors are not feasible enough to evaluate the effectiveness of the translation universals hypotheses.

The English Cause-Focused Causal Construction

  • Kim, Yangsoon
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.161-166
    • /
    • 2020
  • The primary aim of this paper is to analyze the resultative adjunct clause, i.e., (thus/thereby/hence) ~ing participle and provide explicit syntactic, semantic and sociolinguistic explanation on the question what causes the cause-focused causal construction with resultative (thus/thereby/hence) ~ing participle in English. What comes first is either cause or effect clause. This study explores the recent style shift of causal constructions from the effect-focused pattern to the cause-focused pattern. In this study, we argue that the increasing number of the cause-focused main clause with a resultative ~ing participle clause shows the process of the style evolution improving speech/wring style in many respects including syntactic simplification, clarification of the sentence meaning with impact on the focused clauses, and improvement of the flow of speech/writing. The style shift found in the English resultative adjunct clauses, i.e., (thus/hence/thereby) ~ing participle constructions prove to be the style evolution from syntactic, semantic and sociolinguistic point of views.

Embedded clause extraction and restoration for the performance enhancement in Korean-Vietnamese statistical machine translation (한베 통계기계번역의 성능 향상을 위한 내포문 추출 및 복원 기법)

  • Cho, Seung-Woo;Kim, Young-Gil;Kwon, Hong-Seok;Lee, Eui-Hyun;Lee, Won-Ki;Cho, Hyung-Mi;Lee, Jong-Hyeok
    • 한국어정보학회:학술대회논문집
    • /
    • 2016.10a
    • /
    • pp.280-284
    • /
    • 2016
  • 본 논문에서는 기호로 둘러싸인 내포문이 포함된 문장의 번역 성능을 높이는 방법을 제안한다. 입력 문장에서 내포문을 추출하여 여러 문장으로 나타내고, 각각의 문장들을 번역한다. 그리고 번역된 문장들을 복원정보를 활용하여 최종 번역 문장을 생성한다. 이러한 방법론은 입력 문장의 길이를 줄여주며, 그로 인하여 문장 구조가 단순해져 번역 품질이 향상된다. 본 논문에서는 한국어-베트남어 통계 기반 번역기에 대하여 제안한 방법론을 적용하고 실험하였다. 그 결과 BLEU 점수가 약 1.5 향상된 것을 확인할 수 있었다.

  • PDF

Embedded clause extraction and restoration for the performance enhancement in Korean-Vietnamese statistical machine translation (한베 통계기계번역의 성능 향상을 위한 내포문 추출 및 복원 기법)

  • Cho, Seung-Woo;Kim, Young-Gil;Kwon, Hong-Seok;Lee, Eui-Hyun;Lee, Won-Ki;Cho, Hyung-Mi;Lee, Jong-Hyeok
    • Annual Conference on Human and Language Technology
    • /
    • 2016.10a
    • /
    • pp.280-284
    • /
    • 2016
  • 본 논문에서는 기호로 둘러싸인 내포문이 포함된 문장의 번역 성능을 높이는 방법을 제안한다. 입력 문장에서 내포문을 추출하여 여러 문장으로 나타내고, 각각의 문장들을 번역한다. 그리고 번역된 문장들을 복원정보를 활용하여 최종 번역 문장을 생성한다. 이러한 방법론은 입력 문장의 길이를 줄여주며, 그로 인하여 문장 구조가 단순해져 번역 품질이 향상된다. 본 논문에서는 한국어-베트남어 통계 기반 번역기에 대하여 제안한 방법론을 적용하고 실험하였다. 그 결과 BLEU 점수가 약 1.5 향상된 것을 확인할 수 있었다.

  • PDF

This study revises Lee Hyo-seok's The Buckwheat Season, utilizing Novel Corpus, intermediate learners' level (소설텍스트의 난이도 조정 방안 연구 -이효석의 「메밀꽃 필 무렵」을 중심으로-)

  • Hwang, Hye ran
    • Journal of Korean language education
    • /
    • v.29 no.4
    • /
    • pp.255-294
    • /
    • 2018
  • The Buckwheat Season, evaluated as the best of Lee Hyo-seok's literature, is one of the short stories that represent Korean literature. However, vivid literary expressions such as lyrical and beautiful depictions, figurative expressions and dialects, which show the Korean beauty, rather make learners have difficulty and become a factor that fails in reading comprehension. Thus, it is necessary to revise and present the text modified for the learners' language level. The methods of revising a literary text include the revision of linguistic elements such as cryptic vocabulary or sentence structure and the revision of the composition of the text, e.g. suggestion of characters or plot, or insertion of illustration. The methods of revising the language of the text can be divided into methods of simplification and detailing. However, in the process of revising the text, many depend on the adapter's subjective perception, not revising it with objective criteria. This paper revised the text, utilizing by the Academy of Korean Studies, , and the by the National Institute of Korean Language to secure objectivity in revising the text.

Historic Status and Grammatical Characteristics of Korean language in the Early 20th Century (한국어사에서 20세기 초 한국어의 위상과 문법 특징)

  • Hong, Jongseon
    • Korean Linguistics
    • /
    • v.71
    • /
    • pp.1-22
    • /
    • 2016
  • The early 20th century is a period of time when Korea confronted with the surging waves of modernization, and made a variety of internal reactions. The Korean language, not immune to the upheaval, also experienced new changes and gradually gained characteristics of today's Korean. Although scholars have not yet fully agreed upon the time division of Korean, Gabo reformation (1896) is usually considered to be the beginning of modern Korean. Thus, the early 20th century was also the beginning of modern Korean. Phonological, lexical, and grammatical characteristics of modern day Korean began to appear during this period of time. Phonologically, the 10 vowel system was established, glottal sounds and aspirated sounds increased, vowel harmony declined. Phenomena such as vowel raising, front-vowelization, monophthongization, and the word-initial rule appeared. Meanwhile, hangul-Chinese mix writing became common practice, and hangul-only writing also started to take place in narrative writing, and elements of spoken language began to reflect in written language. All those pointed to the unification of written and spoken language. Under the influence of modernization, a great amount of new words appeared. Especially, Japanese and other foreign words flooded in in great quantities. Grammatically, '-eos-(-엇-), -neun-(-는-), -ges-(-겟-)' trichotomy system of tenses was established, and hearer-oriented honorific system also formed a binary system of 'hasoseo(하소서), hasibsio(하십시오), hao(하오), hage(하게), haera(해라)' and 'hae (해), haeyo(해요)'. In word formation and sentence construction, the use of '-gi(-기)' became more frequent than '-eum(-음)', while '~geot(~것)' also significantly increased. In negative, causative and passive expressions, the use of long form, which has fewer restrictions than the short form, became more frequent. A tendency towards simplicity appeared. In the same vain, long and complex sentences with several clauses tend to be avoided. Instead, short simple sentences became more favorable. Korean linguistics scholars should pay closer attention to the modernization period, which includes the early 20th century. In order to fully understand today's Korean language, more thorough research on this immediately preceding period is necessary.