• Title/Summary/Keyword: measure phrases

Search Result 23, Processing Time 0.025 seconds

Korean deadjectival inchoatives and measure phrases: a compositional study

  • Lim, Dongsik
    • Language and Information
    • /
    • v.20 no.1
    • /
    • pp.73-91
    • /
    • 2016
  • Korean adjectives in general cannot combine with measure phrases (MP), but MPs are compatible with adjectives when they appear with the inchoative morpheme -(e)ci. In this case, MPs can only denote the difference between two states along the dimension denoted by the root adjective. To account for this, this paper proposes that i) -(e)ci is a spell-out of V in the directed motion construction which takes an abstract path argument, like become, and ii) this path argument contains a comparative morpheme. By assuming this we can explain why MPs appear with -(e)ci, as well as other interesting phenomena such as variable telicity in deadjectival verbs with -(e)ci.

  • PDF

Document Clustering with Relational Graph Of Common Phrase and Suffix Tree Document Model (공통 Phrase의 관계 그래프와 Suffix Tree 문서 모델을 이용한 문서 군집화 기법)

  • Cho, Yoon-Ho;Lee, Sang-Keun
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.2
    • /
    • pp.142-151
    • /
    • 2009
  • Previous document clustering method, NSTC measures similarities between two document pairs using TF-IDF during web document clustering. In this paper, we propose new similarity measure using common phrase-based relational graph, not TF-IDF. This method suggests that weighting common phrases by relational graph presenting relationship among common phrases in document collection. And experimental results indicate that proposed method is more effective in clustering document collection than NSTC.

Identification of Maximal-Length Noun Phrases Based on Expanded Chunks and Classified Punctuations in Chinese (확장청크와 세분화된 문장부호에 기반한 중국어 최장명사구 식별)

  • Bai, Xue-Mei;Li, Jin-Ji;Kim, Dong-Il;Lee, Jong-Hyeok
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.4
    • /
    • pp.320-328
    • /
    • 2009
  • In general, there are two types of noun phrases(NP): Base Noun Phrase(BNP), and Maximal-Length Noun Phrase(MNP). MNP identification can largely reduce the complexity of full parsing, help analyze the general structure of complex sentences, and provide important clues for detecting main predicates in Chinese sentences. In this paper, we propose a 2-phase hybrid approach for MNP identification which adopts salient features such as expanded chunks and classified punctuations to improve performance. Experimental result shows a high quality performance of 89.66% in $F_1$-measure.

A New Importance Measure of Association Rules Using Information Theory (정보이론에 기반한 연관 규칙들의 새로운 중요도 측정 방법)

  • Lee, Chang-Hwan;Bae, Joohyun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.1
    • /
    • pp.37-42
    • /
    • 2014
  • The abstract should concisely state what was done, how it was done, principal results, and their significance. It should be less than 300 words for all forms of publication. The abstract should be written as one paragraph and should not contain tabular material or numbered references. At the end of abstract, keywords should be given in 3 to 5 words or phrases.

Calculation Correctio Factor of Bridge Capacity using Fuzzy Sets Theory (퍼지를 이용한 교량 안전도평가의 보정계수 산정)

  • 조원신;박기태;김상효;황학주
    • Proceedings of the Korea Concrete Institute Conference
    • /
    • 1992.10a
    • /
    • pp.240-244
    • /
    • 1992
  • The values of a linguistic variable are words, phrases, or sentences in a given language. For example, structural damage can be considered as linguistic variable with values such a 'severely damaged', 'moderately damaged', which are meaningful classifications but not clearly defined, This paper is to evaluate reasonably the correction factor of bridge capacity with the aid of fuzzy sets theory. By using the above mentioned fuzzy measure, the concept of fuzzy integral and linear membership function can be defined. It is concluded that the fuzzy sets theory cam be applied to determine reasonably the correction factor of bridge capacity.

  • PDF

Topic Continuity in Korea Narrative (한국 설화문에서의 화제표현의 연속성)

  • Hi-JaChong
    • Korean Journal of Cognitive Science
    • /
    • v.2 no.2
    • /
    • pp.405-428
    • /
    • 1990
  • Language has a social function to communicate information. Linguists have gradually paid their attention to the function of language since the nineteen sixties, especially to the relationship of form, meaning and the function. The relationship could be more clearly grasped through disciyrse-based analysis than through sentence-based analysis. Many researches were centered on the discourse functional notion of topic. In the early 1970's the subject was defined as the grammatiocalized topic the topic as a discrete single constituent of the clause. In the late 1970's several lingusts including Givon suggerted that the topic was not an atomic, disctete entity, and that the clause could have more than one topic. The purpose of the present study is, following Givon, to study grammatical coding devices of topic and to measure the relative topic continuity/discontinuity of participant argu, ents in Korean narratives. By so doing, I would like to shed some light on effective ways of communicating information. The grammatical coding devices analyzed are the following eight structures: zero-anaphora, personal pronous, demonstrative pronouns, names, noun phrases following demonstratives, noun phrases following possessives, definite noun phrases and indefinite referentials. The narrative studied for the count was taken from the KoreanCIA chief's Testiomny:Revolution and Idol by Hyung Wook Kim. It was chosen because it was assumed that Kim's purpose in the novel was to tell a true story, which would not distort the natural use of language for literary effect. The measures taken in the analysis wre those of 'lookback', 'persistence', ambiguity'. The first of these, 'lookback', is a measure of the size of gap between the previous occurrence of a referent and its current occurence in the clause. The meausure of persistence, which is a measure of the speaker's topocal intent, reflects the topic's importance in the discourse. The third measure is a measure of ambiguity. This is necessary for assessing the disruptive effects that other topics within five previous clauses may have on topic identification. The more other topics are present within five previous clauses, the more difficult is the task of correct identification of a topic. The results of the present study show that the humanness of entities is the most powerful factior in topic continutiy in narrative discourse. The semantic roles of human arguments in narrative discourse tend to be agents or experiences. Since agents and experiences have high topicality in discourse, human entities clearly become clausal or discoursal topics. The results also show that the grammatical devices signal varying degrees of topic continuity discontinuity in continuous discourse. The more continuous a topic argument is, the less it is coded. For example, personal pronouns have the most continutiy and indefinite referentials have the least continutiy. The study strongly shows that topic continuity discontinutiy is controlled not only by grammatical devices available in the language but by socio-cultural factors and writer's intentions.

Automatic Recognition of Translation Phrases Enclosed with Parenthesis in Korean-English Mixed Documents (한영 혼용문에서 괄호 안 대역어구의 자동 인식)

  • Lee, Jae-Sung;Seo, Young-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.9B no.4
    • /
    • pp.445-452
    • /
    • 2002
  • In Korean-English mixed documents, translated technical words are usually used with the attached full words or original words enclosed with parenthesis. In this paper, a collective method is presented to recognize and extract the translation phrases with using a base translation dictionary. In order to process the unregistered title words and translation words in the dictionary, a phonetic similarity matching method, a translation partial matching method, and a compound word matching method are newly proposed. The experiment result of each method was measured in F-measure(the alpha is set to 0.4) ; exact matching of dictionary terms as a baseline method showed 23.8%, the hybrid method of translation partial matching and phonetic similarity matching 75.9%, and the compound word matching method including the hybrid method 77.3%, which is 3.25 times better than the baseline method.

A Study on The Revision of UCP600 concerning the Sea Transport Documents (UCP 600 해상운송서류(海上運送書類) 규정(規定)의 주요(主要) 개정사항(改正事項)에 관한 연구(硏究))

  • Park, Sae-Woon
    • THE INTERNATIONAL COMMERCE & LAW REVIEW
    • /
    • v.35
    • /
    • pp.71-98
    • /
    • 2007
  • UCP 600 approved at the Banking Commission Meeting of ICC at the end of October, 2006 comes into effect from July 1, 2007. The main revision of the UCP 600 concerning the sea transport document are as follows. First, if the bill of lading contains an on-board-notation, with the date of shipment, the date stated in the on-board-notation will be deemed the date of shipment. Secondly, phrases "on its face" and "otherwise authenticated" should be eliminated. Thirdly, when an agent signs for or signs on behalf of the master, there is no longer a need for the name of master to be quoted. Fourthly, the terminology "loading on-board or shipped on a named vessel" is changed to "shipped on-board a named vessel." Fifthly, phrases "the rejection of the documents transported only by sail" is removed. Finally, new rule in UCP is the signing of a charter party bill of lading by the charterer or a named agent on behalf of the charterer. My assessment of the revision in UCP 600 is as follows: Because a freight forwarder transport document is a weaker form than a liner bill of lading as collateral, banks may need a secure measure as to protect themselves from such a weak collateral effect. we recognize that Such a weak collateral effect stemmed from the elimination of rules in UCP 500 article 30, and the admission of transport documents issued by the freight forwarder as long as any one besides carrier, shipper, and charterer satisfies the requirements of transport document clauses in UCP 600. Finally, I hope the Commentary on UCP 600 will serve to explain the ambiguities remaining in the new rules.

  • PDF

The f0 distribution of Korean speakers in a spontaneous speech corpus

  • Yang, Byunggon
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.31-37
    • /
    • 2021
  • The fundamental frequency, or f0, is an important acoustic measure in the prosody of human speech. The current study examined the f0 distribution of a corpus of spontaneous speech in order to provide normative data for Korean speakers. The corpus consists of 40 speakers talking freely about their daily activities and their personal views. Praat scripts were created to collect f0 values, and a majority of obvious errors were corrected manually by watching and listening to the f0 contour on a narrow-band spectrogram. Statistical analyses of the f0 distribution were conducted using R. The results showed that the f0 values of all the Korean speakers were right-skewed, with a pointy distribution. The speakers produced spontaneous speech within a frequency range of 274 Hz (from 65 Hz to 339 Hz), excluding statistical outliers. The mode of the total f0 data was 102 Hz. The female f0 range, with a bimodal distribution, appeared wider than that of the male group. Regression analyses based on age and f0 values yielded negligible R-squared values. As the mode of an individual speaker could be predicted from the median, either the median or mode could serve as a good reference for the individual f0 range. Finally, an analysis of the continuous f0 points of intonational phrases revealed that the initial and final segments of the phrases yielded several f0 measurement errors. From these results, we conclude that an examination of a spontaneous speech corpus can provide linguists with useful measures to generalize acoustic properties of f0 variability in a language by an individual or groups. Further studies would be desirable of the use of statistical measures to secure reliable f0 values of individual speakers.

Using Corpora for Studying English Grammar

  • Kwon, Heok-Seung
    • Korean Journal of English Language and Linguistics
    • /
    • v.4 no.1
    • /
    • pp.61-81
    • /
    • 2004
  • This paper will look at some grammatical phenomena which will illustrate some of the questions that can be addressed with a corpus-based approach. We will use this approach to investigate the following subjects in English grammar: number ambiguity, subject-verb concord, concord with measure expressions, and (reflexive) pronoun choice in coordinated noun phrases. We will emphasize the distinctive features of the corpus-based approach, particularly its strengths in investigating language use, as opposed to traditional descriptions or prescriptions of structure in English grammar. This paper will show that a corpus-based approach has made it possible to conduct new kinds of investigations into grammar in use and to expand the scope of earlier investigations. Native speakers rarely have accurate information about frequency of use. A large representative corpus (i.e., The British National Corpus) is one of the most reliable sources of frequency information. It is important to base an analysis of language on real data rather than intuition. Any description of grammar is more complete and accurate if it is based on a body of real data.

  • PDF