• Title/Summary/Keyword: grammatical structures

Search Result 33, Processing Time 0.019 seconds

Web Catchphrase Improve System Employing Onomatopoeia and Large-Scale N-gram Corpus

  • Yamane, Hiroaki;Hagiwara, Masafumi
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.1
    • /
    • pp.94-100
    • /
    • 2012
  • In this paper, we propose a system which improves text catchphrases on the web using onomatopoeia and the Japanese Google N-grams. Onomatopoeia is regarded as a fundamental tool in daily communication for people. The proposed system inserts an onomatopoetic word into plain text catchphrases. Being based on a large catchphrase encyclopedia, the proposed system evaluates each catchphrase's candidates considering the words, structure and usage of onomatopoeia. That is, candidates are selected whether they contain onomatopoeia and they use specific catchphrase grammatical structures. Subjective experiments show that inserted onomatopoeia is effective for making attractive catchphrases.

The Grammatical Structure of Protein Sequences

  • Bystroff, Chris
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2000.11a
    • /
    • pp.28-31
    • /
    • 2000
  • We describe a hidden Markov model, HMMTIR, for general protein sequence based on the I-sites library of sequence-structure motifs. Unlike the linear HMMs used to model individual protein families, HMMSTR has a highly branched topology and captures recurrent local features of protein sequences and structures that transcend protein family boundaries. The model extends the I-sites library by describing the adjacencies of different sequence-structure motifs as observed in the database, and achieves a great reduction in parameters by representing overlapping motifs in a much more compact form. The HMM attributes a considerably higher probability to coding sequence than does an equivalent dipeptide model, predicts secondary structure with an accuracy of 74.6% and backbone torsion angles better than any previously reported method, and predicts the structural context of beta strands and turns with an accuracy that should be useful for tertiary structure prediction. HMMSTR has been incorporated into a public, fully-automated protein structure prediction server.

  • PDF

Syntax-Directed Document Editor based XML DTD (XML DTD 기반의 구문지향 문서 작성기)

  • Kim, Young-Chul;Kim, Sung-Keun;Choi, Jong-Myung
    • The Journal of Korean Association of Computer Education
    • /
    • v.7 no.4
    • /
    • pp.67-75
    • /
    • 2004
  • XML is being accepted as a standard for the next generation web documents, as it enables to extend the document structures. However, general users have difficulties in writing valid and well-formed XML documents, since the documents should satisfy the grammatical constraints of XML. In this paper, we present a syntax-directed XML document editor which will ease users in writing valid XML documents. The editor will help users, and increase productivity in writing XML documents.

  • PDF

Classification of Behavioral Lexicon and Definition of Upper, Lower Body Structures in Animation Character

  • Hongsik Pak;Suhyeon Choi;Taegu Lee
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.3
    • /
    • pp.103-117
    • /
    • 2023
  • This study focuses on the behavioural lexical classification for extracting animation character actions and the analysis of the character's upper and lower body movements. The behaviour and state of characters in the animation industry are crucial, and digital technology is enhancing the industry's value. However, research on animation motion application technology and behavioural lexical classification is still lacking. Therefore, this study aims to classify the predicates enabling animation motion, differentiate the upper and lower body movements of characters, and apply the behavioural lexicon's motion data. The necessity of this research lies in the potential contributions of advanced character motion technology to various industrial fields, and the use of the behavioural lexicon to elucidate and repurpose character motion. The research method applies a grammatical, behavioural, and semantic predicate classification and behavioural motion analysis based on the character's upper and lower body movements.

Topic Continuity in Korea Narrative (한국 설화문에서의 화제표현의 연속성)

  • Hi-JaChong
    • Korean Journal of Cognitive Science
    • /
    • v.2 no.2
    • /
    • pp.405-428
    • /
    • 1990
  • Language has a social function to communicate information. Linguists have gradually paid their attention to the function of language since the nineteen sixties, especially to the relationship of form, meaning and the function. The relationship could be more clearly grasped through disciyrse-based analysis than through sentence-based analysis. Many researches were centered on the discourse functional notion of topic. In the early 1970's the subject was defined as the grammatiocalized topic the topic as a discrete single constituent of the clause. In the late 1970's several lingusts including Givon suggerted that the topic was not an atomic, disctete entity, and that the clause could have more than one topic. The purpose of the present study is, following Givon, to study grammatical coding devices of topic and to measure the relative topic continuity/discontinuity of participant argu, ents in Korean narratives. By so doing, I would like to shed some light on effective ways of communicating information. The grammatical coding devices analyzed are the following eight structures: zero-anaphora, personal pronous, demonstrative pronouns, names, noun phrases following demonstratives, noun phrases following possessives, definite noun phrases and indefinite referentials. The narrative studied for the count was taken from the KoreanCIA chief's Testiomny:Revolution and Idol by Hyung Wook Kim. It was chosen because it was assumed that Kim's purpose in the novel was to tell a true story, which would not distort the natural use of language for literary effect. The measures taken in the analysis wre those of 'lookback', 'persistence', ambiguity'. The first of these, 'lookback', is a measure of the size of gap between the previous occurrence of a referent and its current occurence in the clause. The meausure of persistence, which is a measure of the speaker's topocal intent, reflects the topic's importance in the discourse. The third measure is a measure of ambiguity. This is necessary for assessing the disruptive effects that other topics within five previous clauses may have on topic identification. The more other topics are present within five previous clauses, the more difficult is the task of correct identification of a topic. The results of the present study show that the humanness of entities is the most powerful factior in topic continutiy in narrative discourse. The semantic roles of human arguments in narrative discourse tend to be agents or experiences. Since agents and experiences have high topicality in discourse, human entities clearly become clausal or discoursal topics. The results also show that the grammatical devices signal varying degrees of topic continuity discontinuity in continuous discourse. The more continuous a topic argument is, the less it is coded. For example, personal pronouns have the most continutiy and indefinite referentials have the least continutiy. The study strongly shows that topic continuity discontinutiy is controlled not only by grammatical devices available in the language but by socio-cultural factors and writer's intentions.

A study on detective story authors' style differentiation and style structure based on Text Mining (텍스트 마이닝 기법을 활용한 고전 추리 소설 작가 간 문체적 차이와 문체 구조에 대한 연구)

  • Moon, Seok Hyung;Kang, Juyoung
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.89-115
    • /
    • 2019
  • This study was conducted to present the stylistic differences between Arthur Conan Doyle and Agatha Christie, famous as writers of classical mystery novels, through data analysis, and further to present the analytical methodology of the study of style based on text mining. The reason why we chose mystery novels for our research is because the unique devices that exist in classical mystery novels have strong stylistic characteristics, and furthermore, by choosing Arthur Conan Doyle and Agatha Christie, who are also famous to the general reader, as subjects of analysis, so that people who are unfamiliar with the research can be familiar with them. The primary objective of this study is to identify how the differences exist within the text and to interpret the effects of these differences on the reader. Accordingly, in addition to events and characters, which are key elements of mystery novels, the writer's grammatical style of writing was defined in style and attempted to analyze it. Two series and four books were selected by each writer, and the text was divided into sentences to secure data. After measuring and granting the emotional score according to each sentence, the emotions of the page progress were visualized as a graph, and the trend of the event progress in the novel was identified under eight themes by applying Topic modeling according to the page. By organizing co-occurrence matrices and performing network analysis, we were able to visually see changes in relationships between people as events progressed. In addition, the entire sentence was divided into a grammatical system based on a total of six types of writing style to identify differences between writers and between works. This enabled us to identify not only the general grammatical writing style of the author, but also the inherent stylistic characteristics in their unconsciousness, and to interpret the effects of these characteristics on the reader. This series of research processes can help to understand the context of the entire text based on a defined understanding of the style, and furthermore, by integrating previously individually conducted stylistic studies. This prior understanding can also contribute to discovering and clarifying the existence of text in unstructured data, including online text. This could help enable more accurate recognition of emotions and delivery of commands on an interactive artificial intelligence platform that currently converts voice into natural language. In the face of increasing attempts to analyze online texts, including New Media, in many ways and discover social phenomena and managerial values, it is expected to contribute to more meaningful online text analysis and semantic interpretation through the links to these studies. However, the fact that the analysis data used in this study are two or four books by author can be considered as a limitation in that the data analysis was not attempted in sufficient quantities. The application of the writing characteristics applied to the Korean text even though it was an English text also could be limitation. The more diverse stylistic characteristics were limited to six, and the less likely interpretation was also considered as a limitation. In addition, it is also regrettable that the research was conducted by analyzing classical mystery novels rather than text that is commonly used today, and that various classical mystery novel writers were not compared. Subsequent research will attempt to increase the diversity of interpretations by taking into account a wider variety of grammatical systems and stylistic structures and will also be applied to the current frequently used online text analysis to assess the potential for interpretation. It is expected that this will enable the interpretation and definition of the specific structure of the style and that various usability can be considered.

Creating 3D Artificial Flowers using Structured Directed Graph and Interactive Genetic Algorithm (구조적 방향성 그래프와 대화형 유전자 알고리즘을 이용한 3차원 꽃의 생성)

  • 민현정;조성배
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.3
    • /
    • pp.267-275
    • /
    • 2004
  • Directed graph and Lindenmayer system (L-system) are two major encoding methods of representation to develop creatures in application field of artificial life. It is difficult to define real morphology structurally using the L-systems which are a grammatical rewriting system because L-systems represent genotype as loops, procedure calls, variables, and parameters. This paper defines a class of representations called structured directed graph, which is identified by its ability to define structures of the genotype in the translation to the phenotype, and presents an example of creating 3D flowers using a directed graph which is proper method to represent real morphology, and interactive genetic algorithm which decodes the problem with human's emotional evaluation. The experimental results show that natural flower morphology can be generated by the proposed method.

Resolving Multi-Translatable Verbs Japanese-TO-Korean Machine Translation

  • Kim Jung-In;Lee Kang-Hyuk
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.6
    • /
    • pp.790-797
    • /
    • 2005
  • It is well-known that there are many similarities between Japanese and Korean language. For example, the order of words and the nature of the grammatical conjugation of both languages are almost the same. Another similarity is the frequent omission of the subject from a sentence. Moreover, both languages have honorific expressions and the identical concept for expressing nouns in terms of Chinese characters. Using these similarities, we have developed a word-to-word translation system which does away with any deep level analysis of syntactic and semantic structures of the two languages. If we use these similarities, the direct translation method is superior to the internal language translation method or transfer-based translation method. Although the MT system based on the direct translation method is more easily developed than the ones based on other methods, it may have a lot of difficulties when it tries to select the appropriate target word from ambiguous source verbs. In this paper, we propose a new algorithm to extract the meaning of substantives and to make use of the order of the extracted meaning. We could select $86.5\%$ appropriate verbs in the sample sentences from IPAL-verb-dictionary. $13.5\%$ indicates the cases in which we could not distinguish the meaning of substantives. We are convinced, however, that the succeeding rate can be increased by getting rid of the meaning of verbs thatare not used so often.

  • PDF

Headedness Parameter and the Acquisition of Null Anaphor (문장의 머리방향 매개변수(headedness parameter)와 공조응사(null anaphor)습득)

  • 조숙환
    • Korean Journal of Cognitive Science
    • /
    • v.1 no.1
    • /
    • pp.145-164
    • /
    • 1989
  • The present paper studies the development of null anaphor,with special attention to whether Korean children's use of null anaphora exhibits the type of directionality preforence predicted by Lust & Mangione(1983)and Lust(1986).If these researchers are correct,anaphora in Korean children's language should be constrained backward since Korean is a left-branching language.That is,it is predicted that Korean chidren would perfer backward anaphora to the forward pattern.For the purpose of this study, ninety-six children were individually tested in an elicited imitation task, twenty-four children from four age groups(4:1-9:7).Three types of constructions,sentences involving a redundant NP,forward patterns of ananphora,and backsward patterns of anaphora were devised.It was discovered that like English speaking children, Korean children prefer forward patterns of anaphora to backward structures.It was thus speculated that the forward preference may well be indedpendent of grammatical factors,reflecting instead processing considerations which favour mention of the referent befor the use of anaphoric elements.

Analysis of the 3rd Graders' Solving Processes of the Word Problems by Nominalization (수학 문장제의 명사화 여부에 따른 초등학교 3학년의 해결 과정 분석)

  • Kang, Yunji;Chang, Hyewon
    • Education of Primary School Mathematics
    • /
    • v.26 no.2
    • /
    • pp.83-97
    • /
    • 2023
  • Nominalization is one of the grammatical metaphors that makes it easier to mathematize the target that needs to be converted into a formula, but it has the disadvantage of making problem understanding difficult due to complex and compressed sentence structures. To investigate how this nominalization affects students' problem-solving processes, an analysis was conducted on 233 third-grade elementary school students' problem solving of eight arithmetic word problems with or without nominalization. The analysis showed that the presence or absence of nominalization did not have a significant impact on their problem understanding and their ability to convert sentences to formulas. Although the students did not have any prior experience in nominalization, they restructured the sentences by using nominalization or agnation in the problem understanding stage. When the types of nominalization change, the rate of setting the formula correctly appeared high. Through this, the use of nominalization can be a pedagogical strategy for solving word problems and can be expected to help facilitate deeper understanding.