• 제목/요약/키워드: word boundary

Search Result 88, Processing Time 0.025 seconds

A Spatial Study about Olafur Eliasson's Emotional Atmospheric Experience of Gernot Böhme's Aesthetics (뵈메의 감성학을 통한 올라퍼 엘리아슨 공간의 지각적 분위기 체험 연구)

  • Jang, Su-Min;Kim, Kai-Chun
    • Korean Institute of Interior Design Journal
    • /
    • v.27 no.3
    • /
    • pp.108-115
    • /
    • 2018
  • The atmosphere is a popular word in everyday life. There is often an atmosphere when we enter a particular place. As if to say, The mood is perceived as an emotional and subjective word. Atmosphere is subjective and there are different feelings, but there are definitely certain feelings that people can relate to. The researcher examines the question in the paper and analyzes how the atmosphere in the space could be explained. So I will research about $B{\ddot{o}}hme^{\prime}s$ aesthetics which is called atmosphere. and analysis how his atmosphere is applied in nowadays art. So this study has two purposes. First is the notion of the atmosphere, not the atmosphere of rational perspective, it's about emotional and perceptual experiences. Therefore a connection about audience and arts is the most important focus in atmosphere. So the other purpose is Olafur Eliasson's Atmosphere. he is an artist about this perception. His work requires spectator intervention and participation to make it a perfect art. There is also a element in Eliasson's philosophy, in which the perceptual experiences of visitor's relationship between the work and the viewer, and eliminates the boundary as a perceptual expression.

A review of Chinese named entity recognition

  • Cheng, Jieren;Liu, Jingxin;Xu, Xinbin;Xia, Dongwan;Liu, Le;Sheng, Victor S.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.6
    • /
    • pp.2012-2030
    • /
    • 2021
  • Named Entity Recognition (NER) is used to identify entity nouns in the corpus such as Location, Person and Organization, etc. NER is also an important basic of research in various natural language fields. The processing of Chinese NER has some unique difficulties, for example, there is no obvious segmentation boundary between each Chinese character in a Chinese sentence. The Chinese NER task is often combined with Chinese word segmentation, and so on. In response to these problems, we summarize the recognition methods of Chinese NER. In this review, we first introduce the sequence labeling system and evaluation metrics of NER. Then, we divide Chinese NER methods into rule-based methods, statistics-based machine learning methods and deep learning-based methods. Subsequently, we analyze in detail the model framework based on deep learning and the typical Chinese NER methods. Finally, we put forward the current challenges and future research directions of Chinese NER technology.

MSFM: Multi-view Semantic Feature Fusion Model for Chinese Named Entity Recognition

  • Liu, Jingxin;Cheng, Jieren;Peng, Xin;Zhao, Zeli;Tang, Xiangyan;Sheng, Victor S.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.6
    • /
    • pp.1833-1848
    • /
    • 2022
  • Named entity recognition (NER) is an important basic task in the field of Natural Language Processing (NLP). Recently deep learning approaches by extracting word segmentation or character features have been proved to be effective for Chinese Named Entity Recognition (CNER). However, since this method of extracting features only focuses on extracting some of the features, it lacks textual information mining from multiple perspectives and dimensions, resulting in the model not being able to fully capture semantic features. To tackle this problem, we propose a novel Multi-view Semantic Feature Fusion Model (MSFM). The proposed model mainly consists of two core components, that is, Multi-view Semantic Feature Fusion Embedding Module (MFEM) and Multi-head Self-Attention Mechanism Module (MSAM). Specifically, the MFEM extracts character features, word boundary features, radical features, and pinyin features of Chinese characters. The acquired font shape, font sound, and font meaning features are fused to enhance the semantic information of Chinese characters with different granularities. Moreover, the MSAM is used to capture the dependencies between characters in a multi-dimensional subspace to better understand the semantic features of the context. Extensive experimental results on four benchmark datasets show that our method improves the overall performance of the CNER model.

THE NORTHERN BOUNDARY THE TSUSHIMA CURRENT AND ITS FOUCTUATIONS (하계 동해에 있어서 대마난류의 북상한계와 변동)

  • Hong, Chol-Hoon;Cho, Kyu-Dae
    • 한국해양학회지
    • /
    • v.18 no.1
    • /
    • pp.1-9
    • /
    • 1983
  • The northern boundary of the Tsusgima Current and its fluctuations are divcussed in the Japan Sea in summer. This current was characterized with high slinity, and its path was traced by following the salinity maximum on the basis of oceanographical data collected during the period from 1963 to 1979. The salinity maxima (34.45-34.85 ) of the Tsushima Current in the areas between 29 N in the East China Sea and northern part of the Japan Sea were found at depths between 46m and 135m. The representative thermosteric anomaly corresponding to the salinity maximum eas examined in order to analyze the advection of this currint. In the Tsushima Current region in the Japan Sen, the thermosteric anomaly values in the layer of salinity maximum during the period of 1970 to 1979 was beween 220 cl/t and 260 cl/t. In general, as the current moves northward its salinity decreascs, its thermosteric anomaly decreases and the depth of salinity maximum becomes shallower. The northern boundary of this current, which is indicated by 34.4 isohaline on 240 cl/t isanosteric surface during the study period of ten years, was confined to south of 40 N of the Japan Sea. The 34.4 isohaline edvealed two types of flow; one of them flows northward along the eastern coast of South Korea and then meanders eastward, while the oter flows basically northeastward along the coast of Japan. The meanders of northern boundary of this currint idrntified th isohaline in this word were nearly similar to those studied by others on the bases of isotherm analysis.

  • PDF

Analysis and Prediction of Prosodic Phrage Boundary (운율구 경계현상 분석 및 텍스트에서의 운율구 추출)

  • Kim, Sang-Hun;Seong, Cheol-Jae;Lee, Jung-Chul
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1
    • /
    • pp.24-32
    • /
    • 1997
  • This study aims to describe, at one aspect, the relativity between syntactic structure and prosodic phrasing, and at the other, to establish a suitable phrasing pattern to produce more natural synthetic speech. To get meaningful results, all the word boundaries in the prosodic database were statistically analyzed, and assigned by the proper boundary type. The resulting 10 types of prosodic boundaries were classified into 3 types according to the strength of the breaks, which are zero, minor, and major break respectively. We have found out that the durational information was a main cue to determine the major prosodic boundary. Using the bigram and trigram of syntactic information, we predicted major and minor classification of boundary types. With brigram model, we obtained the correct major break prediction rates of 4.60%, 38.2%, the insertion error rates of 22.8%, 8.4% on each Test-I and Test-II text database respectively. With trigram mode, we also obtained the correct major break prediction rates of 58.3%, 42.8%, the insertion error rates of 30.8%, 42.8%, the insertion error rates of 30.8%, 11.8% on Test-I and Test-II text database respectively.

  • PDF

A Study on the Perception of English Rhythm and Intonation Structure by Korea University Students (대학생의 영어 리듬과 억양구조 인식에 대한 연구)

  • Park Joo-Hyun
    • Proceedings of the KSPS conference
    • /
    • 1997.07a
    • /
    • pp.92-114
    • /
    • 1997
  • This study is aimed to grasp the actual problems of the perception of English rhythm and intonation structure by Korean University students who have studied English in the secondary schools for the past six years, and to establish the systems of English rhythm and intonation structure for the Korean students of English. For this study, the listening test is provided, and 100 students are chosen as the subjects of the study. The noticeable findings are summarized as follows: (1) Koreans perceive the words stress comparatively well in nonsense words, unfamiliar place names, and familiar word. (2) Koreans do not perceive the isochrony of English rhythm well enough. The perception of the sentence stress is very unstable, especially in the sentence involved in polysyllabic words, compound words, and 'emphatic stress' pr 'contrastive stress'(or in the different rhythmic patterns). (3) Koreans do not perceive the nucleus well enough. The perception of the nucleus is more stable in content words than in function words, at the end of a sentence than in the middle of a sentence, and in monosyllabic words than in the polysyllabic words. (4) Koreans do not perceive the boundary(or pause) of intonation group well enough. The perception of the pause is unstable in the long or complex sentence. (5) Koreans discriminate the meaning of English word stress comparatively well, especially in disyllabic words. But the discrimination is somewhat unstable in polysyllabic words and between 'adjective' and 'verb' (6) Koreans' discrimination of the intonation meaning is below the level. Koreans do not perceive the differences of intonation meaning according to the pitch accent or the focus. In conclusion, the writer will propose the procedures for the teaching of rhythm and intonation in the following order: word stress drill longrightarrowstressed and reduced syllables drilllongrightarrowrhythm group drilllongrightarrowthe varying rhythm drilllongrightarrowsentence stress drilllongrightarrownucleus drill longrightarrowintonation group drilllongrightarrowlong utterance drill of more than two intonation group.

  • PDF

Improved Sentence Boundary Detection Method for Web Documents (웹 문서를 위한 개선된 문장경계인식 방법)

  • Lee, Chung-Hee;Jang, Myung-Gil;Seo, Young-Hoon
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.6
    • /
    • pp.455-463
    • /
    • 2010
  • In this paper, we present an approach to sentence boundary detection for web documents that builds on statistical-based methods and uses rule-based correction. The proposed system uses the classification model learned offline using a training set of human-labeled web documents. The web documents have many word-spacing errors and frequently no punctuation mark that indicates the end of sentence boundary. As sentence boundary candidates, the proposed method considers every Ending Eomis as well as punctuation marks. We optimize engine performance by selecting the best feature, the best training data, and the best classification algorithm. For evaluation, we made two test sets; Set1 consisting of articles and blog documents and Set2 of web community documents. We use F-measure to compare results on a large variety of tasks, Detecting only periods as sentence boundary, our basis engine showed 96.5% in Set1 and 56.7% in Set2. We improved our basis engine by adapting features and the boundary search algorithm. For the final evaluation, we compared our adaptation engine with our basis engine in Set2. As a result, the adaptation engine obtained improvements over the basis engine by 39.6%. We proved the effectiveness of the proposed method in sentence boundary detection.

A Novel Model, Recurrent Fuzzy Associative Memory, for Recognizing Time-Series Patterns Contained Ambiguity and Its Application (모호성을 포함하고 있는 시계열 패턴인식을 위한 새로운 모델 RFAM과 그 응용)

  • Kim, Won;Lee, Joong-Jae;Kim, Gye-Young;Choi, Hyung-Il
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.449-456
    • /
    • 2004
  • This paper proposes a novel recognition model, a recurrent fuzzy associative memory(RFAM), for recognizing time-series patterns contained an ambiguity. RFAM is basically extended from FAM(Fuzzy Associative memory) by adding a recurrent layer which can be used to deal with sequential input patterns and to characterize their temporal relations. RFAM provides a Hebbian-style learning method which establishes the degree of association between input and output. The error back-propagation algorithm is also adopted to train the weights of the recurrent layer of RFAM. To evaluate the performance of the proposed model, we applied it to a word boundary detection problem of speech signal.

Effective Syllable Modeling for Korean Speech Recognition Using Continuous HMM (연속 은닉 마코프 모델을 이용한 한국어 음성 인식을 위한 효율적 음절 모델링)

  • 김봉완;이용주
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.1
    • /
    • pp.23-27
    • /
    • 2003
  • Recently attempts to we the syllable as the recognition unit to enhance performance in continuous speech recognition hate been reported. However, syllables are worse in their trainability than phones and the former have a disadvantage in that contort-dependent modeling is difficult across the syllable boundary since the number of models is much larger for syllables than for phones. In this paper, we propose a method to enhance the trainability for the syllables in Korean and phoneme-context dependent syllable modeling across the syllable boundary. An experiment in which the proposed method is applied to word recognition shows average 46.23% error reduction in comparison with the common syllable modeling. The right phone dependent syllable model showed 16.7% error reduction compared with a triphone model.

Recognition of Various Printed Hangul Images by using the Boundary Tracing Technique (경계선 기울기 방법을 이용한 다양한 인쇄체 한글의 인식)

  • Baek, Seung-Bok;Kang, Soon-Dae;Sohn, Young-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.1
    • /
    • pp.1-5
    • /
    • 2003
  • In this paper, we realized a system that converts the character images of the printed Korean alphabet (Hangul) to the editable text documents by using the black and white CCD camera, We were able to abstract the contours information of the character which is based on the structural character by using the boundary tracing technique that is strong to the noise on the character recognition. By using the contours information, we recognized the horizontal vowels and vertical vowels of the character image and classify the character into the six patterns. After that, the character is divided to the unit of the consonant and vowel. The vowels are recognized by using the maximum length projection. The separated consonants are recognized by comparing the inputted pattern with the standard pattern that has the phase information of the boundary line change. We realized a system that the recognized characters are inputted to the word editor with the editable KS Hangul completion type code.