• Title/Summary/Keyword: 어휘모델

Search Result 304, Processing Time 0.036 seconds

Ontology-based u-Healthcare System for Patient-centric Service (환자중심서비스를 위한 온톨로지 기반의 u-Healthcare 시스템)

  • Jung, Yong Gyu;Lee, Jeong Chan;Jang, Eun Ji
    • Journal of Service Research and Studies
    • /
    • v.2 no.2
    • /
    • pp.45-51
    • /
    • 2012
  • U-healthcare is real-time monitoring of personal biometric information using by portable devices, home network and information and communication technology based healthcare systems, and fused together automatically to overcome the constraints of time and space are connected with hospitals and doctors. As u-healthcare gives health service in anytime and anywhere, it becomes to be a new type of medical services in patients management and disease prevention. In this paper, recent changes in prevention-oriented care is analyzed in becoming early response for Healthcare Information System by requirements analysis for technology development trend. According to the healthcare system, PACS, OCS, EMR and emergency medical system, U-healthcare is presenting the design of a patient-centered integrated client system. As the relationship between the meaning of the terms is used in the ontology, information models in the system is providing a common vocabulary with various levels of formality. In this paper, we propose an ontology-based system for patient-centered services, including the concept of clustering to clustering the data to define the relationship between these ontologies for more systematic data.

  • PDF

Development of Collaborative Filtering based User Recommender Systems for Water Leisure Boat Model Design (수상레저용 보트 설계를 위한 협력적 필터링 기반 사용자 추천시스템 개발)

  • Oh, Joong-Duk;Park, Chan-Hong;Kim, Chong-Soo;Seong, Hyeon-Kyeong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.413-416
    • /
    • 2014
  • Recently, demand for various leisure sports gradually increases, as people's sense of values changes into leisure-centered one according to the change of given social circumstance and the change of customer needs all over the world. The actual condition is that an interest and participation rate especially in water leports during the summer increases. And needs for various hull design of standardized boat for water leisure increase. Therefore, this paper is intended to develop a recommendation system to design a boat for water leisure by using the collaborative filtering technique in order to make it possible to actively cope with the change of various customer needs for hull design. To this end, emotion relating to kayak design was selected through consumer survey, and emotion was derived by factor analysis and assessment, and then a kayak design layout in the aspect of customer's emotional preference was presented. Besides, an analysis was made according to the elements such as hull, body, and propulsion system of kayak in order to select emotional words according to the kayak design reflecting user's preference, and then a boat model for water leisure in conformance with user's preference was presented.

  • PDF

A Multi-level Representation of the Korean Narrative Text Processing and Construction-Integration Theory: Morpho- syntactic and Discourse-Pragmatic Effects of Verb Modality on Topic Continuity (한국어 서사 텍스트 처리의 다중 표상과 구성 통합 이론: 주제어 연속성에 대한 양태 어미의 형태 통사적, 담화 화용적 기능)

  • Cho Sook-Whan;Kim Say-Young
    • Korean Journal of Cognitive Science
    • /
    • v.17 no.2
    • /
    • pp.103-118
    • /
    • 2006
  • The main purpose of this paper is to investigate the effects of discourse topic and morpho-syntactic verbal information on the resolution of null pronouns in the Korean narrative text within the framework of the construction-integration theory (Kintsch, 1988, Singer & Kintsch, 2001, Graesser, Gernsbacher, & Goldman. 2003). For the purpose of this paper, two conditions were designed: an explicit condition with both a consistently maintained discourse topic and the person-specific verb modals on one hand, and a neutral condition with no discourse topic or morpho-syntactic information provided, on the other. We measured the reading tines far the target sentence containing a null pronoun and the question response times for finding an antecedent, and the accuracy rates for finding an antecedent. During the experiments each passage was presented at a tine on a computer-controlled display. Each new sentence was presented on the screen at the moment the participant pressed the button on the computer keyboard. Main findings indicate that processing is facilitated by macro-structure (topicality) in conjunction with micro-structure (morpho-syntax) in pronoun interpretation. It is speculated that global processing alone may not be able to determine which potential antecedent is to be focused unless aided by lexical information. It is argued that the results largely support the resonance-based model, but not the minimalist hypothesis.

  • PDF

Korean Part-Of-Speech Tagging by using Head-Tail Tokenization (Head-Tail 토큰화 기법을 이용한 한국어 품사 태깅)

  • Suh, Hyun-Jae;Kim, Jung-Min;Kang, Seung-Shik
    • Smart Media Journal
    • /
    • v.11 no.5
    • /
    • pp.17-25
    • /
    • 2022
  • Korean part-of-speech taggers decompose a compound morpheme into unit morphemes and attach part-of-speech tags. So, here is a disadvantage that part-of-speech for morphemes are over-classified in detail and complex word types are generated depending on the purpose of the taggers. When using the part-of-speech tagger for keyword extraction in deep learning based language processing, it is not required to decompose compound particles and verb-endings. In this study, the part-of-speech tagging problem is simplified by using a Head-Tail tokenization technique that divides only two types of tokens, a lexical morpheme part and a grammatical morpheme part that the problem of excessively decomposed morpheme was solved. Part-of-speech tagging was attempted with a statistical technique and a deep learning model on the Head-Tail tokenized corpus, and the accuracy of each model was evaluated. Part-of-speech tagging was implemented by TnT tagger, a statistical-based part-of-speech tagger, and Bi-LSTM tagger, a deep learning-based part-of-speech tagger. TnT tagger and Bi-LSTM tagger were trained on the Head-Tail tokenized corpus to measure the part-of-speech tagging accuracy. As a result, it showed that the Bi-LSTM tagger performs part-of-speech tagging with a high accuracy of 99.52% compared to 97.00% for the TnT tagger.

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.

작업부하에대한 심리/환경적 영향 평가기법 개발: 제조업체를 중심으로

  • 박창순;조영진;김정룡
    • Proceedings of the ESK Conference
    • /
    • 1998.04a
    • /
    • pp.40-45
    • /
    • 1998
  • 본 연구에서는 현장 작업자 개인이 느끼는 심리/환경적 부하를 정량적으로 파악할 수 있는 척도를 개발하였다. 이러한 척도는 현장의 중간관리자들이 간단한 기구와 직접 관찰을 통해 쉽게 사용할 수 있도록 설계되었다. 심리적 작업부하 측정은 기존 연구와 현장 작업자를 대상으로 한 인터뷰 결과를 바탕으로, 타당도가 높다고 인정되는 심리부하 측정 요소를 결정하였고, 문헌과 모의 검사를 통하여 부 담감이나 편의(bias)를 줄일 수 있도록 문항수 및 어휘를 선택하였으며, 각각의 설문은 다양한 문체 중 가장 신뢰도가 높은 형태를 선정하였다. 환경적 작업부하 측정은 문헌 조사를 통하여 측정을 위한 요소를 결덩하였고, KS 규격을 기존으로 새로운 환경부하 측정 지침서를 개발하였고, 현장 예비조사가 실시되었다. 또한, 환경부하 측정결과를 지수화하기 위한 수리적 모델과, 심리적척도와의 통합을 위 한 수리적 방법론이 제시되었다. 이러한 설문 문항과 환경조사방법은 작업장을 대상으로 적용하여 실 제로 그 효용성 여부를 검토하였다. 연구 결과 이제까지 소홀히 여겨왔던 작업자의 심리적/환경적 부 하를 정량적으로 간편하게 측정할 수 있는 방법이 개발되었고, 정량화한 심리/환경 부하 척도는 이제 까지 주로 조사되었던 생체역학적/생리적 부담도와 더불어 종합적인 작업부하 평가에 일익을 담당할 것으로 예상된다. 또한, 이러한 척도에 의한 결과는 현장 근로자들의 근로여건 개선을 위하여 노/사/ 정부 모두가 만족할 수 있는 객관적 자료로 사용될 수 있으며, 예방차원의 안전관리에도 응용될 수 있 을 것으로 기대한다.구로 신체의 종합적인 만족도, 선호를 취급하고 있을 뿐 신체각 부위의 만족 도나 선호에 관한 자세한 고찰을 하고 있지 않으며 자신의 신체에 대한 인식도 및 실제체형과의 비교는 이루어지고 있지 않다. 이에, 신체 각 부위에 대한 인식도 및 실제 신체 측정치와 만족도와의 관계 및 이상형에 대해 구체적으로 파악할 필요가 있다. 또한, 신체에 대한 이상형은 시대의 여러 여건에 따라서 변화할 수 있으므로 의복 착용자가 의복을 통해서 표현하고자 하는 이상형의 시대적 변화를 살펴볼 필요가 있다. 따라서 본 연구에서는 신체에 대한 인식도 및 만족도, 이상형에 대한 설문지 조사와 신체측정을 통하여 신체 크기에대한 만족도를 객관적인 척도로 고찰하고, 이상형과 실제 체형에 관하여 고찰하고자 한다. 도한, 1992년도 자료와의 비교를 통하여 시대에 따른 신체만족도와 이상형의 변화를 파악하고자 한다. 이를 기초로 한 의복원형 제작 및 의복 디자인에 대한 연구를 통해 의복의 맞음새가 좋을뿐만 아니라 의복착용자들 에게 심리적 만족을 줄 수 있는 의복 제작에 도움이 될 수 있을 것이다.적입지로 분석되었다.등 다양한 모형들을 고려해 본 뒤, 적절한 모형을 적용할 것이다. 가로망 설계 모형에서 신호제어를 고려하기 위해서는 주어진 가로망에 대한 통행 배정과정에서 고려되는 통행시간을 링크통행시간과 교차로 지체시간을 동시에 고려해야 하는데, 이러한 문제의 해결을 위해서 최근 활발히 논의되고 있는 교차로에서의 신호제어에 대응하는 통행배정 모형을 도입하여 고려하고자 한다. 이를 위해서 지금까지 연구되어온 Global

  • PDF

Korean Abbreviation Generation using Sequence to Sequence Learning (Sequence-to-sequence 학습을 이용한 한국어 약어 생성)

  • Choi, Su Jeong;Park, Seong-Bae;Kim, Kweon-Yang
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.3
    • /
    • pp.183-187
    • /
    • 2017
  • Smart phone users prefer fast reading and texting. Hence, users frequently use abbreviated sequences of words and phrases. Nowadays, abbreviations are widely used from chat terms to technical terms. Therefore, gathering abbreviations would be helpful to many services, including information retrieval, recommendation system, and so on. However, manually gathering abbreviations needs to much effort and cost. This is because new abbreviations are continuously generated whenever a new material such as a TV program or a phenomenon is made. Thus it is required to generate of abbreviations automatically. To generate Korean abbreviations, the existing methods use the rule-based approach. The rule-based approach has limitations, in that it is unable to generate irregular abbreviations. Another problem is to decide the correct abbreviation among candidate abbreviations generated rules. To address the limitations, we propose a method of generating Korean abbreviations automatically using sequence-to-sequence learning in this paper. The sequence-to-sequence learning can generate irregular abbreviation and does not lead to the problem of deciding correct abbreviation among candidate abbreviations. Accordingly, it is suitable for generating Korean abbreviations. To evaluate the proposed method, we use dataset of two type. As experimental results, we prove that our method is effective for irregular abbreviations.

Application of DNA microarry : Comparative functional genomic approach

  • Chu In-Sun
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2006.02a
    • /
    • pp.109-114
    • /
    • 2006
  • 최근 Human 지놈 프로젝트를 포함한 다양한 종의 지놈 프로젝트가 수행되고 수많은 지놈정보가 생산되고 있으며 이를 해석하고 서로 연관성를 찾기 위한 다양한 연구가 진행되고 있다. 즉 최신 생명공학과 관련된 연구방향이 DNA의 구조적 해석에서 기능 해석과 유전자들의 상호연관성을 규명하는 방향으로 변화하고 있으며 이를 위한 강력한 도구로서 DNA microarray (DNA chip)는 방대한 양의 지놈 정보를 이용하여 단시간에 대량으로 고속처리하여 효율적으로 유전자 기능을 분석할 수 있는 주목받고 있는 방법이다. DNA microarray 실험과 분석에 있어 데이터분석, 재현성, 종간의 비교, 확인실험 및 비용 등의 문제가 있지만 유전자발현양상 데이터로부터 정확한 환자의 예후를 예측할 수 있는 비교적 적은 유전자 그룹의 진단마커를 찾거나, 하나의 유전자가 아니라 mouse 전체 지놈의 유전자발현 패턴을 인간의 암을 위시한 각종 질병 연구를 위한 발현 신호나 변화 등을 발견하여 신약개발 등에 활용하고자 하는 시도가 활발히 진행되고 있다. 서로 다른 종간에 비슷한 phenotype의 유전자발현도 진화적으로 보존되었다는 전제 하에서 지놈 sequence의 비교연구가 가능하고 DNA microarray 발현 데이터에 근거하여 독립적으로 각 종간의 유전자발현패턴을 비교함으로써 난치병 등을 새롭게 분류할 수 있다. 즉, 암세포 등에서 유전자발현 양상은 유전학적, 환경적 alteration들이 잘 반영되어 있다고 간주하고, 이러한 양상을 바탕으로 인간의 암을 위시한 다양한 질병 연구를 위한 최적의 mouse 모델을 찾을 수 있고, 이는 결국 새로운 치료 방법 개발이나 맞춤의학 실현에 중요한 역할을 할 것으로 기대된다. 특히 pathway 타겟으로 하는 치료를 위해서는 Human-mouse 비교를 통한 발현 신호를 찾는 것이 진단에서는 매우 유용한 방법이다. 이를 위한 고성능의 분석방법이나 시스템의 개발이 중요하게 된다.. 관류의 정도와 조영증강정도를 중심으로 관류 MR 영상소견과 조직학적 소견을 관련지어 분석하였다. 결과: 조영증강 T1강조MR영상에서 환상조영증강을 보이는 다형성 교보세포종 2예에서는 변연부 외륜이 고관류를, 중심부의 괴사부위는 저관류로 나타났다. 저등급 교종은 경계가 불분명한 저관류부위로 보였다. 뇌농양 2예는 변연부 외륜이 경도의 고관류를, 중심부는 저관류로 나타났다. 뇌수막종은 미만성의 균일한 중등도 혹은 고도의 고관류로 보였으며, 임파종과 배아종은 경계가 명확한 저관류부위로 나타났다. 신경세포종은 종괴\ulcorner 일부에 중등도 혹은 고도의 고관류부위가 관찰되었고, 전이암은 다수병변중 일부에서 중등도의 고관류를 보였다. 방사선괴사는 저관류부위내에 국소적 고관류부위를 보였다. 결론: 관류 MR영상은 뇌종양의 관류상태를 비교적 잘 반영하며, 조직학적 특성을 예측하는데에 도움을 주 수 있을 것으로 기대된다. 뇌종야에서의 관류MR영상의 분명한 역할을 규명하기 위해서는 앞으로 더 많은 임상적 연구가 필요할 것으로 생각된다.조증 환자의 자극성 전타액내 lactobacilli양은 peroxidase system을 함유한 세치제를 사용한 군에서 대조군에 비해 상대적으로 낮게 나타났으나(p = 0.067) 통계학적 유의성은 없었다.같은 예에서 찾아 볼 수 있다. 첫째, 발음상으로 동사의 변화형에서 "porte[$p{\jmath}rte$](들다: 현재형), porte[$p{\jmath}rte$](과거분사형), porta[$p{\jmath}rte$](단순과거형)"등이 대립되며, 이휘 "Porto[$p{\jmath}rte$](포르토)"와도 대립된다. 둘째, 어휘적 대립 "le haut[$l{\partial}o$](위)/l'eau[lo](물)"와 형태론적 대립 "le[$l{\partial}$](정관사, 남성단수)/l

  • PDF

Debelppment of C++ Compiler and Programming Environment (C++컴파일러 및 프로그래밍 환경 개발)

  • Jang, Cheon-Hyeon;O, Se-Man
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.3
    • /
    • pp.831-845
    • /
    • 1997
  • In this paper,we proposed and developed a compiler and interactive programming enviroments for C++ wich is mostly worth of nitice among the object -oriented languages.To develope the compiler for C++ we took front=end/back-end model using EM virtual machine.In develpoing Front-End,we formailized C++ gram-mar with the context semsitive tokens which must be manipulated by dexical scanner and designed a AST class li-brary which is the hierarchy of AST node class and well defined interface among them,In develpoing Bacik-End,we proposed model for three major components :code oprtimizer,code generator and run-time enviroments.We emphasized the retargatable back-end which can be systrmatically reconfigured to genrate code for a variety of distinct target computers.We also developed terr pattern matching algorithm and implemented target code gen-erator which produce SPARC code.We also proposed the theroy and model for construction interative pro-gramming enviroments. To represent language features we adopt AST as internal reprsentation and propose uncremental analysis algorithm and viseal digrams.We also studied unparsing scheme, visual diagram,graphical user interface to generate interactive environments automatically Results of our resarch will be very useful for developing a complier and programming environments, and also can be used in compilers for parallel and distributed enviroments.

  • PDF

Exploring user experience factors through generational online review analysis of AI speakers (인공지능 스피커의 세대별 온라인 리뷰 분석을 통한 사용자 경험 요인 탐색)

  • Park, Jeongeun;Yang, Dong-Uk;Kim, Ha-Young
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.7
    • /
    • pp.193-205
    • /
    • 2021
  • The AI speaker market is growing steadily. However, the satisfaction of actual users is only 42%. Therefore, in this paper, we collected reviews on Amazon Echo Dot 3rd and 4th generation models to analyze what hinders the user experience through the topic changes and emotional changes of each generation of AI speakers. By using topic modeling analysis techniques, we found changes in topics and topics that make up reviews for each generation, and examined how user sentiment on topics changed according to generation through deep learning-based sentiment analysis. As a result of topic modeling, five topics were derived for each generation. In the case of the 3rd generation, the topic representing general features of the speaker acted as a positive factor for the product, while user convenience features acted as negative factor. Conversely, in the 4th generation, general features were negatively, and convenience features were positively derived. This analysis is significant in that it can present analysis results that take into account not only lexical features but also contextual features of the entire sentence in terms of methodology.