• Title/Summary/Keyword: 개체 기반

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems / v.23 no.2 / pp.71-88 / 2017
  • Language models were originally developed for speech recognition and language processing. Given a set of example sentences, a language model predicts the next word or character from sequential input data. N-gram models have been widely used, but they cannot model the correlation between input units efficiently because they are probabilistic models based on the frequency of each unit in the training set. Recently, with the development of deep learning algorithms, recurrent neural network (RNN) models and long short-term memory (LSTM) models have been widely used for neural language modeling (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependencies between objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). To train a neural language model, texts need to be decomposed into words or morphemes. However, because a training set of sentences generally includes a huge number of words or morphemes, the dictionary is very large, which increases model complexity. In addition, word-level and morpheme-level models can generate only the vocabulary contained in the training set. Furthermore, for morphologically rich languages such as Turkish, Hungarian, Russian, Finnish, or Korean, morpheme analyzers are more likely to make errors during decomposition (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean based on LSTM models. A phoneme, such as a vowel or a consonant, is the smallest unit that makes up Korean text. We constructed the language models using three or four LSTM layers. Each model was trained using the stochastic gradient algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. The simulation study was done on Old Testament texts using the deep learning package Keras running on Theano.
After pre-processing, the dataset contained 74 unique characters, including vowels, consonants, and punctuation marks. We then constructed each input vector from 20 consecutive characters, with the following 21st character as the output. In total, the dataset included 1,023,411 input-output pairs, which we divided into training, validation, and test sets in the proportion 70:15:15. All simulations were conducted on a system equipped with an Intel Xeon CPU (16 cores) and an NVIDIA GeForce GTX 1080 GPU. We compared the loss evaluated on the validation set, the perplexity evaluated on the test set, and the training time of each model. All the optimization algorithms except the stochastic gradient algorithm showed similar validation loss and perplexity, clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm also took the longest to train for both the 3- and 4-layer LSTM models. On average, the 4-layer LSTM model took 69% longer to train than the 3-layer model, yet its validation loss and perplexity did not improve significantly and even became worse under specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-layer LSTM model tended to generate sentences closer to natural language than the 3-layer model. Although the completeness of the generated sentences differed slightly between the models, sentence generation performance was quite satisfactory under all simulation conditions: the models generated only legitimate Korean letters, and the use of postpositions and the conjugation of verbs were almost perfectly grammatical. The results of this study are expected to be widely used for Korean language processing in the fields of language processing and speech recognition, which are the basis of artificial intelligence systems.
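The data preparation and evaluation described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function names (`to_phonemes`, `make_windows`, `split_70_15_15`, `perplexity`) are hypothetical, the decomposition relies on the standard Unicode Hangul syllable layout rather than on the paper's own pre-processing, and the sequential (unshuffled) split is an assumption, since the abstract does not say how pairs were assigned to each set.

```python
import math

# Standard Unicode layout of precomposed Hangul syllables (U+AC00..U+D7A3):
# code = 0xAC00 + (initial * 21 + vowel) * 28 + final
INITIALS = [chr(0x1100 + i) for i in range(19)]       # leading consonants
VOWELS = [chr(0x1161 + i) for i in range(21)]         # medial vowels
FINALS = [""] + [chr(0x11A8 + i) for i in range(27)]  # optional final consonants


def to_phonemes(text):
    """Split Hangul syllables into consonant/vowel units; keep other characters."""
    units = []
    for ch in text:
        offset = ord(ch) - 0xAC00
        if 0 <= offset < 19 * 21 * 28:
            initial, rest = divmod(offset, 21 * 28)
            vowel, final = divmod(rest, 28)
            units.append(INITIALS[initial])
            units.append(VOWELS[vowel])
            if final:
                units.append(FINALS[final])
        else:
            units.append(ch)  # punctuation, spaces, and so on
    return units


def make_windows(seq, window=20):
    """Pair each run of `window` consecutive units with the unit that follows it."""
    return [(seq[i:i + window], seq[i + window]) for i in range(len(seq) - window)]


def split_70_15_15(pairs):
    """Sequential 70:15:15 split (assumption: no shuffling)."""
    n = len(pairs)
    a, b = n * 70 // 100, n * 85 // 100
    return pairs[:a], pairs[a:b], pairs[b:]


def perplexity(true_unit_probs):
    """exp of the mean negative log-probability assigned to the true next units."""
    nll = -sum(math.log(p) for p in true_unit_probs) / len(true_unit_probs)
    return math.exp(nll)
```

As a point of reference, a model that spreads probability uniformly over the 74 characters would have a perplexity of 74 (up to floating-point error), so a trained model should score well below that.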

A Basic Study on the Establishment of Preservation and Management for Natural Monument(No.374) Pyeongdae-ri Torreya nucifera forest of Jeju (천연기념물 제374호 제주 평대리 비자나무 숲의 보존·관리방향 설정을 위한 기초연구)

  • Lee, Won-Ho;Kim, Dong-Hyun;Kim, Jae-Ung;Oh, Hae-Sung;Choi, Byung-Ki;Lee, Jong-Sung
    • Journal of the Korean Institute of Traditional Landscape Architecture / v.32 no.1 / pp.93-106 / 2014
  • This study analyzed the site environment, investigated vegetation resources, surveyed the management status, and classified management areas for Natural Monument No. 374, the Pyeongdae-ri Torreya nucifera forest. The results were as follows. First, there is concern that the Torreya nucifera forest will be affected by development as surrounding land use changes to agriculture. A preservation and management plan should therefore be established to preserve the forest's original form, and development activities that alter the terrain should be excluded, because the Gotjawal within the Torreya nucifera forest is the basis for its species diversity. Second, the flora of the Torreya nucifera forest was summarized as 402 taxa, comprising 91 families, 263 genera, 353 species, 41 varieties, and 8 forms. Plants designated by the Ministry of Environment as first-grade and second-grade endangered species occur in the forest, but they are critically threatened by habitat change, disease, and illegal over-collection. Protection should therefore be given priority when establishing a forest management plan. Third, Torreya nucifera dominates the upper layer of the vegetation structure, but a management and conservation strategy oriented toward old trees has resulted in a poor age structure. Furthermore, desiccation of the forest caused by artificial management and the decline of Torreya nucifera habitat through ecological succession may pose problems for the forest. Plans such as regulating population density and propagating saplings should therefore be established to sustain the character of the Torreya nucifera forest. Fourth, the trails have been damaged by use. In particular, the scoria trail shows extensive damage and has a higher share ratio than the other sections. A plan to reduce its share ratio through the development of additional tour routes should therefore be considered, rather than replacement of the scoria.
Fifth, plant elements were confirmed as the most preferred tourism factor of the Torreya nucifera forest. Because these elements are sensitive to usage pressure and are non-permanent, they require continuous monitoring. In addition, further plans are needed, such as developing additional tourism elements and actively using the highly preferred ones. Sixth, the strength of protection should differ according to importance. First-grade areas should maintain plant populations and natural habitats, and management should be directed to that end. Second-grade areas should focus on the annual regeneration of the forest. Third-grade areas should be used as demonstration forest or set aside for propagating saplings. Fourth-grade areas, where disturbance of the proper vegetation is often found, require the introduction of a partial rest system. Fifth-grade areas are appropriate as service areas for promoting tourism using the natural resources of the Torreya nucifera forest. Furthermore, buffer zones should be installed in relatively low-rated areas, and periodic monitoring is needed to mitigate edge effects where areas of different grades adjoin.

Effects of Rye Silage on Growth Performance, Blood Characteristics, and Carcass Quality in Finishing Pigs (호맥 사일리지의 급여기간이 비육돈의 생산성, 혈액 성상 및 도체특성에 미치는 영향)

  • Shin, Seung-Oh;Han, Young-Keun;Cho, Jin-Ho;Kim, Hae-Jin;Chen, Ying-Jie;Yoo, Jong-Sang;Whang, Kwang-Youn;Kim, Jung-Woo;Kim, In-Ho
    • Food Science of Animal Resources / v.27 no.4 / pp.392-400 / 2007
  • This experiment was conducted to evaluate the effects of various periods of rye silage feeding on the growth performance, blood characteristics, and carcass quality of finishing pigs. A total of sixteen (Landrace × Yorkshire × Duroc) pigs (average initial body weight 90.26 kg) were tested in individual cages for a 30-day period. Dietary treatments included 1) CON (basal diet), 2) S10 (basal diet for 20 days and 3% rye silage for 10 days), 3) S20 (basal diet for 10 days and 3% rye silage for 20 days), and 4) S30 (3% rye silage for 30 days). There were no significant differences in average daily gain (ADG) or gain/feed ratio among the treatments (p>0.05); however, average daily feed intake (ADFI) was higher in pigs fed the CON diet than in pigs fed diets with rye silage (p<0.05). Dry matter (DM) digestibility was higher with the S20 diet than with the S30 diet (p<0.05). With regard to blood characteristics, pigs fed rye silage had a significantly lower cortisol concentration than pigs fed the CON diet (p<0.05). Backfat thickness was higher with the CON diet than with the S20 or S30 diets (p<0.05). Regarding the fatty acid content of the lean meat, C18:0 and total saturated fatty acids (SFA) were significantly higher with the CON diet than with the other diets (p<0.05), whereas C18:1n9, total monounsaturated fatty acids (MUFA), and the unsaturated-to-saturated fatty acid (UFA/SFA) ratio were significantly lower with the CON diet than with the other diets (p<0.05). Regarding the fatty acid content of the fat, the levels of C18:1n9 and MUFA were similar between the S20 and S30 diets, and both were higher than with the CON or S10 diets (p<0.05). In conclusion, feed intake and DM digestibility were affected by rye silage, and the cortisol concentration, backfat thickness, and fatty acid composition of pork were positively affected by feeding pigs rye silage.