• Title/Summary/Keyword: 자료망의 크기


Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Given a set of example sentences, a language model predicts the next word or character from sequential input data. N-gram models have been widely used, but they cannot model the correlation between input units efficiently because they are probabilistic models based on the frequency of each unit in the training set. Recently, with the development of deep learning algorithms, recurrent neural network (RNN) models and long short-term memory (LSTM) models have been widely used for neural language models (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependencies between objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). To train a neural language model, texts need to be decomposed into words or morphemes. However, because a training set of sentences generally includes a huge number of words or morphemes, the dictionary is very large, which increases model complexity. In addition, word-level or morpheme-level models can generate only the vocabulary contained in the training set. Furthermore, with morphologically rich languages such as Turkish, Hungarian, Russian, Finnish, or Korean, morpheme analyzers are more likely to make errors in the decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean based on LSTM models. A phoneme, such as a vowel or a consonant, is the smallest unit that makes up Korean text. We construct the language model using three or four LSTM layers. Each model was trained using the stochastic gradient algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. The simulation study was conducted on Old Testament texts using the deep learning package Keras, based on Theano. 
After pre-processing the texts, the dataset included 74 unique characters, including vowels, consonants, and punctuation marks. We then constructed each input vector from 20 consecutive characters and each output from the following 21st character. In total, 1,023,411 input-output pairs were included in the dataset, and we divided them into training, validation, and test sets in the proportion 70:15:15. All simulations were conducted on a system equipped with an Intel Xeon CPU (16 cores) and an NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated on the validation set, the perplexity evaluated on the test set, and the time taken to train each model. As a result, all the optimization algorithms except the stochastic gradient algorithm showed similar validation loss and perplexity, which were clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm also took the longest to train for both the 3- and 4-LSTM-layer models. On average, the 4-LSTM-layer model took 69% longer to train than the 3-LSTM-layer model. However, the validation loss and perplexity did not improve significantly and even became worse under specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM-layer model tended to generate sentences closer to natural language than the 3-LSTM-layer model. Although there were slight differences in the completeness of the generated sentences between the models, sentence generation performance was quite satisfactory under all simulation conditions: the models generated only legitimate Korean letters, and the use of postpositions and the conjugation of verbs were almost perfectly grammatical. The results of this study are expected to be widely used for processing the Korean language in the fields of language processing and speech recognition, which are the basis of artificial intelligence systems.
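
The data preparation described in this abstract (phoneme-level symbols, 20-symbol inputs with the 21st symbol as the target, a 70:15:15 split, and perplexity as the test metric) can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names are hypothetical, Unicode NFD normalization stands in for whatever phoneme decomposition the authors actually used, and the abstract does not say whether the pairs were shuffled before splitting.

```python
import math
import unicodedata


def to_phonemes(text):
    # Assumption: NFD normalization splits each precomposed Hangul syllable
    # into its conjoining jamo (initial, medial, optional final).
    return unicodedata.normalize("NFD", text)


def make_windows(symbols, window=20):
    # Each input is `window` consecutive symbols; the target is the next one.
    return [
        (symbols[i:i + window], symbols[i + window])
        for i in range(len(symbols) - window)
    ]


def split_70_15_15(pairs):
    # Sequential split without shuffling (an assumption, not from the paper).
    n = len(pairs)
    n_train = n * 70 // 100
    n_val = n * 15 // 100
    return (
        pairs[:n_train],
        pairs[n_train:n_train + n_val],
        pairs[n_train + n_val:],
    )


def perplexity(target_probabilities):
    # Perplexity = exp of the mean negative log-probability that the model
    # assigned to each true next symbol.
    mean_nll = -sum(math.log(p) for p in target_probabilities) / len(target_probabilities)
    return math.exp(mean_nll)
```

A sequence of N symbols yields N - 20 input-output pairs under this scheme, which is why a corpus of roughly a million phonemes produces roughly a million training examples.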

Reports on bionomical characteristics of Mellicta ambigua (여름어리표범나비(Mellicta ambigua (Menetries))의 생태적 특성에 관한 보고)

  • Kim, Se-Gwon;Nam, Gyoung-Pil;Kim, Nam-Ee;Bae, Kyoung-Sin;Choi, Young-Cheol;Lee, Sang-Hyun
    • Journal of Sericultural and Entomological Science
    • /
    • v.52 no.2
    • /
    • pp.110-116
    • /
    • 2014
  • Recently, the number of Mellicta ambigua butterflies has been decreasing rapidly, and the species has already disappeared from many habitats. In this study, we investigated the ecological environment of Mellicta ambigua to prepare primary research data for habitat recovery, and studied its bionomical characteristics. Two habitats, Jindo and Inje, were selected for the investigation of the ecological environment. We conducted four surveys over three months, from June to August 2012. In Jindo, we observed more than 100 butterflies and many host plants, Melampyrum roseum var. japonicum. In Inje, however, only 5 butterflies and only a few host plants, Veronicastrum sibiricum, were observed. We could not observe any eggs, larvae, or pupae on the host plants. To determine the bionomical characteristics, we reared butterflies under natural conditions. Three female butterflies collected from Jindo laid 465 eggs on the leaves of three host plants, Veronicastrum sibiricum. Each female laid 120 to 186 eggs in clusters. The eggs were globular, 0.6 mm in diameter and 0.7 mm in height. The egg period was $9.96{\pm}0.4$ days after oviposition, and the hatchability was 95.% under natural conditions. The larval periods were $4.1{\pm}0.6$ days (1st instar), $2.1{\pm}1.0$ days (2nd), $8.1{\pm}0.7$ days (3rd), $239.2{\pm}10.9$ days (4th), $12.3{\pm}1.3$ days (5th), $17.1{\pm}1.1$ days (6th), and $10.5{\pm}1.0$ days (7th), respectively. The 4th-instar larvae overwintered as a group until early March of the following year, in a nest made from a leaf of the host plant bound with secreted thread. In early March, the overwintered larvae moved around their nest in search of host plants and sometimes moved to other host plants, Veronica persica and Plantago asiatica. The overwintered larvae of Mellicta ambigua could grow normally on these two other host plants. 
In the following experiment, Mellicta ambigua butterflies laid eggs on the leaves of Plantago asiatica, but all the 1st-instar larvae that hatched from these eggs died. The head widths of the larval stages were $0.28{\pm}0.02$ mm (1st), $0.45{\pm}0.02$ mm (2nd), $0.58{\pm}0.02$ mm (3rd), $0.75{\pm}0.03$ mm (4th), $0.89{\pm}0.05$ mm (5th), $1.23{\pm}0.06$ mm (6th), and $2.13{\pm}0.11$ mm (7th). The pupation rate was 92.0%. The pupal period was $9.1{\pm}1.6$ days, and the emergence rate was 88.6%. We therefore conclude that Mellicta ambigua can be reared under natural conditions. However, indoor rearing is considered difficult and not industrially useful, because the species has a long larval stage and only one life cycle per year.
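
As a small worked example, the reported instar means can be combined to give the total larval period and the head-width growth ratio between successive instars. The sketch below uses only the mean values quoted in the abstract and ignores the standard deviations; the variable and function names are illustrative, not taken from the paper.

```python
# Mean larval period per instar in days (1st to 7th), as quoted in the abstract.
LARVAL_PERIOD_DAYS = [4.1, 2.1, 8.1, 239.2, 12.3, 17.1, 10.5]

# Mean head width per instar in mm (1st to 7th), as quoted in the abstract.
HEAD_WIDTH_MM = [0.28, 0.45, 0.58, 0.75, 0.89, 1.23, 2.13]


def total_larval_period(periods):
    # Sum of the mean instar durations; the overwintering 4th instar dominates.
    return sum(periods)


def growth_ratios(widths):
    # Ratio of each instar's mean head width to that of the previous instar.
    return [later / earlier for earlier, later in zip(widths, widths[1:])]


if __name__ == "__main__":
    print(f"total larval period: {total_larval_period(LARVAL_PERIOD_DAYS):.1f} days")
    for instar, ratio in enumerate(growth_ratios(HEAD_WIDTH_MM), start=2):
        print(f"instar {instar - 1} -> {instar}: x{ratio:.2f}")
```

The sum of the means comes to about 293 days, most of it spent in the overwintering 4th instar, which is consistent with the single life cycle per year noted above.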

Earthquake Monitoring : Future Strategy (지진관측 : 미래 발전 전략)

  • Chi, Heon-Cheol;Park, Jung-Ho;Kim, Geun-Young;Shin, Jin-Soo;Shin, In-Cheul;Lim, In-Seub;Jeong, Byung-Sun;Sheen, Dong-Hoon
    • Geophysics and Geophysical Exploration
    • /
    • v.13 no.3
    • /
    • pp.268-276
    • /
    • 2010
  • The Earthquake Hazard Mitigation Law came into force in March 2009. Under the law, the obligation to monitor the effect of earthquakes on facilities was extended to many organizations, such as gas companies and local governments. According to the estimate of the National Emergency Management Agency (NEMA), the number of free-surface acceleration stations will be expanded to more than 400. The advent of the Internet Protocol and simplified operation have allowed quick and easy installation of seismic stations. In addition, the dynamic range of seismic instruments has been continuously improved, enough to evaluate damage intensity and to issue alarms directly for earthquake hazard mitigation. For direct visualization of damage intensity and area, Real Time Intensity COlor Mapping (RTICOM) is explained in detail. RTICOM would be used to retrieve the essential information for damage evaluation, Peak Ground Acceleration (PGA). Destructive earthquake damage is usually due to surface waves, which closely follow the S wave. The peak amplitude of the surface wave can be pre-estimated from the amplitude and frequency content of the first-arriving P wave. An Earthquake Early Warning (EEW) system is conventionally defined as one that estimates local magnitude from the P wave. The status of EEW is reviewed, and the application of EEW to the Odesan earthquake is illustrated with ShakeMap in order to make its form clear. In terms of speed, the earthquake announcements of the Korea Meteorological Agency (KMA) might be dramatically improved by the adoption of EEW. To realize hazard mitigation, EEW should be applied to crucial local facilities such as nuclear power plants and fragile semiconductor plants. Distributed EEW is introduced with the application example of the Uljin earthquake. For both nationwide and locally distributed EEW applications, all relevant information needs to be shared in real time. 
The plan for extending the Korea Integrated Seismic System (KISS) is briefly explained with a view to future cooperation in data sharing and utilization.
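
To make the PGA quantity concrete, the sketch below takes the peak ground acceleration of a record as the largest absolute sample across the supplied acceleration components. This is a common simplified definition and an assumption here; the abstract does not specify how RTICOM computes PGA, and the function name is hypothetical.

```python
def peak_ground_acceleration(*components):
    # Largest absolute acceleration sample over all supplied components.
    # Components may have different lengths; units follow the input data.
    return max(abs(sample) for component in components for sample in component)


if __name__ == "__main__":
    # Hypothetical samples for two horizontal components, for illustration only.
    east_west = [0.01, -0.04, 0.03]
    north_south = [0.02, 0.05, -0.01]
    print(peak_ground_acceleration(east_west, north_south))
```

In a real-time setting the same maximum would be tracked over a sliding time window per station rather than over a complete record.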

Target Strength of Schlegel's Black Rockfish (Sebastes schlegeli) and Red Seabream (Pagrus major) (조피볼락과 참돔의 표적 강도에 관한 연구)

  • 손창환;황두진
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.38 no.2
    • /
    • pp.119-128
    • /
    • 2002
  • This study investigates the dorsal-aspect target strength (TS) of Schlegel's black rockfish (Sebastes schlegeli) and red seabream (Pagrus major) in relation to fish size, tilt angle, and frequency. The study was carried out on free-swimming fish in a cage in order to obtain acoustic data for biomass estimation using a scientific echo sounder. The results are summarized as follows. 1. The coefficients for Schlegel's black rockfish and red seabream relating maximum TS to fish length were -63.7 dB and -62.6 dB at 38 kHz, -64.4 dB and -65.4 dB at 120 kHz, and -62.4 dB and -65.0 dB at 200 kHz, respectively. 2. The coefficients for Schlegel's black rockfish and red seabream relating averaged TS to fish length were -68.4 dB and -67.9 dB at 38 kHz, -73.4 dB and -72.7 dB at 120 kHz, and -70.8 dB and -73.4 dB at 200 kHz, respectively. 3. The coefficients for Schlegel's black rockfish and red seabream relating maximum TS to body weight were -52.0 dB and -50.9 dB at 38 kHz, -52.7 dB and -53.7 dB at 120 kHz, and -50.7 dB and -53.3 dB at 200 kHz, respectively. 4. The coefficients for Schlegel's black rockfish and red seabream relating averaged TS to body weight were -56.7 dB and -56.2 dB at 38 kHz, -61.7 dB and -61.0 dB at 120 kHz, and -59.1 dB and -61.6 dB at 200 kHz, respectively. 5. When the tilt angle of the two red seabream was varied from -26$^{\circ}$ to +25$^{\circ}$, the variation in target strength was smaller at 38 kHz than at 120 kHz, and at 120 kHz the target strength was about 3~6 dB higher head up than head down.
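
The abstract reports the length coefficients without stating the regression form. The sketch below assumes the conventional relation TS = 20 log10(L) + b20, with L in centimetres, which is widely used in fisheries acoustics but is not confirmed by the abstract. The function names and the example length are hypothetical; only the coefficient values come from the text.

```python
import math

# Maximum-TS length coefficients (dB) from the abstract: frequency in kHz -> value.
MAX_TS_LENGTH_COEFF_DB = {
    "black rockfish": {38: -63.7, 120: -64.4, 200: -62.4},
    "red seabream": {38: -62.6, 120: -65.4, 200: -65.0},
}


def ts_from_length(length_cm, b20_db):
    # Assumed form: TS = 20*log10(L) + b20, with L in cm (not stated in the source).
    return 20.0 * math.log10(length_cm) + b20_db


def backscattering_cross_section(ts_db):
    # Convert TS in dB to the linear backscattering cross-section (m^2).
    return 10.0 ** (ts_db / 10.0)


if __name__ == "__main__":
    # Hypothetical 30 cm fish, for illustration only.
    for species, by_frequency in MAX_TS_LENGTH_COEFF_DB.items():
        for frequency_khz, b20 in by_frequency.items():
            ts = ts_from_length(30.0, b20)
            print(f"{species} @ {frequency_khz} kHz: TS = {ts:.1f} dB")
```

Under this assumed form, a coefficient difference of 1 dB between species or frequencies translates directly into a 1 dB difference in predicted TS at any given length.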

Characteristics of Fish Utilization of the Nature-like Fishway Installed at the Beakjae Weir (백제보에 설치된 자연형 어도의 어류 이용 특성 분석)

  • Kim, Jeong-Hui;Yoon, Ju-Duk;Park, Sang-Hyeon;Lee, Jin-Woong;Baek, Seung-Ho;Jang, Min-Ho
    • Korean Journal of Ecology and Environment
    • /
    • v.48 no.4
    • /
    • pp.212-218
    • /
    • 2015
  • In South Korea, various nature-like fishways have recently been installed for use by a wide variety of fish species. However, few attempts have been made to monitor how fish use them. The present study was conducted to ascertain the frequencies and patterns of utilization of the fishway installed at Beakjae Weir. We collected the fish that used the fishway by installing a fyke net at the exit of the fishway at least once a month from April 2013 to October 2013. Additionally, in order to identify all fish species that could potentially use the fishway, we investigated the fish fauna downstream of Beakjae Weir (mainstream of the Geum River). We found that 10 species belonging to 2 families used the fishway; this accounted for 64% of the total species inhabiting the mainstream. The species that used the fishway most frequently was Microphysogobio jeoni, followed by Squaliobarbus curriculus and Opsariichthys uncirostris amurensis. The highest number of fish using the fishway was observed in August, and the number of fish was positively correlated with water temperature (Spearman rank correlation, $r_s$=0.743, P=0.035). The sizes of the fish using the fishway varied widely, with total body length ranging from 39 mm to 550 mm. Analysis of the time-dependent utilization frequency revealed that most fish used the fishway during the night (20:00~08:00). Compared to other fishways installed along the Geum River, the fishway installed at Beakjae Weir was used by fewer species and fewer fish. This may be attributed to the structural inadequacy of the fishway, which results in a low attraction efficiency. Therefore, measures should be adopted to enhance the attraction and passage efficiency of the fishway. The results of this study can be used to ensure efficient operation and management of the Beakjae Weir fishway and can serve as basic data for developing and building nature-like fishways tailored to Korean conditions.
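
The Spearman rank correlation reported above can be computed as the Pearson correlation of the ranks of the two variables. The following is a minimal pure-Python sketch with average ranks for ties; the function names are illustrative, and the monthly values in the usage example are hypothetical placeholders, not the study's data.

```python
import math


def average_ranks(values):
    # Rank values from 1..n, giving tied values the mean of their positions.
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    # Spearman's r_s is Pearson's r applied to the ranks.
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Hypothetical monthly water temperatures and fish counts (not from the paper).
    temperature_c = [12.0, 17.5, 22.0, 25.5, 27.0, 23.0, 16.0]
    fish_count = [5, 9, 30, 22, 41, 18, 12]
    print(f"r_s = {spearman(temperature_c, fish_count):.3f}")
```

Because it works on ranks, the coefficient is insensitive to the skewed counts that are typical of monthly catch data.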

Simulation of Local Climate and Crop Productivity in Andong after Multi-Purpose Dam Construction (임하 다목적댐 건설 후 주변지역 기후 및 작물생산력 변화)

  • 윤진일;황재문;이순구
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.42 no.5
    • /
    • pp.579-596
    • /
    • 1997
  • A simulation study was carried out to delineate the potential effects of lake-induced climate change on crop productivity around Lake Imha, which was formed after the construction of a multi-purpose dam in Andong, Korea. Twenty-seven cropping zones were identified within the 30 km by 25 km study area. Five automated weather stations were installed within the study area and operated for five years after the lake formation. A geostatistical method was used to calculate the monthly climatological normals of daily maximum and minimum temperature, solar radiation, and precipitation for each cropping zone before and after the dam construction. Daily weather data sets for 30 years were generated for each cropping zone from the monthly normals representing the "No lake" and "After lake" climatic scenarios, respectively. They were fed into crop models (ORYZA1 for rice, SOYGRO for soybean, CERES-maize for corn) to simulate the yield potential of each cropping zone. The calculated daily maximum temperature was higher after the dam construction for the period of October through March and lower for the remaining months except June and July. A decrease in daily minimum temperature was predicted for the period of April through August. Monthly total radiation was predicted to decrease after the lake formation in all months except February, June, and September, and the largest drop was found in winter. There was, however, no consistent pattern in precipitation change. According to the model calculations, yield potential decreased by 10 to 17% in 2 of the 27 cropping zones for soybean and in 6 for corn. Little change in yield potential was found in most cropping zones for paddy rice, but interannual variation was predicted to increase after the lake formation.

Analysis of Plants Social Network for Vegetation Management on Taejongdae in Busan Metropolitan City (부산 태종대 식생관리를 위한 식물사회네트워크 분석)

  • Sang-Cheol Lee;Hyun-Mi Kang;Seok-Gon Park;Jae-Bong Baek;Chan-Yeol Yu;In-Chun Hwang;Song-Hyun Choi
    • Korean Journal of Environment and Ecology
    • /
    • v.36 no.6
    • /
    • pp.651-661
    • /
    • 2022
  • Plant social network analysis, which combines plant society analysis and social network analysis, is a new research method for understanding plant societies. This study was conducted to investigate the relationships between species using plant social network analysis at Taejongdae in Busan, and to build basic data for management. Taejongdae, located in the warm temperate forest zone of Korea, is a representative coastal forest of Busan Metropolitan City, where the Pinus thunbergii-Eurya japonica community is widely distributed. This study set up 100 quadrats (100 m$^2$ each) in Taejongdae to survey the species present and analyzed interspecies associations, focusing on major species. Based on the results, a sociogram was created using Gephi 0.9.2, and the network centrality and structure were analyzed. The results showed that the frequency of appearance was highest, in order, for P. thunbergii, E. japonica, Quercus serrata, Sorbus alnifolia, Ligustrum japonicum, and Styrax japonicus, and that many evergreen broad-leaved trees appeared due to the environmental characteristics of the site. The plant social network of Taejongdae was a small-scale network with 50 nodes and 172 links and was divided into 4 groups through modularization. The succession sere identified through the sociogram indicated that the group that includes P. thunbergii and E. japonica would progress to a deciduous broad-leaved community dominated by Q. serrata and Carpinus tschonoskii, using hub nodes such as Prunus serrulata f. spontanea and Toxicodendron trichocarpum. Another succession sere was highly likely to progress to an evergreen broad-leaved community dominated by Machilus thunbergii and Neolitsea sericea, using M. thunbergii as a medium. In some areas, a transition to a deciduous broad-leaved community dominated by Celtis sinensis, Q. variabilis, and Zelkova serrata, using Lindera obtusiloba and C. sinensis as hub nodes, was expected.
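
The reported network size (50 nodes, 172 links) is enough to derive two basic structural figures, the mean degree and the density. The sketch below assumes the sociogram is an undirected graph without self-loops or repeated links, which the abstract does not state explicitly; the function names are illustrative.

```python
def mean_degree(num_nodes, num_links):
    # Each undirected link contributes to the degree of two nodes.
    return 2 * num_links / num_nodes


def density(num_nodes, num_links):
    # Share of all possible node pairs that are actually linked.
    possible_links = num_nodes * (num_nodes - 1) / 2
    return num_links / possible_links


if __name__ == "__main__":
    nodes, links = 50, 172  # values reported in the abstract
    print(f"mean degree: {mean_degree(nodes, links):.2f}")
    print(f"density:     {density(nodes, links):.3f}")
```

Under these assumptions each species is associated with about seven others on average, and roughly 14% of all possible species pairs are linked.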