• Title/Summary/Keyword: Classical Database

Search Result 49, Processing Time 0.024 seconds

A Study on the Development of English Inflectional Morphemes Based on the CHILDES Corpus (CHILDES 코퍼스를 기반으로 한 아동의 영어 굴절형태소 발달 연구)

  • Min, Myung Sook;Jun, Jongsup;Lee, Sun-Young
    • Korean Journal of Cognitive Science
    • /
    • v.24 no.3
    • /
    • pp.203-235
    • /
    • 2013
  • The goal of this paper is to test the findings about English-speaking children's acquisition of inflectional morphemes in the literature using a large-scale database. For this, we obtained a 4.7-million-word corpus from the CHILDES (Child Language Data Exchange System) database, and analyzed 1,630 British and American children's uses of English derivational morphemes up to age 7. We analyzed the type and token frequencies, type per token ratio (TTR), and the lexical diversity (D) for such inflectional morphemes as the present progressive -ing, the past tense -(e)d, the comparative and superlative -er/est with reference to children's nationality and age groups. To sum up our findings, the correlations between the D value and children's age varied from morpheme to morpheme; e.g. we found no correlation for -ing, a marginal correlation for -ed, and a strong correlation for -er/-est. Our findings are consistent with Brown's (1973) classical observation that children learn progressive forms earlier than the past tense marker. In addition, overgeneralization errors were frequently found for -ed, but rarely for -ing, showing a U-shaped developmental pattern at ages 2-3. Finally, American children showed higher D scores than British children, which showed that American children used inflectional morphemes for more word types compared with British children. The present study has its significance in testing the earlier findings in the literature by setting up well-defined methodology for analyzing the entire CHILDES database.

  • PDF

Radiological Risk Assessment for the Public Under the Loss of Medium and Large Sources Using Bayesian Methodology (베이지안 기법에 의거한 중대형 방사선원의 분실 시 일반인에 대한 방사선 위험도의 평가)

  • Kim, Joo-Yeon;Jang, Han-Ki;Lee, Jai-Ki
    • Journal of Radiation Protection and Research
    • /
    • v.30 no.2
    • /
    • pp.91-97
    • /
    • 2005
  • Bayesian methodology is appropriated for use in PRA because subjective knowledges as well as objective data are applied to assessment. In this study, radiological risk based on Bayesian methodology is assessed for the loss of source in field radiography. The exposure scenario for the lost source presented in U.S. NRC is reconstructed by considering the domestic situation and Bayes theorem is applied to updating of failure probabilities of safety functions. In case of updating of failure probabilities, it shows that 5 % Bayes credible intervals using Jeffreys prior distribution are lower than ones using vague prior distribution. It is noted that Jeffreys prior distribution is appropriated in risk assessment for systems having very low failure probabilities. And, it shows that the mean of the expected annual dose for the public based on Bayesian methodology is higher than the dose based on classical methodology because the means of the updated probabilities are higher than classical probabilities. The database for radiological risk assessment are sparse in domestic. It summarizes that Bayesian methodology can be applied as an useful alternative lot risk assessment and the study on risk assessment will be contributed to risk-informed regulation in the field of radiation safety.

Recurrent Neural Network Models for Prediction of the inside Temperature and Humidity in Greenhouse

  • Jung, Dae-Hyun;Kim, Hak-Jin;Park, Soo Hyun;Kim, Joon Yong
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2017.04a
    • /
    • pp.135-135
    • /
    • 2017
  • Greenhouse have been developed to provide the plants with good environmental conditions for cultivation crop, two major factors of which are the inside air temperature and humidity. The inside temperature are influenced by the heating systems, ventilators and for systems among others, which in turn are geverned by some type of controller. Likewise, humidity environment is the result of complex mass exchanges between the inside air and the several elements of the greenhouse and the outside boundaries. Most of the existing models are based on the energy balance method and heat balance equation for modelling the heat and mass fluxes and generating dynamic elements. However, greenhouse are classified as complex system, and need to make a sophisticated modeling. Furthermore, there is a difficulty in using classical control methods for complex process system due to the process are non linear and multi-output(MIMO) systems. In order to predict the time evolution of conditions in certain greenhouse as a function, we present here to use of recurrent neural networks(RNN) which has been used to implement the direct dynamics of the inside temperature and inside humidity of greenhouse. For the training, we used algorithm of a backpropagation Through Time (BPTT). Because the environmental parameters are shared by all time steps in the network, the gradient at each output depends not only on the calculations of the current time step, but also the previous time steps. The training data was emulated to 13 input variables during March 1 to 7, and the model was tested with database file of March 8. The RMSE of results of the temperature modeling was $0.976^{\circ}C$, and the RMSE of humidity simulation was 4.11%, which will be given to prove the performance of RNN in prediction of the greenhouse environment.

  • PDF

Design and Implementation of a Directory System for Disease Retrieval Services (질병 검색 서비스를 위한 디렉토리 시스템 설계 및 구현)

  • Yeo, Myung-ho;Lee, Yoon-kyeong;Rho, Kyu-jong;Park, Hyoung-soon;Kim, Hak-sin;Park, Jun-ho;Kang, Tae-ho;Kim, Hak-yong;Yoo, Jae-soo
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.709-714
    • /
    • 2009
  • Recently, biological researches are required to deal with a large scale of data. While scientists used classical experimental approaches for researches in the past, it is possible to get more sophisticated observations easily with convergence of information technologies and biology. The study on diseases is one of the most important issues of the life science. Conventional services and databases provide users with information such as classification of diseases, symptoms, and medical treatments through web. However, it is hard to connect or develop them for other new services because they have independent and different criterions. It may be a factor that interferes the development of biology. In this paper, we propose an integrated data structure for the disease database, and design and implement a novel directory system for diseases as an infrastructure for developing other new services.

  • PDF

New Galaxy Catalog of the Virgo Cluster

  • Kim, Suk;Rey, Soo-Chang;Jerjen, Helmut;Lisker, Thorsten;Sung, Eon-Chang;Lee, Youngdae;Chung, Jiwon;Pak, Mina;Yi, Wonhyeong;Lee, Woong
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.39 no.2
    • /
    • pp.50-50
    • /
    • 2014
  • We present a new catalog of galaxies in the wider region of the Virgo cluster, based on the Sloan Digital Sky Survey (SDSS) Data Release 7. The Extended Virgo Cluster Catalog (EVCC) covers an area of 725 deg2 or 60.1 Mpc2. It is 5.2 times larger than the footprint of the classical Virgo Cluster Catalog (VCC) and reaches out to 3.5 times the virial radius of the Virgo cluster. We selected 1324 spectroscopically targeted galaxies with radial velocities less than 3000 km s-1. In addition, 265 galaxies that have been missed in the SDSS spectroscopic survey but have available redshifts in the NASA Extragalactic Database are also included. Our selection process secured a total of 1589 galaxies of which 676 galaxies are not included in the VCC. The certain and possible cluster members are defined by means of redshift comparison with a cluster infall model. We employed two independent and complementary galaxy classification schemes: the traditional morphological classification based on the visual inspection of optical images and a characterization of galaxies from their spectroscopic features. SDSS u, g, r, i, and z passband photometry of all EVCC galaxies was performed using Source Extractor. We compare the EVCC galaxies with the VCC in terms of morphology, spatial distribution, and luminosity function. The EVCC defines a comprehensive galaxy sample covering a wider range in galaxy density that is significantly different from the inner region of the Virgo cluster. It will be the foundation for forthcoming galaxy evolution studies in the extended Virgo cluster region, complementing ongoing and planned Virgo cluster surveys at various wavelengths.

  • PDF

Comparison of data mining methods with daily lens data (데일리 렌즈 데이터를 사용한 데이터마이닝 기법 비교)

  • Seok, Kyungha;Lee, Taewoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.6
    • /
    • pp.1341-1348
    • /
    • 2013
  • To solve the classification problems, various data mining techniques have been applied to database marketing, credit scoring and market forecasting. In this paper, we compare various techniques such as bagging, boosting, LASSO, random forest and support vector machine with the daily lens transaction data. The classical techniques-decision tree, logistic regression-are used too. The experiment shows that the random forest has a little smaller misclassification rate and standard error than those of other methods. The performance of the SVM is good in the sense of misclassfication rate and bad in the sense of standard error. Taking the model interpretation and computing time into consideration, we conclude that the LASSO gives the best result.

An Efficient Cache Mechanism for Improving Response Times in Integrated RFID Middleware (통합 RFID 미들웨어의 응답시간 개선을 위한 효과적인 캐쉬 구조 설계)

  • Kim, Cheong-Ghil;Lee, Jun-Hwan;Park, Kyung-Lang;Kim, Shin-Dug
    • The KIPS Transactions:PartA
    • /
    • v.15A no.1
    • /
    • pp.17-26
    • /
    • 2008
  • This paper proposes an efficient caching mechanism appropriate for the integrated RFID middleware which can integrate wireless sensor networks (WSNs) and RFID (radio frequency identification) systems. The operating environment of the integrated RFID middleware is expected to face the situations of a significant amount of data reading from RFID readers, constant stream data input from large numbers of autonomous sensor nodes, and queries from various applications to history data sensed before and stored in distributed storages. Consequently, an efficient middleware layer equipping with caching mechanism is inevitably necessary for low latency of request-response while processing both data stream from sensor networks and history data from distributed database. For this purpose, the proposed caching mechanism includes two optimization methods to reduce the overhead of data processing in RFID middleware based on the classical cache implementation polices. One is data stream cache (DSC) and the other is history data cache (HDC), according to the structure of data request. We conduct a number of simulation experiments under different parameters and the results show that the proposed caching mechanism contributes considerably to fast request-response times.

The Operational Comparison of SPOT GCP Acquisition and Accuracy Evaluation

  • Kim, Kam-Lae;Kim, Uk-Nam;Chun, Ho-Woun;Lee, Ho-Nam
    • Korean Journal of Geomatics
    • /
    • v.1 no.1
    • /
    • pp.1-5
    • /
    • 2001
  • This paper presents an investigation into the operational comparison of SPOT triangulation to build GCP library by analytical plotter and DPW (digital photogrammetric workstation). GCP database derived from current SPOT images can be used to other image sensors of satellite, if any reasons, such as lack of topographic maps or GCPs. But, general formulation of a photogrammetric process for GCP measurement has to take care of the scene interpretation problem. There are two classical methods depending on whether an analytical plotter or DPW is being used. Regardless of the method used, the measurement of GCPs is the weakest point in the automation of photogrammetric orientation procedures. To make an operational comparison, five models of SPOT panchromatic images (level 1A) and negative films (level 1AP) were used. Ten images and film products were used for the five GRS areas. Photogrammetric measurements were carried out in a manual mode on P2 analytical plotter and LH Systems DPW770. We presented an approach for exterior orientation of SPOT images, which was based on the use of approximately eighty national geodetic control points as GCPs which located on the summit of the mountain. Using sixteen well-spaced geodetic control points per model, all segments consistently showed RMS error just below the pixel at the check points in analytical instrument. In the case of DPW, half of the ground controls could not found or distinguished exactly when we displayed the image on the computer monitor. Experiment results showed that the RMS errors with DPW test was fluctuated case by case. And the magnitudes of the errors were reached more than three pixels due to the lack of image interpretation capability. It showed that the geodetic control points is not suitable as the ground control points in DPW for modeling the SPOT image.

  • PDF

Study of galaxies in extensive area of the Virgo cluster

  • Kim, Suk;Rey, Soo-Chang;Sung, Eon-Chang;Jerjen, Helmut;Lisker, Thorsten;Lee, Youngdae;Chung, Jiwon;Lee, Woong;Chung, Aeree;Yoon, Hyein
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.41 no.1
    • /
    • pp.35.1-35.1
    • /
    • 2016
  • Nearby galaxy clusters and their surrounding regions represent the current endpoint of evolution galaxy cluster evolution. We present a new catalog of 1589 galaxies, what we call Extended Virgo Cluster Catalog (EVCC), in wider area of the Virgo cluster based on the Sloan Digital Sky Survey (SDSS) Data Release 7. The EVCC covers an area 5.2 times larger than the footprint of the classical Virgo Cluster Catalog, and reaches out to 3.5 times the virial radius of the Virgo cluster. The EVCC contains fundamental information such as membership, morphology, and photometric parameters of galaxies. The EVCC defines a comprehensive galaxy sample covering a wider range in galaxy density that is significantly different from the inner region of the Virgo cluster. It will be the foundation for forthcoming galaxy evolution studies in the extended Virgo cluster region, complementing ongoing and planned Virgo cluster surveys at various wavelengths. We also present the large scale structures in the field around the Virgo cluster. We identified seven galaxy filaments and one possible sheet in three dimensions of super-galactic coordinates based on the HyperLEDA database. By examining spatial distribution and Hubble diagram of galaxies, we found that six filaments are directly associated with the main body of the Virgo cluster. On the other hand, one filament and one sheet are structures located at background of the main body of Virgo cluster. The EVCC and the filament structures will be the foundation for forthcoming studies of galaxy evolution in various environments as well as buildup of the galaxy cluster at z ~ 0, complementing ongoing and planned Virgo cluster surveys at various wavelengths.

  • PDF

Progress and Prospect of Rice Biotechnology in Korea

  • Tae Young, Chung
    • Proceedings of the Korean Society of Sericultural Science Conference
    • /
    • 1997.06a
    • /
    • pp.23-49
    • /
    • 1997
  • This is a progress report of rice biotechnology including development of gene transformation system, gene cloning and molecular mapping in rice. The scope of the research was focused on the connection between conventional breeding and biotech-researches. Plant transformation via Agrobacterium or particle bombardment was developed to introduce one or several genes to recommended rice cultivars. Two chimeric genes containing a maize ribosome inactivating protein gene (RIP) and a gerbicide resistant gene (bar) were introduced to Nipponbare, a Japonica cultivar, and transmitted to Korean cultivars. The homozygous progenies of herbicide resistant transgenic plant showed good fertility and agronomic characters. To explore the genetic resourses in rice, over 8,000 cDNA clones from immature rice seed have been isolated and sequenced. About 13% of clones were identified as enzymes related to metabolic pathway. Among them, twenty clones have high homology with genes encoding enzymes in the photorespiratory carbon cycle reaction. Up to now about 100 clones were fully sequenced and registered at EMBL and GenBank. For the mapping of quantitative tarits loci (QTL) and eternal recombinant inbred population with 164 F13 lines (MGRI) was developed from a cross between Milyang 23 and Gihobyeo, Korean rice cultivars. After construction of fully saturated RFLP and AFLP map, quantitative traits using MGRI population were analyzed and integrated into the molecular map. Eighty seven loci were determined with 27 QTL characters including yield and yield components on rice chromosomes. Map based cloning was also tried to isolate semi-dwarf (sd-1) gene in rice. A DNA probe, RG 109, the most tightly linked to sd-1 gene was used to screen from bacterial artifical chromosome (BAC) libraries and five over lapping clones presumably containing sd-1 gene were isolated. Rice genetic database including results of biotech reasearch and classical genetics is provided at Korea Rice Genome Server which is accessible with world wide web (www) browser. The server provides rice cDNA sequences and map informations linked with phenotypic images.