• Title/Summary/Keyword: 통계 처리

Search Result 3,118, Processing Time 0.049 seconds

The Design of Front-end System to RDBMS for Effective Management of Statistical Database (통계 데이타베이스의 효율적 관리를 위한 관계형데이타베이스 관리 시스템에의 전위시스템 설계)

  • An, Seong-Ok;Kim, Yong-Ho
    • The Journal of Natural Sciences
    • /
    • v.5 no.2
    • /
    • pp.25-32
    • /
    • 1992
  • Statistical database(SDB) are large database primarily collected for purpose of statistical analysis. Commerical database management systems have not been widely used for SDB because of the efficiency problem of storage and access of those systems for SDB. In this paper, we propose SDB management method to use a front-end system to a Relatianal Datebase Management System (RDBMS). We do the design of SM-F system (Stasticical database Management as Front-end system) as a front-end system to a RDBMS. In the system, we use GROS model specially proposed for SDB, and store and manage summary database and meta database to support statistical analysis and to provide users with statistical summary information.

  • PDF

Neural correlates of visual mean representation (시각적 평균 표상의 신경기제)

  • Chong, Sang-Chul;Shin, Kil-Ho;Cho, Shin-Ho
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.1
    • /
    • pp.75-88
    • /
    • 2008
  • Visual scene contains lots of redundant information. To process this redundant information without increasing brain's volume, human visual system may summarize incoming information. If similar but different information are given to visual system, visual system extracts statistical properties of the information. One example of the statistical representation is representation of mean size. The mean representation is accurate and durable. The process of mean representation is suggested to be parallel. However, previous studies on the mean representation mostly used behavioral methods. The purpose of this study was to investigate which neural regions extracted the mean size of a set of circles using fMRI method. According to previous studies, BOLD signal of certain areas that were in charge of cousin stimuli decreased when the same stimuli presented repetitively. We used this paradigm and found that BOLD signal of right occipital area was decreased when same mean site was presented repeatedly. This results suggest that right occipital area is the locus of mean representation of visual stimuli.

  • PDF

Integrated Indexing Method using Compound Noun Segmentation and Noun Phrase Synthesis (복합명사 분할과 명사구 합성을 이용한 통합 색인 기법)

  • Won, Hyung-Suk;Park, Mi-Hwa;Lee, Geun-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.1
    • /
    • pp.84-95
    • /
    • 2000
  • In this paper, we propose an integrated indexing method with compound noun segmentation and noun phrase synthesis. Statistical information is used in the compound noun segmentation and natural language processing techniques are carefully utilized in the noun phrase synthesis. Firstly, we choose index terms from simple words through morphological analysis and part-of-speech tagging results. Secondly, noun phrases are automatically synthesized from the syntactic analysis results. If syntactic analysis fails, only morphological analysis and tagging results are applied. Thirdly, we select compound nouns from the tagging results and then segment and re-synthesize them using statistical information. In this way, segmented and synthesized terms are used together as index terms to supplement the single terms. We demonstrate the effectiveness of the proposed integrated indexing method for Korean compound noun processing using KTSET2.0 and KRIST SET which are a standard test collection for Korean information retrieval.

  • PDF

Analysis of K-ABC Profile of Young Gifted Children and Ordinary Young Children (유아영재와 일반유아의 K-ABC 프로파일 분석)

  • Oh, Mee-Hyeong
    • Journal of Gifted/Talented Education
    • /
    • v.19 no.2
    • /
    • pp.241-260
    • /
    • 2009
  • The purpose of this study was to contrast young gifted children with ordinary young children in K-ABC profile. The subject were 51 young gifted children and 51 ordinary young children, 2 to 4 years of age. Data of children's K-ABC profile were analyzed by Correlation and Crosstabs. The main results of this study were as follows: First, in the case of ordinary young children, there were significant positive correlation among 'Mental Processing Composite' and all sub-tests of mental processing composite except 'face memory' test, 'Achievement Scale'. In young gifted children, there were significant positive correlation among 'Mental Processing Composite' and just four sub-tests of mental processing composite, and there were no significant correlation between 'Mental Processing Composite' and 'Achievement Scale'. Second, there were no significant differences among all sub-tests' strength and weakness in young gifted children and ordinary young children. Third, young gifted children got higher score in 'Sequential Processing Scale' and 'Mental Processing Composite' than 'Achievement Scale'. But in ordinary young children, there were no significant differences among all K-ABC' sub-scales.

Hybrid POS Tagging with generalized unknown word handling and post error-correction rules (일반화된 미등록어 처리와 오류 수정규칙을 이용한 혼합형 품사태깅)

  • Cha, Jeong-Won;Lee, Won-Il;Lee, Geun-Bae;Lee, Jong-Hyeok
    • Annual Conference on Human and Language Technology
    • /
    • 1997.10a
    • /
    • pp.88-93
    • /
    • 1997
  • 본 논문에서는 품사 태깅을 위해 여러 통계 모델을 실험을 통하여 비교하였으며 이를 토대로 통계적 모델을 구성하였다. 형태소 패턴 사전을 이용하여 미등록어의 위치와 개수에 관계없는 일반적인 방법의 미등록어 처리 방법을 개발하고 통계모델이 가지는 단점을 보완할 수 있는 오류 수정 규칙을 함께 이용하여 혼합형 품사 태깅 시스템인 $POSTAG^{i}$를 개발하였다. 미등록어를 추정하는 형태소 패턴 사전은 한국어 음절 정보와 용언의 불규칙 정보를 이용하여 구성하고 다어절어 사전을 이용하여 여러 어절에 걸쳐 나타나는 연어를 효과적으로 처리하면서 전체적인 태깅 정확도를 개선할 수 있다. 또 오류 수정 규칙은 Brill이 제안한 학습을 통하여 자동으로 얻어진다. 오류 수정 규칙의 자동 추출시에 몇 가지의 휴리스틱을 사용하여 보다 우수하고 일반적인 규clr을 추출할 수 있게 하였다. 10만의 형태소 품사 말뭉치로 학습하고 학습에 참여하지 않은 2만 5천여 형태소로 실험하여 97.28%의 정확도를 보였다.

  • PDF

An approximate fitting for mixture of multivariate skew normal distribution via EM algorithm (EM 알고리즘에 의한 다변량 치우친 정규분포 혼합모형의 근사적 적합)

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.513-523
    • /
    • 2016
  • Fitting a mixture of multivariate skew normal distribution (MSNMix) with multiple skewness parameter vectors via EM algorithm often requires a highly expensive computational cost to calculate the moments and probabilities of multivariate truncated normal distribution in E-step. Subsequently, it is common to fit an asymmetric data set with MSNMix with a simple skewness parameter vector since it allows us to compute them in E-step in an univariate manner that guarantees a cheap computational cost. However, the adaptation of a simple skewness parameter is unrealistic in many situations. This paper proposes an approximate estimation for the MSNMix with multiple skewness parameter vectors that also allows us to treat them in an univariate manner. We additionally provide some experiments to show its effectiveness.

A Korean Corpus Analysis Tool for Language Information Acquisition (언어 정보 획득을 위한 한국어 코퍼스 분석 도구)

  • Lee, Ho;Kim, Jin-Dong;Rim, Hae-Chang
    • Annual Conference on Human and Language Technology
    • /
    • 1994.11a
    • /
    • pp.297-304
    • /
    • 1994
  • 코퍼스는 기계 가독형으로 개장되어 있는 실제 사용 언어의 집합으로 자연어 처리에 필요한 여러 가지 언어 정보를 내재하고 있다. 이들 정보는 코퍼스 분석기를 이용하여 획득할 수 있으며 용례와 각종 통계 정보 및 확률 정보, 연어 목록 등은 코퍼스에서 추출할 수 있는 대표적인 언어 정보들이다. 그러나 기존의 한국어 코퍼스 분석 도구들은 용례 추출 기능만을 보유하여 활용 범위가 제한되어 있었다. 이에 본 논문에서는 대량의 한국어 코퍼스를 분석하여 용례뿐만 아니라 자연어 처리의 제분야에서 필요한 언어 정보들을 추출하는 방법에 대해 연구하였으며 이의 검증을 위해 KCAT(Korean Corpus Analysis Tool)를 구현하였다. KCAT는 코퍼스 색인, 용례 추출, 통계 정보 추출, 연어 추출 부분으로 구성되어 있다. 용례 색인을 위해서는 여러 가지 사전과 용례 색인 구조가 필요한데 KCAT에서는 가변 차수 B-Tree 구조를 이용하여 사전을 구성하며 용례 색인을 위해 버킷 단위의 역 화일 구조를 이용한다. 질 좋은 용례의 추출을 위해 KCAT는 다양한 용례 연산 및 정렬 기능을 제공한다. 또한 통계적 방법의 자연어 처리 분야를 위해 어휘 확률, 상태 전이 확률, 관측 심볼 확률, 상호 정보, T-score 등을 제공하며, 기계 번역 분야에서 필요한 연어를 추출한다.

  • PDF

A Statistical Study of SNR, SDNR on Water Temperature, C/N Ratio, and BOD Loads in Wastewater Treatment process (하수처리공정에서 수온, C/N비, BOD부하량에 따른 SNR, SDNR의 통계적 연구)

  • An, Sang-Woo;Min, Jee-Eun;Park, Jae-Woo
    • 한국방재학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.823-826
    • /
    • 2008
  • Statistical methods were used in the analysis of data, which are the SNR and SDNR in describing the various natures, and the methodology relating the results with the operation was developed. Multiple regression analysis based on the results of statistics of data were SNR = 0.0219 + 0.000044BOD lording - 0.00600C/N ratio and SDNR = 0.0226 + 0.000044BOD lording - 0.00602C/N ratio. It were concluded that the variability of the process performance should be reflected to the operation condition procedure through the analysis based on the statistics methods.

  • PDF