• Title/Summary/Keyword: 분포역

Search Result 1,776, Processing Time 0.044 seconds

Korean Semantic Role Labeling Based on Bidirectional LSTM CRFs Using the Semantic Label Distribution of Syllables (음절의 의미역 태그 분포를 이용한 Bidirectional LSTM CRFs 기반의 한국어 의미역 결정)

  • Yoon, Jungmin;Bae, Kyoungman;Ko, Youngjoong
    • 한국어정보학회:학술대회논문집
    • /
    • 2016.10a
    • /
    • pp.324-329
    • /
    • 2016
  • 의미역 결정은 자연어 문장의 서술어와 그 서술어에 속하는 논항들 사이의 의미관계를 결정하는 것이다. 최근 의미역 결정 연구에는 의미역 말뭉치와 기계학습 알고리즘을 이용한 연구가 주를 이루고 있다. 본 논문에서는 순차적 레이블링 영역에서 좋은 성능을 보이고 있는 Bidirectional LSTM-CRFs 기반으로 음절의 의미역 태그 분포를 고려한 의미역 결정 모델을 제안한다. 제안한 음절의 의미역 태그 분포를 고려한 의미역 결정 모델은 분포가 고려되지 않은 모델에 비해 2.41%p 향상된 66.13%의 의미역 결정 성능을 보였다.

  • PDF

Korean Semantic Role Labeling Based on Bidirectional LSTM CRFs Using the Semantic Label Distribution of Syllables (음절의 의미역 태그 분포를 이용한 Bidirectional LSTM CRFs 기반의 한국어 의미역 결정)

  • Yoon, Jungmin;Bae, Kyoungman;Ko, Youngjoong
    • Annual Conference on Human and Language Technology
    • /
    • 2016.10a
    • /
    • pp.324-329
    • /
    • 2016
  • 의미역 결정은 자연어 문장의 서술어와 그 서술어에 속하는 논항들 사이의 의미관계를 결정하는 것이다. 최근 의미역 결정 연구에는 의미역 말뭉치와 기계학습 알고리즘을 이용한 연구가 주를 이루고 있다. 본 논문에서는 순차적 레이블링 영역에서 좋은 성능을 보이고 있는 Bidirectional LSTM-CRFs 기반으로 음절의 의미역 태그 분포를 고려한 의미역 결정 모델을 제안한다. 제안한 음절의 의미역 태그 분포를 고려한 의미역 결정 모델은 분포가 고려되지 않은 모델에 비해 2.41%p 향상된 66.13%의 의미역 결정 성능을 보였다.

  • PDF

Zooplankton Community in the Front Zone of the East Sea (the Sea of Japan), Korea : 2. Relationship between Abundance Distribution and Seawater Temperature (동해 전선역 동물플랑크톤 군집 : 2. 수온과 분포의 관계)

  • PARK Chul;LEE Chang Rae;KIM Jeong Chang
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.31 no.5
    • /
    • pp.749-759
    • /
    • 1998
  • Distribution of zooplankton abundance was studied in the front zone in the East Sea in November, 1996, Averaged total abundance in the front zone was less than that in the nearby cold surface water areas but more than that in the nearby warm surface water areas. The number of taxa was the greatest in the upper layer of mixing. Abundance and the number of tun in the front zone were contributed by the cold water and the warm water, respectively. Inspite of the differences in sampling time (day vs night), the species composition and abundance distribution were similar at two sites within cold or warm water area, However, they were quite different at two sites in the front zone although the sampling time of the day was the same. from this, the history of mixing was believed to be the most important factor for the species composition and abundance distribution in the front zone. Zooplankton distribution in the study area was mainly controlled by the dominant cold water Copepod Species Metridia paoifica, the only taxon that showed significant diet vertical migration. Most other taxa showed no significant diel vortical migration, Seawater temperature also affected zooplankton distribution. Positive correlations in the warm area, weak negative correlations in the cold water area, and no significant correlation in the front zone were obtained in general between the seawater temperature and the abundances of the major taxa.

  • PDF

Kullback-Leibler Information-Based Tests of Fit for Inverse Gaussian Distribution (역가우스분포에 대한 쿨백-라이블러 정보 기반 적합도 검정)

  • Choi, Byung-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1271-1284
    • /
    • 2011
  • The entropy-based test of fit for the inverse Gaussian distribution presented by Mudholkar and Tian(2002) can only be applied to the composite hypothesis that a sample is drawn from an inverse Gaussian distribution with both the location and scale parameters unknown. In application, however, a researcher may want a test of fit either for an inverse Gaussian distribution with one parameter known or for an inverse Gaussian distribution with both the two partameters known. In this paper, we introduce tests of fit for the inverse Gaussian distribution based on the Kullback-Leibler information as an extension of the entropy-based test. A window size should be chosen to implement the proposed tests. By means of Monte Carlo simulations, window sizes are determined for a wide range of sample sizes and the corresponding critical values of the test statistics are estimated. The results of power analysis for various alternatives report that the Kullback-Leibler information-based goodness-of-fit tests have good power.

A Modi ed Entropy-Based Goodness-of-Fit Tes for Inverse Gaussian Distribution (역가우스분포에 대한 변형된 엔트로피 기반 적합도 검정)

  • Choi, Byung-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.383-391
    • /
    • 2011
  • This paper presents a modified entropy-based test of fit for the inverse Gaussian distribution. The test is based on the entropy difference of the unknown data-generating distribution and the inverse Gaussian distribution. The entropy difference estimator used as the test statistic is obtained by employing Vasicek's sample entropy as an entropy estimator for the data-generating distribution and the uniformly minimum variance unbiased estimator as an entropy estimator for the inverse Gaussian distribution. The critical values of the test statistic empirically determined are provided in a tabular form. Monte Carlo simulations are performed to compare the proposed test with the previous entropy-based test in terms of power.

Electric Resistive Tomography using Finite Element Method and Genet (유한요소법과 유전 알고리즘을 이용한 전기비저항 탐사법의 저항역산)

  • Lim, Sung-Ki;Kim, Min-Kyu;Kim, Hong-Kyu;Jung, Hyun-Kyo
    • Proceedings of the KIEE Conference
    • /
    • 1997.07a
    • /
    • pp.3-5
    • /
    • 1997
  • 지구 물리학이나 의공학 분야등에서 이용되왔던 전기비저항 탐사법은 관심 영역에 전류 입력을 가한 후, 그에 대한 전압 응답을 측정하여 관심 영역 내의 전기비저항 분포를 규명하는 방법으로서 역해석 문제의 범주에 포함된다. 따라서 일반적인 역해석 문제가 지니고 있는 해의 존재성, 유일성, 그리고 측정 데이터에 대한 해의 연속적 의존성이라는 기본적 문제들을 가지게된다. 이러한 역해석 문제의 해결에는 정확한 정해석 풀이법과 효율적인 역해석 방법이 요구되어진다. 본 논문에서는 정해석 방법으로 유한요소법을, 역해석 방법으로는 전체 최적점을 발견할 가능성이 높은 유전 알고리즘을 최적화 방법으로 사용하였다. 기존의 역해석 문제의 해결책으로 제시되어왔던 기울기 방법에 기반한 결정론적 최적화 알고리즘들이 지니고 있는 국소해로의 수렴, 즉 단순한 전기비저항 분포의 불연속성 확인이라는 한정된 정보의 획득을 넘어서 실제 전기비저항 분포와 가장 가까운 분포는 전체 최적점 근처에서 발견될 수 있음을 보이고자 한다. 이러한 전기비저항 분포의 역해석적인 규명을 간단한 2차원 수치해석문제를 풀어보므로서 확인해본다.

  • PDF

A Graphical Method to Assess Goodness-of-Fit for Inverse Gaussian Distribution (역가우스분포에 대한 적합도 평가를 위한 그래프 방법)

  • Choi, Byungjin
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.37-47
    • /
    • 2013
  • A Q-Q plot is an effective and convenient graphical method to assess a distributional assumption of data. The primary step in the construction of a Q-Q plot is to obtain a closed-form expression to represent the relation between observed quantiles and theoretical quantiles to be plotted in order that the points fall near the line y = a + bx. In this paper, we introduce a Q-Q plot to assess goodness-of-fit for inverse Gaussian distribution. The procedure is based on the distributional result that a transformed random variable $Y={\mid}\sqrt{\lambda}(X-{\mu})/{\mu}\sqrt{X}{\mid}$ follows a half-normal distribution with mean 0 and variance 1 when a random variable X has an inverse Gaussian distribution with location parameter ${\mu}$ and scale parameter ${\lambda}$. Simulations are performed to provide a guideline to interpret the pattern of points on the proposed inverse Gaussian Q-Q plot. An illustrative example is provided to show the usefulness of the inverse Gaussian Q-Q plot.

Assessment of Soil Loss Estimated by Soil Catena Originated from Granite and Gneiss in Catchment (소유역단위 화강암/편마암 기원 토양 연접군(catena)에 따른 토양 유실 평가)

  • Hur, Seung-Oh;Sonn, Yeon-Kyu;Jung, Kang-Ho;Park, Chan-Won;Lee, Hyun-Hang;Ha, Sang-Keun;Kim, Jeong-Gyu
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.40 no.5
    • /
    • pp.383-391
    • /
    • 2007
  • This study was conducted for an assessment through the estimation of soil loss by each catchment classified by soil catena. Ten catchments, which are Geumgang21, Namgang03, Dongjincheon, Gapyongcheon01, Gyongancheon02, Geumgang16, Byongsungcheon01, Daesincheon, Bukcheon02, Youngsangang08, were selected from the hydrologic unit map and the detailed soil digital map (1:25,000) for this study. The catchments like Geumgang21, Namgang03, Dongjincheon, Gapyongcheon01 and Gyongancheon02 were mainly composed with soils originated from gneiss. The catchments like Geumgang16, Byongsungcheon01, Daesincheon, Bukcheon02 and Youngsangang08 were mainly composed with soils originated from granites. The grades, which are divided into seven grades with A(very tolerable), B(tolerable), C(moderate), D(low), E(high), F(severe), G(very severe), of soil erosion estimated by USLE in catchments were distributed in most A and B because of paddy land and forestry. In detailed, the soil erosion grade of catchments mainly distributing soils originated from gneiss showed more the distribution of B and C than it of catchments mainly distributing soils originated from granites. The reason of results would be derived from topographic characteristics of soils originated from gneiss located at mountainous. The soil loss according to soil catena linked with Songsan and Jigok series, which are soils originated from gneiss was calculated with $7.66ton\;ha^{-1}\;yr^{-1}$. The soil loss of Geumgang16, Byongsungcheon01, Daesincheon, Bukcheon02 which have the soil catena linked with Samgak and Sangju soil series originated from granite, was calculated with $5.55ton\;ha^{-1}\;yr^{-1}$. The soil loss of Youngsangang08 which have the soil catena linked with Songjung and Baeksan soil series originated from granite was calculated with $9.6ton\;ha^{-1}\;yr^{-1}$, but the conclusion on soil loss in this kind of soil catena would be drawn from the analysis of more catchments. In conclusion, the results of this study inform that the classification of soil catena by catchments and estimation of soil loss according to soil catena would be effective for analysis on the grade of non-point pollution by soil erosion in a catchment.

Formation and Characteristic of Summer Fronts between Cheju and Shanghai (제주도-상해간 여름철 전선역 형성과 특성)

  • 허만영;최영환
    • Proceedings of the Korean Society of Fisheries Technology Conference
    • /
    • 2000.10a
    • /
    • pp.164-164
    • /
    • 2000
  • 일반적으로 해양에서 전선이란 서로 다른 수괴간의 불연속면을 일컫는다. 이러한 전선의 양측 면에는 유속, 수온, 염분, 수질 등이 급변한다. 본 연구는 1997년 8월 26일부터 9월 2일까지 제주도 서부에서 중국 상해 양쯔강 하구역까지 21개 정점에서 관측된 물리ㆍ화학적 관측자료로부터 연안과 외양간의 수온, 염분, 밀도 등 물리적 인자 특성으로부터 전선역을 찾아내고 전선역을 중심으로 양측의 수질특성을 용존산소, 인산염, 질산염, 규산염 등 영양염류의 분포 특성과 아울러 생산력 인자인 엽록소 a의 분포량을 파악하여 전선역을 중심으로 한 수괴간의 특성을 규명하였다

  • PDF

Multivariate empirical distribution plot and goodness-of-fit test (다변량 경험분포그림과 적합도 검정)

  • Hong, Chong Sun;Park, Yongho;Park, Jun
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.4
    • /
    • pp.579-590
    • /
    • 2017
  • The multivariate empirical distribution function could be defined when its distribution function can be estimated. It is known that bivariate empirical distribution functions could be visualized by using Step plot and Quantile plot. In this paper, the multivariate empirical distribution plot is proposed to represent the multivariate empirical distribution function on the unit square. Based on many kinds of empirical distribution plots corresponding to various multivariate normal distributions and other specific distributions, it is found that the empirical distribution plot also depends sensitively on its distribution function and correlation coefficients. Hence, we could suggest five goodness-of-fit test statistics. These critical values are obtained by Monte Carlo simulation. We explore that these critical values are not much different from those in text books. Therefore, we may conclude that the proposed test statistics in this work would be used with known critical values with ease.