• 제목/요약/키워드: principle of text distribution

검색결과 7건 처리시간 0.022초

언어 텍스트에 나타나는 벤포드 법칙: 원리와 응용 (Benford's Law in Linguistic Texts: Its Principle and Applications)

  • 홍정하
    • 한국언어정보학회지:언어와정보
    • /
    • 제14권1호
    • /
    • pp.145-163
    • /
    • 2010
  • This paper aims to propose that Benford's Law, non-uniform distribution of the leading digits in lists of numbers from many real-life sources, also appears in linguistic texts. The first digits in the frequency lists of morphemes from Sejong Morphologically Analyzed Corpora represent non-uniform distribution following Benford's Law, but showing complexity of numerical sources from complex systems like earthquakes. Benford's Law in texts is a principle reflecting regular distribution of low-frequency linguistic types, called LNRE(large number of rare events), and governing texts, corpora, or sample texts relatively independent of text sizes and the number of types. Although texts share a similar distribution pattern by Benford's Law, we can investigate non-uniform distribution slightly varied from text to text that provides useful applications to evaluate randomness of texts distribution focused on low-frequency types.

  • PDF

분포 개념의 연계성 목표 관점에 따른 중학교 확률 단원 분석 (An Analysis of the 8th Grade Probability Curriculum in Accordance with the Distribution Concepts)

  • 이영하;허지영
    • 대한수학교육학회지:수학교육학연구
    • /
    • 제20권2호
    • /
    • pp.163-183
    • /
    • 2010
  • 본 연구는 6차 교육과정이래 현재까지 사용 중인 중학교 2학년(8단계) 교육과정중에 확률단원의 개선 방안에 관한 것이다. 이들 교육과정에 따르면 확률단원은 경우의 수와 합사건, 곱사건 등의 확률 계산법을 포함하고 있으며, 확률의 의미는 수학적 확률 또는 통계적 확률의 의미를 사용하도록 되어있다. 그러나 확률의 의미를 통계적 확률의 의미로 사용하려면, 모든 확률에 대한 논의에 있어서 상대도수가 중심이 되어야 하는데, 경우의 수가 들어 있으므로 경우의 수에 관한 논의가 확률논의와 연결성이 없거나, 연결성을 살리기 위해 수학적 확률을 사용하게 된다. 이런 현상은 결국 많은 교과서들이 확률의 정의에서는 통계적 확률로 정의하고, 확률의 계산에 관한 논의는 수학적 확률로 하게 되는 결과를 초래하고 있다. 그 결과 학생들의 입장에서는 매우 혼란스러운 상태가 초래된다고 여겨진다. 본 연구는 확률의 계산 역시 상대도수 중심으로 논의하는 방안을 제시하고, 아울러 그런 교육과정의 변화가 단순히 확률의 정의의 변화만이 아닌, 단원 전체의 유기적 관계를 고려한 변화를 얻는 방안을 제안하려는 것이다.

  • PDF

조선 후기 의안(醫案) 『경보신편(輕寶新編)』 연구 (Study of Gyeongbosinpyeon, a Late Joseon Medical Records)

  • 전종욱
    • 대한한의학원전학회지
    • /
    • 제30권1호
    • /
    • pp.185-209
    • /
    • 2017
  • Objectives : The objective of this paper is to review the healing processes employed in the traditional age and discover the unique features found in the Korean Medicine through categorizing and analyzing the distribution of patients, and the aspects and results of treatments as recorded in Gyeongbosinpyeon, a historical text thought to have been authored by a regional doctor active in Joseon during the mid- to late-19th century. Methods : A table is created to view all of the total of 141 medical records introduced in the Gyeongbosinpyeon, and 7 categories were created to each contain 2 to 3 medical records that have special images. The paper provides their translation texts along with the original texts, and analyzed their medical and social significances by comparing each medical record. Results : The clinical competence displayed by the doctor who had worked in Joseon during the 19th century was surprisingly high, and it seems its values are worthy of dissemination when compared with Yeogsimanpil that has been introduced to the world. There is a great significance in how the principle of holistic treatments, the fundamental aspect of Joseon's medical study, was adhered. Additionally, the parts that show the historical text's author's medical activities and their unique characteristics are also worthy of attention. Conclusions : Korean medicine possesses a remarkable text called Donguibogam, but clinical behaviors' successes are not guaranteed solely with textual knowledge. It can be witnessed that such texts of authority and such medical records that have recorded actual activities complement each other in order to improve the quality of Joseon's study of medicine.

다요소 가중 평균법을 이용한 인공지능 기술 개발전략 연구 (A Study on the Development Strategy of Artificial Intelligence Technology Using Multi-Attribute Weighted Average Method)

  • 장해각;최일영;김재경
    • 한국IT서비스학회지
    • /
    • 제19권2호
    • /
    • pp.93-107
    • /
    • 2020
  • Recently, artificial intelligence (AI) technologies has been widely used in various fields such as finance, and distribution. Accordingly, Korea has also announced its AI R&D strategy for the realization of i-Korea 4.0 in May 2018. However, Korea's AI technology is inferior to major competitors such as the US, Canada, and Japan Therefore, in order to cope with the 4th industrial revolution, it is necessary to allocate AI R&D budgets efficiently through selection and concentration so as to gain competitive advantage under a limited budget. In this study, the importance of each AI technology was evaluated in multi-dimensional way through the questionnaire of expert group using the evaluation index derived from the literature review From the results of this study, we draw the following implication. In order to successfully establish the AI technology development strategies, it is necessary to prioritize the cognitive computing technology that has great market growth potential, ripple effect of technology development, and the urgency of technology development according to the principle of selection and concentration. To this end, it is necessary to find creative ideas, manage assessments, converge multidisciplinary systems and strengthen core competencies. In addition, since AI technology has a large impact on socioeconomic development, it is necessary to comprehensively grasp and manage scientific and technological regulations in order to systematically promote AI technology development.

지역상권 활성화 및 효율적 관리를 위한 제도 개선방안 연구 (Study on Improving the System for the Revitalization and Efficient Management of the Local Commercial Area)

  • 김승희;김영기
    • 유통과학연구
    • /
    • 제11권5호
    • /
    • pp.55-62
    • /
    • 2013
  • Purpose - This study aims to determine the problems and limitations of the Commercial Area Activation System, which was created by a special law for promoting traditional markets and shopping districts to revitalize and efficiently manage the central commercial area in different regions. We also suggest different options for its improvement. Research design, data, and methodology - We also look into the problems of which is being promoted as a demonstration project, from the aspects of legal text and guidelines. Results - The current commercial area activation system has several problems. First, the establishment of a comprehensive basic plan on the commercial area activation is not a requirement. Second, the benefit principle should be established to prevent the moral laxity of merchants who serve important roles in the main components of the commercial area activation business when they conduct their business. Third, the current special law constrains the commercial management organization, as under the civil law yields a limitation on finding a profitable business model. Fourth, to efficiently, constructing a system that links the other central government businesses and is needed. into a regional development budget or a budget for funding small businesses that the central government can control, which is effective. Further, we offer some suggestions for medium- and long-term policies. First, an integrated coordination mechanism at the central office level should be installed while setting the basic policy to revitalize the Based on this policy, local governments need a system that exclusively based on the after establishing a comprehensive plan for urban regeneration and getting approval from the integration organization. Second, a system that enables an understanding of the problems with business promotion by monitoring the procedure of supporting projects and regularly assessing business achievements is needed. Third, a plan is needed for resolving conflicts between various interested parties that adopts the commercial area activation system for carrying out a total redevelopment of the commercial area where small shops are densely located. A market maintenance project has been conducted as a means to recover our traditional market, which was economically depressed, and to revive the local economy, but it is mostly conducted in the form of reconstruction or redevelopment and represents the interests of landowners and merchants. Thus, it is most likely to lead to a gradual disappearance of traditional markets. Conclusions - This study looks primarily into the problems that appeared in the legal text or the guidelines regarding the direction of improvement of the commercial area activation business that has been going on as a demonstration project since 2011 and suggests some solutions.

  • PDF

위치기반 소셜 미디어 데이터의 텍스트 마이닝 기반 공간적 클러스터링 분석 연구 (Spatial Clustering Analysis based on Text Mining of Location-Based Social Media Data)

  • 박우진;유기윤
    • 대한공간정보학회지
    • /
    • 제23권2호
    • /
    • pp.89-96
    • /
    • 2015
  • 위치기반 소셜 미디어 데이터는 빅데이터, 위치기반서비스 등 다양한 분야에서 활용가능성이 매우 큰 데이터이다. 본 연구에서는 위치기반 소셜 미디어 데이터의 텍스트 정보를 분석하여 주요한 키워드들이 공간적으로 어떻게 분포하고 있는지를 파악할 수 있는 일련의 분석방법론을 적용해보았다. 이를 위해, 위치태그를 지닌 트윗 데이터를 서울시 강남지역과 그 주변지역에 대하여 2013년 8월 한달 간 수집하였으며, 이 데이터를 대상으로 하여 텍스트 마이닝을 통해 주요 키워드들을 도출하였다. 이러한 키워드들 중 음식, 엔터테인먼트, 업무 및 공부의 세 카테고리에 해당하는 키워드들만 추출, 분류하였으며 각 카테고리에 해당하는 트윗 데이터들에 대해서 공간적 클러스터링을 실시하였다. 도출된 각 카테고리별 클러스터들을 실제 그 지역의 건물 또는 벤치마크 POI들과 비교한 결과, 음식 카테고리 클러스터는 대규모 상업지역들과 일치도가 높았고 엔터테인먼트 카테고리의 클러스터는 공연장, 극장, 잠실운동장 등과 일치하였다. 업무 및 공부 카테고리 클러스터들은 학원 밀집지역 및 사무용 빌딩 밀집지역과 높은 일치도를 나타내었다.

모자건강관리를 위한 위험요인별 감별평점분류기준 개발에 관한 연구 (A Study on the Risk Factors for Maternal and Child Health Care Program with Emphasis on Developing the Risk Score System)

  • 이광옥
    • 대한간호학회지
    • /
    • 제13권1호
    • /
    • pp.7-21
    • /
    • 1983
  • For the flexible and rational distribution of limited existing health resources based on measurements of individual risk, the socalled Risk Approach is being proposed by the World Health Organization as a managerial tool in maternal and child health care program. This approach, in principle, puts us under the necessity of developing a technique by which we will be able to measure the degree of risk or to discriminate the future outcomes of pregnancy on the basis of prior information obtainable at prenatal care delivery settings. Numerous recent studies have focussed on the identification of relevant risk factors as the Prior infer mation and on defining the adverse outcomes of pregnancy to be dicriminated, and also have tried on how to develope scoring system of risk factors for the quantitative assessment of the factors as the determinant of pregnancy outcomes. Once the scoring system is established the technique of classifying the patients into with normal and with adverse outcomes will be easily de veloped. The scoring system should be developed to meet the following four basic requirements. 1) Easy to construct 2) Easy to use 3) To be theoretically sound 4) To be valid In searching for a feasible methodology which will meet these requirements, the author has attempted to apply the“Likelihood Method”, one of the well known principles in statistical analysis, to develop such scoring system according to the process as follows. Step 1. Classify the patients into four groups: Group $A_1$: With adverse outcomes on fetal (neonatal) side only. Group $A_2$: With adverse outcomes on maternal side only. Group $A_3$: With adverse outcome on both maternal and fetal (neonatal) sides. Group B: With normal outcomes. Step 2. Construct the marginal tabulation on the distribution of risk factors for each group. Step 3. For the calculation of risk score, take logarithmic transformation of relative proport-ions of the distribution and round them off to integers. Step 4. Test the validity of the score chart. h total of 2, 282 maternity records registered during the period of January 1, 1982-December 31, 1982 at Ewha Womans University Hospital were used for this study and the“Questionnaire for Maternity Record for Prenatal and Intrapartum High Risk Screening”developed by the Korean Institute for Population and Health was used to rearrange the information on the records into an easy analytic form. The findings of the study are summarized as follows. 1) The risk score chart constructed on the basis of“Likelihood Method”ispresented in Table 4 in the main text. 2) From the analysis of the risk score chart it was observed that a total of 24 risk factors could be identified as having significant predicting power for the discrimination of pregnancy outcomes into four groups as defined above. They are: (1) age (2) marital status (3) age at first pregnancy (4) medical insurance (5) number of pregnancies (6) history of Cesarean sections (7). number of living child (8) history of premature infants (9) history of over weighted new born (10) history of congenital anomalies (11) history of multiple pregnancies (12) history of abnormal presentation (13) history of obstetric abnormalities (14) past illness (15) hemoglobin level (16) blood pressure (17) heart status (18) general appearance (19) edema status (20) result of abdominal examination (21) cervix status (22) pelvis status (23) chief complaints (24) Reasons for examination 3) The validity of the score chart turned out to be as follows: a) Sensitivity: Group $A_1$: 0.75 Group $A_2$: 0.78 Group $A_3$: 0.92 All combined : 0.85 b) Specificity : 0.68 4) The diagnosabilities of the“score chart”for a set of hypothetical prevalence of adverse outcomes were calculated as follows (the sensitivity“for all combined”was used). Hypothetidal Prevalence : 5% 10% 20% 30% 40% 50% 60% Diagnosability : 12% 23% 40% 53% 64% 75% 80%.

  • PDF