• Title/Summary/Keyword: 분류 알고리즘

Search Result 3,152, Processing Time 0.03 seconds

Quality Control of Agro-meteorological Data Measured at Suwon Weather Station of Korea Meteorological Administration (기상청 수원기상대 농업기상 관측요소의 품질관리)

  • Oh, Gyu-Lim;Lee, Seung-Jae;Choi, Byoung-Choel;Kim, Joon;Kim, Kyu-Rang;Choi, Sung-Won;Lee, Byong-Lyol
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.17 no.1
    • /
    • pp.25-34
    • /
    • 2015
  • In this research, we applied a procedure of quality control (QC) to the agro-meteorological data measured at the Suwon weather station of Korea Meteorological Administration (KMA). The QC was conducted through six steps based on the KMA Real-time Quality control system for Meteorological Observation Data (RQMOD) and four steps based on the International Soil Moisture Network (ISMN) QC modules. In addition, we set up our own empirical method to remove erroneous data which could not be filtered by the RQMOD and ISMN methods. After all these QC procedures, a well-refined agro-meteorological dataset was complied at both air and soil temperatures. Our research suggests that soil moisture requires more detailed and reliable grounds to remove doubtful data, especially in winter with its abnormal variations. The raw data and the data after QC are now available at the NCAM website (http://ncam.kr/page/req/agri_weather.php).

Studies on the ecological variations of rice plant under the different seasonal cultures -II. A study on the year variations and prediction of heading dates of paddy rice under the different seasonal cultures- (재배시기 이동에 의한 수도의 생태변이에 관한 연구 -II. 재배시기 이동에 의한 수도출수기의 년차간변이와 그 조기예측-)

  • Hyun-Ok Choi
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.3
    • /
    • pp.41-48
    • /
    • 1965
  • This study was aimed at knowing the magnitude of year variation in rice heading dates under the different seasonal cultures, and to estimate the heading date in advance. Using six rice varieties such as Kwansan, Suwon#82, Suwon #144, Norin#17, Yukoo#132 and Paltal, the early, ordinary and late seasonal cultures had been carried out at Paddy Crop Division, Crop Experiment Station at Suwon for the six-year period 1959 to 1964. In addition the data of the standard rice cultures at the Provincial Offices of Rural Development for the 12-year period 1953 to 1954, were analyzed for the purpose of clarifying a relationship between variation of rice heading dates and some of meteorological data related to the locations and years. The results of this study are as follows: 1. Year variation of rice heading dates was as high as 14 to 21 days in the early seasonal culture and 7 to 14 days in the ordinary seasonal culture, while as low as one to seven days in the late seasonal culture which was the lowest among three cultures. The magnitude of variation depended greatly on variety, cultural season and location. 2. It was found out that there was a close negative correlation between the accumulated average air temperature for 40 days from 31 days after seeding and number of days to heading in the early seasonal culture. Accordingly, it was considered possible to predict the rice heading date through calculation of the accumulated average air temperature for the above period and then the linear regression(Y=a+bx). On the other hand, an estimation of the heading date in the late seasonal culture requires for the further studies. In the ordinary seasonal culture, no significant correlation between the accumulated average air temperature and number of days to heading was obtained in the six-year experiments conducted at Suwon. There was a varietal difference in relationship between the accumulated average air temperature for 70 days from seeding and number of days to heading in the standard cultures at the provincial offices of rural development. Some of varieties showed a significant correlation between two factors while the others didn't show any significant correlation. However, there was no regional difference in this relationship.

  • PDF

Ecoclimatic Map over North-East Asia Using SPOT/VEGETATION 10-day Synthesis Data (SPOT/VEGETATION NDVI 자료를 이용한 동북아시아의 생태기후지도)

  • Park Youn-Young;Han Kyung-Soo
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.8 no.2
    • /
    • pp.86-96
    • /
    • 2006
  • Ecoclimap-1, a new complete surface parameter global database at a 1-km resolution, was previously presented. It is intended to be used to initialize the soil-vegetation- atmosphere transfer schemes in meteorological and climate models. Surface parameters in the Ecoclimap-1 database are provided in the form of a per-class value by an ecoclimatic base map from a simple merging of land cover and climate maps. The principal objective of this ecoclimatic map is to consider intra-class variability of life cycle that the usual land cover map cannot describe. Although the ecoclimatic map considering land cover and climate is used, the intra-class variability was still too high inside some classes. In this study, a new strategy is defined; the idea is to use the information contained in S10 NDVI SPOT/VEGETATION profiles to split a land cover into more homogeneous sub-classes. This utilizes an intra-class unsupervised sub-clustering methodology instead of simple merging. This study was performed to provide a new ecolimatic map over Northeast Asia in the framework of Ecoclimap-2 global database construction for surface parameters. We used the University of Maryland's 1km Global Land Cover Database (UMD) and a climate map to determine the initial number of clusters for intra-class sub-clustering. An unsupervised classification process using six years of NDVI profiles allows the discrimination of different behavior for each land cover class. We checked the spatial coherence of the classes and, if necessary, carried out an aggregation step of the clusters having a similar NDVI time series profile. From the mapping system, 29 ecosystems resulted for the study area. In terms of climate-related studies, this new ecosystem map may be useful as a base map to construct an Ecoclimap-2 database and to improve the surface climatology quality in the climate model.

Predicting Suitable Restoration Areas for Warm-Temperate Evergreen Broad-Leaved Forests of the Islands of Jeollanamdo (전라남도 섬 지역의 난온대 상록활엽수림 복원을 위한 적합지 예측)

  • Sung, Chan Yong;Kang, Hyun-Mi;Park, Seok-Gon
    • Korean Journal of Environment and Ecology
    • /
    • v.35 no.5
    • /
    • pp.558-568
    • /
    • 2021
  • Poor supervision and tourism activities have resulted in forest degradation in islands in Korea. Since the southern coastal region of the Korean peninsula was originally dominated by warm-temperate evergreen broad-leaved forests, it is desirable to restore forests in this region to their original vegetation. In this study, we identified suitable areas to be restored as evergreen broad-leaved forests by analyzing the environmental factors of existing evergreen broad-leaved forests in the islands of Jeollanam-do. We classified forest lands in the study area into six vegetation types from Sentinel-2 satellite images using a deep learning algorithm and analyzed the tolerance ranges of existing evergreen broad-leaved forests by measuring the locational, topographic, and climatic attributes of the classified vegetation types. Results showed that evergreen broad-leaved forests were distributed more in areas with a high altitudes and steep slope, where human intervention was relatively low. The human intervention has led to a higher distribution of evergreen broad-leaved forests in areas with lower annual average temperature, which was an unexpected but understandable result because an area with higher altitude has a lower temperature. Of the environmental factors, latitude and average temperature in the coldest month (January) were relatively less contaminated by the effects of human intervention, thus enabling the identification of suitable restoration areas of the evergreen broad-leaved forests. The tolerance range analysis of evergreen broad-leaved forests showed that they mainly grew in areas south of the latitude of 34.7° and a monthly average temperature of 1.7℃ or higher in the coldest month. Therefore, we predicted the areas meeting these criteria to be suitable for restoring evergreen broad-leaved forests. The suitable areas cover 614.5 km2, which occupies 59.0% of the total forest lands on the islands of Jeollanamdo, and 73% of actual forests that exclude agricultural and other non-restorable forest lands. The findings of this study can help forest managers prepare a restoration plan and budget for island forests.

Comparison of Housewives' Agricultural Food Consumption Characteristics by Age (주부의 연령대별 농식품 소비 특성 비교)

  • Hong, Jun-Ho;Kim, Jin-Sil;Yu, Yeon-Ju;Lee, Kyung-Hee;Cho, Wan-Sup
    • The Journal of Bigdata
    • /
    • v.6 no.1
    • /
    • pp.83-89
    • /
    • 2021
  • Lifestyle is changing rapidly, and food consumption patterns vary widely among households as dietary and food processing technologies evolve. This paper reclassified the food group of consumer panel data established by the Rural Development Administration, which contains information on purchasing agricultural products by household unit, and compared the consumption characteristics of agricultural products by age group. The criteria for age classification were divided into groups in their 60s and older with a prevalence of 20% or more metabolic diseases and groups in their 30s and 40s with less than 10%. Using the LightGBM algorithm, we classified the differences in food consumption patterns in their 30s and 50s and 60s and found that the precision was 0.85, the reproducibility was 0.71, and F1_score was 0.77. The results of variable importance were confectionery, folio, seasoned vegetables, fruit vegetables, and marine products, followed by the top five values of the SHAP indicator: confectionery, marine products, seasoned vegetables, fruit vegetables, and folio vegetables. As a result of binary classification of consumption patterns as a median instead of the average sensitive to outliers, confectionery showed that those in their 30s and 40s were more than twice as high as those in their 60s. Other variables also showed significant differences between those in their 30s and 40s and those in their 60s and older. According to the study, people in their 30s and 40s consumed more than twice as much confectionery as those in their 60s, while those in their 60s consumed more than twice as much marine products, seasoned vegetables, fruit vegetables, and folioce or logistics as much as those in their 30s and 40s. In addition to the top five items, consumption of 30s and 40s in wheat-processed snacks, breads and noodles was high, which differed from food consumption patterns in their 60s.

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.155-174
    • /
    • 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

    ) were the topic modeling results for each research topic (
    ) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.

  • A preliminary study for development of an automatic incident detection system on CCTV in tunnels based on a machine learning algorithm (기계학습(machine learning) 기반 터널 영상유고 자동 감지 시스템 개발을 위한 사전검토 연구)

    • Shin, Hyu-Soung;Kim, Dong-Gyou;Yim, Min-Jin;Lee, Kyu-Beom;Oh, Young-Sup
      • Journal of Korean Tunnelling and Underground Space Association
      • /
      • v.19 no.1
      • /
      • pp.95-107
      • /
      • 2017
    • In this study, a preliminary study was undertaken for development of a tunnel incident automatic detection system based on a machine learning algorithm which is to detect a number of incidents taking place in tunnel in real time and also to be able to identify the type of incident. Two road sites where CCTVs are operating have been selected and a part of CCTV images are treated to produce sets of training data. The data sets are composed of position and time information of moving objects on CCTV screen which are extracted by initially detecting and tracking of incoming objects into CCTV screen by using a conventional image processing technique available in this study. And the data sets are matched with 6 categories of events such as lane change, stoping, etc which are also involved in the training data sets. The training data are learnt by a resilience neural network where two hidden layers are applied and 9 architectural models are set up for parametric studies, from which the architectural model, 300(first hidden layer)-150(second hidden layer) is found to be optimum in highest accuracy with respect to training data as well as testing data not used for training. From this study, it was shown that the highly variable and complex traffic and incident features could be well identified without any definition of feature regulation by using a concept of machine learning. In addition, detection capability and accuracy of the machine learning based system will be automatically enhanced as much as big data of CCTV images in tunnel becomes rich.

    An Study on the Correlation between Sound Characteristics and Sasang Constitution by CSL (CSL을 통한 음향특성과 사상체질간의 상관성 연구)

    • Shin, Mi-ran;Kim, Dal-lae
      • Journal of Sasang Constitutional Medicine
      • /
      • v.11 no.1
      • /
      • pp.137-157
      • /
      • 1999
    • The purpose of this study is to help classifying Sasang Constitution through correlation with sound characteristic. This study was done it under the suppose that Sasang Constitution has correlation with sound spectrogram. The following result were obtained about correlation between sound spectrogram and Sasang Constitution by comparison and analysis 1. Soeumin answered his voice low tone, smooth and quiet in the survey. Soyangin answered his voice high, clear, fast and speaking random. Taeumin answered his voice low, thick and muddy. 2. Taeyangin was significantly slow compared with the others in the time of reading composition. Taeyangin was significantly slow compared with the others in Formant frequency 1. Taeyangin was significantly discriminated from Soeumin in Formant frequency 5. Taeyangin was significantly low compared with the others in Bandwidth 2. Soeumln was significantly low compared with Taeyangin in Pitch Maximum and Pitch Maximum-Pitch Minimum. Taeyangin was significantly high compared with the others in Energy mean. 3. In list of specification, the discrimination rate was higher than that by lists of 13 in the results of Multi-dimensional 4-class minimum-distance. The discrimination rate of three disposition except Soyangin was higher than that of four disposition in the results of One way ANOVA and Analysis of dis crimination in SPSS/PC+. In CART, the estimate rate of Sasang Constitution discrimination was higher than any other method. It is considered that there is a correlation between sound spectrogram and Sasang constitution according to the results. And method of Sasang constitution classification through sound spectrogram analysis can be one method as assistant for the objectification of Sasang constitution classification.

    • PDF

    An Implementation of Dynamic Gesture Recognizer Based on WPS and Data Glove (WPS와 장갑 장치 기반의 동적 제스처 인식기의 구현)

    • Kim, Jung-Hyun;Roh, Yong-Wan;Hong, Kwang-Seok
      • The KIPS Transactions:PartB
      • /
      • v.13B no.5 s.108
      • /
      • pp.561-568
      • /
      • 2006
    • WPS(Wearable Personal Station) for next generation PC can define as a core terminal of 'Ubiquitous Computing' that include information processing and network function and overcome spatial limitation in acquisition of new information. As a way to acquire significant dynamic gesture data of user from haptic devices, traditional gesture recognizer based on desktop-PC using wire communication module has several restrictions such as conditionality on space, complexity between transmission mediums(cable elements), limitation of motion and incommodiousness on use. Accordingly, in this paper, in order to overcome these problems, we implement hand gesture recognition system using fuzzy algorithm and neural network for Post PC(the embedded-ubiquitous environment using blue-tooth module and WPS). Also, we propose most efficient and reasonable hand gesture recognition interface for Post PC through evaluation and analysis of performance about each gesture recognition system. The proposed gesture recognition system consists of three modules: 1) gesture input module that processes motion of dynamic hand to input data 2) Relational Database Management System(hereafter, RDBMS) module to segment significant gestures from input data and 3) 2 each different recognition modulo: fuzzy max-min and neural network recognition module to recognize significant gesture of continuous / dynamic gestures. Experimental result shows the average recognition rate of 98.8% in fuzzy min-nin module and 96.7% in neural network recognition module about significantly dynamic gestures.

    Comparative Study of KOMPSAT-1 EOC Images and SSM/I NASA Team Sea Ice Concentration of the Arctic (북극의 KOMPSAT-1 EOC 영상과 SSM/I NASA Team 해빙 면적비의 비교 연구)

    • Han, Hyang-Sun;Lee, Hoon-Yol
      • Korean Journal of Remote Sensing
      • /
      • v.23 no.6
      • /
      • pp.507-520
      • /
      • 2007
    • Satellite passive microwave(PM) sensors have been observing polar sea ice concentration(SIC), ice temperature, and snow depth since 1970s. Among them SIC is playing an important role in the various studies as it is considered the first factor for the monitoring of global climate and environment changes. Verification and correction of PM SIC is essential for this purpose. In this study, we calculated SIC from KOMPSAT-1 EOC images obtained from Arctic sea ice edges from July to August 2005 and compared with SSM/I SIC calculated from NASA Team(NT) algorithm. When we have no consideration of sea ice types, EOC and SSM/I NT SIC showed low correlation coefficient of 0.574. This is because there are differences in spatial resolution and observing time between two sensors, and the temporal and spatial variation of sea ice was high in summer Arctic ice edge. For the verification of SSM/I NT SIC according to sea ice types, we divided sea ice into land-fast ice, pack ice, and drift ice from EOC images, and compared them with SSM/I NT SIC corresponding to each ice type. The concentration of land-fast ice between EOC and SSM/I SIC were calculated very similarly to each other with the mean difference of 0.38%. This is because the temporal and spatial variation of land-fast ice is small, and the snow condition on the ice surface is relatively dry. In case of pack ice, there were lots of ice ridge and new ice that are known to be underestimated by NT algorithm. SSM/I NT SIC were lower than EOC SIC by 19.63% in average. In drift ice, SSM/I NT SIC showed 20.17% higher than EOC SIC in average. The sea ice with high concentration could be included inside the wide IFOV of SSM/I because the drift ice was located near the edge of pack ice. It is also suggested that SSM/I NT SIC overestimated the drift ice covered by wet snow.


    (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.