• Title/Summary/Keyword: Stepwise selection

검색결과 156건 처리시간 0.032초

연속하는 공간적 특징의 시간적 유사성 검출을 이용한 고속 동영상 검색 (Fast Video Detection Using Temporal Similarity Extraction of Successive Spatial Features)

  • 조아영;양원근;조주희;임예은;정동석
    • 한국통신학회논문지
    • /
    • 제35권11C호
    • /
    • pp.929-939
    • /
    • 2010
  • 멀티미디어 기술이 발전함에 따라 대용량의 데이터베이스의 관리와 불법 복제물 검출을 위한 동영상 검색의 필요성이 커지고 있다. 본 논문에서는 이러한 요구에 맞춰 대용량 데이터베이스에서 고속 동영상 검색을 수행할 수 있는 방법을 제안한다. 고속 동영상 검색 방법은 프레임의 휘도 분포를 이용하여 공간적 특징을 추출하고, 동영상의 시간적 유사성 지도를 생성하여 시간적 특정을 추출한다. 동영상의 공간적 특정과 시간적 특정을 식별자로 구성하고 단계적인 정합 방법을 수행한다. 실험에서는 원본 동영상과 밝기 변화, 압축률 변환, 자막/로고 삽입과 같은 다양한 변형을 이용하여 정확성, 추출 및 정합 속도, 식별자 크기를 측정하여 성능을 평가하였다. 또한, 제안한 방법의 파라미터를 실험적으로 선택한 과정을 기술하고 비교 알고리즘과 공간적 특정만을 이용한 단순 정합 결과를 제시하였다. 정확성, 경색 속도 식별자 크기의 모든 결과에서, 제안한 고속 검색 방법이 대용량 데이터베이스의 동영상 경색에 가장 적합한 기술임을 보였다.

데이터마이닝을 활용한 한국프로야구 승패예측모형 수립에 관한 연구 (Using Data Mining Techniques to Predict Win-Loss in Korean Professional Baseball Games)

  • 오윤학;김한;윤재섭;이종석
    • 대한산업공학회지
    • /
    • 제40권1호
    • /
    • pp.8-17
    • /
    • 2014
  • In this research, we employed various data mining techniques to build predictive models for win-loss prediction in Korean professional baseball games. The historical data containing information about players and teams was obtained from the official materials that are provided by the KBO website. Using the collected raw data, we additionally prepared two more types of dataset, which are in ratio and binary format respectively. Dividing away-team's records by the records of the corresponding home-team generated the ratio dataset, while the binary dataset was obtained by comparing the record values. We applied seven classification techniques to three (raw, ratio, and binary) datasets. The employed data mining techniques are decision tree, random forest, logistic regression, neural network, support vector machine, linear discriminant analysis, and quadratic discriminant analysis. Among 21(= 3 datasets${\times}$7 techniques) prediction scenarios, the most accurate model was obtained from the random forest technique based on the binary dataset, which prediction accuracy was 84.14%. It was also observed that using the ratio and the binary dataset helped to build better prediction models than using the raw data. From the capability of variable selection in decision tree, random forest, and stepwise logistic regression, we found that annual salary, earned run, strikeout, pitcher's winning percentage, and four balls are important winning factors of a game. This research is distinct from existing studies in that we used three different types of data and various data mining techniques for win-loss prediction in Korean professional baseball games.

슈퍼 그래픽의 이미지와 선호성 분석에 관한 연구 -시각디자인 요 소를 중심으로- (Studies on the Analysis of Super Graphic Image and Preference -with Visual Design Element-)

  • 나성숙
    • 한국조경학회지
    • /
    • 제20권4호
    • /
    • pp.54-75
    • /
    • 1993
  • The purpose of this thesis is to suggest objective basic data for the super graphics in the urban landscape through the quantitative visual quality analysis. For this, the image structure of super graphics have been measured mainly by questionnaries and semantic differential scle method and analyzed by the method of factor analysis, means and multiple regression. Degree of visual preference have been measured mainly by questionnaries and likert attitude scale method and finaly these data have been analyzed by using the stepwise method. The data were collected by presenting 12 super graphics photographs-4 each sample pictures from the 3 each selected districts representing typical urban landscape style(central business district, shopping district, apartment complex). Observer groups were categorized as professionals, students, the others. Result of this thesis can be summarized as fallows: 1. From all 12(3${\times}$4) sample super graphics, the value of each semantic differential scale among the observer groups were presented significant group difference. But no significant difference of the S.D. scale value were observed among central business district, shopping district and apartment complex super graphics. 2. For all experimental points, 4 types of factor have been observed. Factors covering the image of super graphics were found to be the evaluation, the intimacy, the potentiality and the tidiness. 3. Main factors of the super graphics image and factors indicating the group variations yielded high significance between areas. 4. The harmony with surrounding environment, the proper selection of super graphics subject yielded high values for all groups. Especially, the good color sense with building was the most important variable determining the degree of visual preference. 5. The urban C.B.D. super graphics obtained 5∼12 ranks of regional visual preference and the shopping district super graphics obtained 2∼11 ranks, and apartment complex super graphics obtained 1∼7 ranks.

  • PDF

분광특성 분석에 의한 논 잡초 검출의 기초연구 (A Fundamental Study on Detection of Weeds in Paddy Field using Spectrophotometric Analysis)

  • 서규현;서상룡;성제훈
    • Journal of Biosystems Engineering
    • /
    • 제27권2호
    • /
    • pp.133-142
    • /
    • 2002
  • This is a fundamental study to develop a sensor to detect weeds in paddy field using machine vision adopted spectralphotometric technique in order to use the sensor to spread herbicide selectively. A set of spectral reflectance data was collected from dry and wet soil and leaves of rice and 6 kinds of weed to select desirable wavelengths to classify soil, rice and weeds. Stepwise variable selection method of discriminant analysis was applied to the data set and wavelengths of 680 and 802 m were selected to distinguish plants (including rice and weeds) from dry and wet soil, respectively. And wavelengths of 580 and 680 nm were selected to classify rice and weeds by the same method. Validity of the wavelengths to distinguish the plants from soil was tested by cross-validation test with built discriminant function to prove that all of soil and plants were classified correctly without any failure. Validity of the wavelengths for classification of rice and weeds was tested by the same method and the test resulted that 98% of rice and 83% of weeds were classified correctly. Feasibility of CCD color camera to detect weeds in paddy field was tested with the spectral reflectance data by the same statistical method as above. Central wavelengths of RGB frame of color camera were tried as tile effective wavelengths to distingush plants from soil and weeds from plants. The trial resulted that 100% and 94% of plants in dry soil and wet soil, respectively, were classified correctly by the central wavelength or R frame only, and 95% of rice and 85% of weeds were classified correctly by the central wavelengths of RGB frames. As a result, it was concluded that CCD color camera has good potential to be used to detect weeds in paddy field.

충청남도 4대수계 주요 지류하천 수질 모니터링을 통한 유역 관리 방안 (Watershed Management Plan through Water Quality Monitoring for Main Branches of 4 Water Systems in Chungcheongnamdo)

  • 박상현;김홍수;조병욱;문은호;최진하
    • 한국물환경학회지
    • /
    • 제32권2호
    • /
    • pp.163-172
    • /
    • 2016
  • This study aimed to develop a plan for effective performance of a watershed through correct identification of a river watershed by using the flowrate of the river and water quality data, which is the basis for the establishment of the water environment policy. The target river for water quality improvement was selected based on the monitoring result for 4 water systems in Chungcheongnamdo province in the recent 3 years. The result of analysis for the distribution of discharge capacity by a pollution source group for the water quality improvement target river showed that most of the target river has a high discharge capacity in the water system for living and livestock. Analysis for the density of the total discharge capacity of the whole watershed of Chungcheongnamdo indicated that the river that needs water quality improvement has high BOD concentration and high discharge load density at the point that this river is located. Thus, for efficient watershed management through selection and concentration, Chungcheongnamdo needs to improve the target river in priority. Stepwise planning is also required to establish and execute the water quality improvement in order to satisfy said target water quality, and establish the index for the water improvement rate for its evaluation.

Improvement of Thunderstorm Detection Method Using GK2A/AMI, RADAR, Lightning, and Numerical Model Data

  • Yu, Ha-Yeong;Suh, Myoung-Seok;Ryu, Seoung-Oh
    • 대한원격탐사학회지
    • /
    • 제37권1호
    • /
    • pp.41-55
    • /
    • 2021
  • To detect thunderstorms occurring in Korea, National Meteorological Satellite Center (NMSC) also introduced the rapid-development thunderstorm (RDT) algorithm developed by EUMETSAT. At NMCS, the H-RDT (HR) based on the Himawari-8 satellite and the K-RDT (KR) which combines the GK2A convection initiation output with the RDT were developed. In this study, we optimized the KR (KU) to improve the detection level of thunderstorms occurring in Korea. For this, we used all available data, such as GK2A/AMI, RADAR, lightning, and numerical model data from the recent two years (2019-2020). The machine learning of logistic regression and stepwise variable selection was used to optimize the KU algorithms. For considering the developing stages and duration time of thunderstorms, and data availability of GK2A/AMI, a total of 72 types of detection algorithms were developed. The level of detection of the KR, HR, and KU was evaluated qualitatively and quantitatively using lightning and RADAR data. Visual inspection using the lightning and RADAR data showed that all three algorithms detect thunderstorms that occurred in Korea well. However, the level of detection differs according to the lightning frequency and day/night, and the higher the frequency of lightning, the higher the detection level is. And the level of detection is generally higher at night than day. The quantitative verification of KU using lightning (RADAR) data showed that POD and FAR are 0.70 (0.34) and 0.57 (0.04), respectively. The verification results showed that the detection level of KU is slightly better than that of KR and HR.

어린이·청소년의 비스페놀 A 인체 노출에 영향을 미치는 요인: 제3기 국민환경보건 기초조사(2015-2017) (Factors Affecting on Human Exposure to Bisphenol A in Children and Adolescents: Korean National Environmental Health Survey (KoNEHS) Cycle 3, 2015-2017)

  • 정선경;신형호;박상신
    • 한국환경보건학회지
    • /
    • 제47권1호
    • /
    • pp.87-100
    • /
    • 2021
  • Objectives: The purpose of this study was to analyze the factors affecting Bisphenol A (BPA) exposure in children and adolescents using the results of the Korean National Environmental Health Survey (KoNEHS) cycle 3. Methods: A total of 2,380 subjects (n=571, 887, and 922 for 3-5, 6-11, and 12-17 years of age, respectively) were analyzed using an environmental exposure survey and environmental chemical substances concentration levels. Univariable linear regression analysis was performed to determine associated variables such as sex, age, income level, housing type, secondhand smoke time, cup noodles and canned food consumption, seafood consumption, new furniture (within the previous six months), drinking water type, and consumption of herbal medicines. Variables with p-values of less than 0.2 were extracted from the results and a multivariable linear regression analysis was performed using stepwise selection. Results: Univariable linear regression analysis showed positive associations between BPA concentration levels and variables including sex, age, secondhand smoke time, new furniture (within the previous six months), renovated living space (within the previous six months), fish and shellfish consumption, plastic-bottled drink consumption, and herbal medicine. As a result of performing multivariable linear regression analysis, the lower was the age the higher was the concentration of BPA levels. Additionally, women showed higher BPA levels than those of men. The more frequently fish was consumed, the higher was the BPA concentration. Moreover, higher BPA concentrations were observed when taking herbal medicine. Conclusions: The main factors affecting BPA concentration levels were age, gender, and consumption of fish and herbal medicine.

소비자 사이의 중고 태블릿PC 거래 가격의 통계적 예측 (Statistical Prediction of Used Tablet PC Transaction Price among Consumers)

  • 고영희;김소형;정유진
    • 산업융합연구
    • /
    • 제20권12호
    • /
    • pp.179-186
    • /
    • 2022
  • 본 연구에서는 태블릿PC 중고제품의 거래 시, 판매자와 구매자 모두에게 판매가격을 제시할 수 있는 예측모형을 개발하는 것을 목표로 한다. 모형 개발을 위하여 실제 태블릿PC 중고거래 데이터와 제품에 대한 상세 정보를 추가 수집한 데이터를 사용하였다. 데이터 분석을 통하여 여러 가지 예측모형을 개발하였으며, 이 중 태블릿PC 중고가격 예측 성능이 가장 뛰어난 모형을 최종 예측모형으로 선택하였다. 구체적으로 중고 태블릿의 판매가격을 종속변수로 하고, 통합된 데이터에서 판매가격과 연관성이 있는 변수들을 독립변수로 한 다중선형회귀모형, 교호작용을 포함한 다중선형회귀모형, 그리고 각 모형에서 단계적 변수 선택법을 통해 얻은 모형들을 고려하였다. 이들 모형 중 교차타당성을 통해 최종적으로 예측 성능이 가장 뛰어난 모형을 태블릿PC 중고가격을 예측하는 모형으로 선택하였다. 본 연구를 통하여 중고제품 판매가격을 예측하고 판매자와 구매자에게 적절한 중고 거래 가격을 제시해 볼 수 있을 것이다.

Assessment of growing condition variables on alfalfa productivity

  • Ji Yung Kim;Kun Jun Han;Kyung Il Sung;Byong Wan Kim;Moonju Kim
    • Journal of Animal Science and Technology
    • /
    • 제65권5호
    • /
    • pp.939-950
    • /
    • 2023
  • This study was conducted to assess the impact of growing condition variables on alfalfa (Medicago sativa L.) productivity. A total of 197 alfalfa yield results were acquired from the alfalfa field trials conducted by the South Korean National Agricultural Cooperative Federation or Rural Development Administration between 1983 and 2008. The corresponding climate and soil data were collected from the database of the Korean Meteorological Administration. Twenty-three growing condition variables were developed as explaining variables for alfalfa forage biomass production. Among them, twelve variables were chosen based on the significance of the partial-correlation coefficients or potential agricultural values. The selected partial correlation coefficients between the variables and alfalfa forage biomass ranged from -0.021 to 0.696. The influence of the selected twelve variables on yearly alfalfa production was summarized into three dominant factors through factor analysis. Along with the accumulated temperature variables, the loading scores of the daily mean temperature higher than 25℃ were over 0.88 in factor 1. The sunshine duration at temperature between 0℃-25℃ was 0.939 in factor 2. Precipitation days were 0.82, which was the greatest in factor 3. Stepwise regression applied with the three dominant factors resulted in the coefficients of factors 1, 2, and 3 for 0.633, 0.485, and 0.115, respectively, and the R-square of the model was 0.602. The environmental conditions limiting alfalfa growth, such as daily temperature higher than 25℃ or daily mean temperature affected annual alfalfa production most substantially among the growing condition variables. Therefore, future cultivar selection should consider the capability of alfalfa to be tolerant to extreme summer weather along with biomass production potential.

한우의 성장 및 도체형질에 대한 유전모수 추정 (Genetic Parameter Estimation on the Growth and Carcass Traits in Hanwoo(Korean Cattle))

  • 최태정;김시동;;백동훈
    • Journal of Animal Science and Technology
    • /
    • 제48권6호
    • /
    • pp.759-766
    • /
    • 2006
  • 본 연구는 농협중앙회 가축개량사업소가 보유하고 있는 1998년부터 2005년까지 실시한 한우 당대검정 대상우 2,532두의 자료와 1996년부터 2002년까지 실시한 한우 후대검정 대상우 1,819두의 자료를 이용하여 당대 및 후대의 형질을 다형질 개체모형을 이용하여 유전모수를 추정하였다. 적합한 통계분석모형을 찾기 위하여 각 형질별로 회귀분석을 통한 변수선택방법으로 고정효과와 공변량을 결정하고, MTDFREML 패키지를 이용하여 유전모수를 추정하였다. 분석형질은 등지방두께, 도체중, 도체율, 등심단면적, 근내지방도 및 12개월령 체중으로, 이들의 유전력은 각각 0.51, 0.32, 0.27, 0.33, 0.50 및 0.26로 나타났다. 한편 유전평가에서 제외되었던 등지방두께 및 도체율의 유전력이 각각 0.51 및 0.27으로 추정되어 이에 대한 선발이 가능할 것으로 나타났다. 특히 등지방두께는 현 육량지수 산출식에서 그 영향력이 매우 크므로 한우보증씨수소 선발지수에 포함하는 방안을 고려할 필요가 있을 것으로 사료된다. 도체형질간의 유전상관은 등지방두께와 등심단면적을 제외하고 모두 양의 상관을 나타났다. 그러나, 12개월령 보정체중과 도체율 및 근내지방도의 유전상관은 각각 0.09 및 0.27으로 나타나 현행 한우개량체계에서 처럼 당대검정우를 12개월령 체중으로 선발할 경우 등심단면적과 근내지방도가 우수한 개체가 탈락할 가능성이 있는 것으로 나타나 이의 보완을 위한 혈통지수의 활용방안이나 초음파단층촬영기술의 이용방법에 대한 추가연구가 필요한 것으로 사료된다.