• Title/Summary/Keyword: 선형회귀 모델

Search Result 440, Processing Time 0.026 seconds

A FFP-based Model to Estimate Software Development Cost (소프트웨어 개발비용을 추정하기 위한 FFP 기반 모델)

  • Park, Ju-Seok;Chong, Ki-Won
    • The KIPS Transactions:PartD
    • /
    • v.10D no.7
    • /
    • pp.1137-1144
    • /
    • 2003
  • The existing Function Point method to estimate the software size has been utilized frequently with the management information system. Due to the expanding usage of the real-time and embedded system, the Full Function Point method is being proposed. However, despite many research is being carried out relation to the software size, the research on the model to estimate the development cost from the measured software size is inadequate. This paper analyzed the linear regression model and power regression model which estimate the development cost from the software FFP The power model is selected, which shows its estimation is most adequate.

A Model for Software Effort Estimation in the Development Subcycles (소프트웨어 개발 세부단계 노력 추정 모델)

  • 박석규;박영목;박재흥
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.6
    • /
    • pp.859-866
    • /
    • 2001
  • Successful project planning relies on a good estimation of the effort required to complete a project, together with the schedule options that may be available. Despite the extensive research done developing new and better models, existing software effort estimation models are present only the total effort and effort (or manpower: people per unit time) function for the software life-cycle. Also, Putnam presents constant effort rate in each subcycles. However, the size of total efforts are variable according to the software projects under the influence of its size, complexity and operational environment. As a result, the allocated effort in subcycle also differ from project to project. This paper suggests the linear and polynomial effort estimation models in specifying, building and testing phase followed by the project total effort. These models are derived from 128 different projects. This result can be considered as a practical guideline in management of project schedule and effort allocation.

  • PDF

The Unsupervised Learning-based Language Modeling of Word Comprehension in Korean

  • Kim, Euhee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.11
    • /
    • pp.41-49
    • /
    • 2019
  • We are to build an unsupervised machine learning-based language model which can estimate the amount of information that are in need to process words consisting of subword-level morphemes and syllables. We are then to investigate whether the reading times of words reflecting their morphemic and syllabic structures are predicted by an information-theoretic measure such as surprisal. Specifically, the proposed Morfessor-based unsupervised machine learning model is first to be trained on the large dataset of sentences on Sejong Corpus and is then to be applied to estimate the information-theoretic measure on each word in the test data of Korean words. The reading times of the words in the test data are to be recruited from Korean Lexicon Project (KLP) Database. A comparison between the information-theoretic measures of the words in point and the corresponding reading times by using a linear mixed effect model reveals a reliable correlation between surprisal and reading time. We conclude that surprisal is positively related to the processing effort (i.e. reading time), confirming the surprisal hypothesis.

Hydrometeorological Drivers of Particulate Matter Using Satellite and Reanalysis Data (인공위성 및 재분석 자료를 이용한 미세먼지 농도와 수문기상인자의 상관성 분석)

  • Lee, Seul Chan;Jeong, Jae Hwan;Choi, Min Ha
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.100-100
    • /
    • 2019
  • 최근 대기 중 미세먼지의 농도가 높은 일수가 급증하면서, 미세먼지를 저감하고자 하는 연구가 활발히 이루어지고 있다. 미세먼지는 주로 자동차 혹은 공장 등 인간 활동에 의한 오염물질 배출에 의해 발생하는 것으로 알려져 있으며, 태양복사에너지, 토양수분, 강우, 풍속 등의 수문기상학적 인자에 의해 발생, 이동, 소멸의 과정을 거친다. 현재 우리나라에서는 미세먼지 농도를 관측하기 위해 지점 기반의 관측소를 운영하고 있으며, 관측소가 위치하지 않은 지역의 미세먼지 농도는 선형 보간법 등을 활용한 내삽 기법을 통해 제공하고 있다. 그러나 미세먼지 농도는 다양한 수문기상인자들의 영향에 의한 차이가 크게 나타나기 때문에 지점 기반의 자료로는 해당 지역의 미세먼지 농도를 추정하는 데 어려움이 많다. 본 연구에서는 미세먼지의 공간적인 분포를 추정하고자 MODerate resolution Imaging Spectroradiometer (MODIS) 에어로졸 자료와 Global Land Data Assimilation System (GLDAS) 수문기상인자를 활용하여 미세먼지 농도에 영향을 주는 것으로 판단되는 다양한 수문기상인자들과의 상관성을 분석하였다. 미세먼지와 각 인자간의 상관성을 분석하여 높은 상관성을 갖는 수문기상인자들을 도출하고 최적의 선형회귀분석 모델을 구축하기 위해 베이지안 모델 평균(Bayesian Model Averaging, BMA)을 사용하였으며, 지점 데이터와의 비교를 통해 활용성을 검증하였다. 전체적으로 수문기상인자를 사용한 선형회귀분석 결과에서는 미세먼지농도 변화의 경향을 반영하고 있는 것을 확인할 수 있었으나, 계절별, 지역별 등 대기 특성을 고려하지 않아 각 기간의 급격한 농도 변화를 감지하기에 어려움이 있었다. 이러한 연구를 바탕으로 수문기상인자와 미세먼지 농도의 패턴이 더욱 정확히 분석된다면, 미세먼지 농도 모니터링과 정확한 예보 시스템의 구축에 효과적으로 활용 될 것으로 기대된다.

  • PDF

Proposal of Models to Estimate the Coefficient of Permeability of Soils on the Natural Terrain considering Geological Conditions (지질조건에 따른 자연사면 토층의 투수계수 산정모델 제안)

  • Jun, Duk-Chan;Song, Young-Suk;Han, Shin-In
    • The Journal of Engineering Geology
    • /
    • v.20 no.1
    • /
    • pp.35-45
    • /
    • 2010
  • The soil tests have been performed on the specimens obtained from about 1,150 sites including landslides and non-landslides areas in natural terrains for last 10 years. Based on the results of those tests, the average soil properties are estimated and the simple equations for estimating permeability are proposed according to geologic conditions. The average permeability in Granite and Mudstone sites is higher than other sites and the content of silt and clay in Mudstone and Gneiss sites is higher than other sites. The correlation analysis and the regression analysis were performed to estimate the coefficient of permeability according to geological conditions. As the result of the correlation analysis, the coefficient of permeability is selected as a dependent variable, and the silt and clay contents, the water contents and the dry unit weights are selected as independent variables. As the result of the regression analysis, the silt and clay contents and the void ratio were involved commonly in the linear regression equations according to geological conditions. To verify the proposed the linear regression equations, the measured result of the coefficient of permeability at other sites was compared with the result predicted with the proposed equations. As the result of comparison, there were a little bit different between them for some data. However the difference was relatively small. Therefore, the linear regression equations for estimating the coefficient of permeability according to geological conditions may be applied to Korean soils. However, these equations should be verified and corrected continuously to improve the accuracy.

A study on Prediction of Simulator Sickness in Driving Simulation (자동차 모의운전환경에서 Simulator Sickness의 예측에 관한 연구)

  • 김도희
    • Proceedings of the Korea Society for Simulation Conference
    • /
    • 1998.10a
    • /
    • pp.170-173
    • /
    • 1998
  • 본 연구에서는 시뮬레이터나 그와 유사한 가상현실환경(Virtual Reality Environment ; VRE)에서 일어날 수 있는 Simulator Sickness가 어떤 사람들에게 쉽게 발생하는지를 예측하기 위하여 다중선형회귀(Multiple linear regression) 방정식으로 예측회귀모형을 제시하였다. 이 회귀모형에서의 종속변수는 김도희 외(1998)에 의해 개발된 RSSQ의 종합점수이고, 독립변수는 실제운전경력에 1을 더한 값에 나이를 곱한 값, 과거 멀미를 경험한 정도, 1주일 평균 동화상 시간, 현재의 건강상태로 되어져 있다. 이 회귀모형의 R2값은 약 0.52로 Kolasinski(1996)의 모델보다 설명력이 18% 증가하였고, 부수적인 별도의 실험을 하지 않고도 간단한 개인 신상에 관한 간단한 자료만으로도 훨씬 좋은 결과를 예측할 수 있게 되었다. 따라서 시뮬레이터나 가상현실에서 일어나는 Simulator Sickness가 어떠한 사람에게 걸리기가 쉬운지를 쉽게 예측할 수 있게 되었고, 이러한 사람들에게는 시뮬레이터나 가상현실의 이용을 자제시키거나 주의를 주어 특별관리 함으로써 시뮬레이터나 가상현실을 운영하는데 많은 도움을 줄 수 있을 것이다.

  • PDF

Analysis of Empirical Multiple Linear Regression Models for the Production of PM2.5 Concentrations (PM2.5농도 산출을 위한 경험적 다중선형 모델 분석)

  • Choo, Gyo-Hwang;Lee, Kyu-Tae;Jeong, Myeong-Jae
    • Journal of the Korean earth science society
    • /
    • v.38 no.4
    • /
    • pp.283-292
    • /
    • 2017
  • In this study, the empirical models were established to estimate the concentrations of surface-level $PM_{2.5}$ over Seoul, Korea from 1 January 2012 to 31 December 2013. We used six different multiple linear regression models with aerosol optical thickness (AOT), ${\AA}ngstr{\ddot{o}}m$ exponents (AE) data from Moderate Resolution Imaging Spectroradiometer (MODIS) aboard Terra and Aqua satellites, meteorological data, and planetary boundary layer depth (PBLD) data. The results showed that $M_6$ was the best empirical model and AOT, AE, relative humidity (RH), wind speed, wind direction, PBLD, and air temperature data were used as input data. Statistical analysis showed that the result between the observed $PM_{2.5}$ and the estimated $PM_{2.5}$ concentrations using $M_6$ model were correlations (R=0.62) and root square mean error ($RMSE=10.70{\mu}gm^{-3}$). In addition, our study show that the relation strongly depends on the seasons due to seasonal observation characteristics of AOT, with a relatively better correlation in spring (R=0.66) and autumntime (R=0.75) than summer and wintertime (R was about 0.38 and 0.56). These results were due to cloud contamination of summertime and the influence of snow/ice surface of wintertime, compared with those of other seasons. Therefore, the empirical multiple linear regression model used in this study showed that the AOT data retrieved from the satellite was important a dominant variable and we will need to use additional weather variables to improve the results of $PM_{2.5}$. Also, the result calculated for $PM_{2.5}$ using empirical multi linear regression model will be useful as a method to enable monitoring of atmospheric environment from satellite and ground meteorological data.

A Comparative Study on Game-Score Prediction Models Using Compuational Thinking Education Game Data (컴퓨팅 사고 교육 게임 데이터를 사용한 게임 점수 예측 모델 성능 비교 연구)

  • Yang, Yeongwook
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.11
    • /
    • pp.529-534
    • /
    • 2021
  • Computing thinking is regarded as one of the important skills required in the 21st century, and many countries have introduced and implemented computing thinking training courses. Among computational thinking education methods, educational game-based methods increase student participation and motivation, and increase access to computational thinking. Autothinking is an educational game developed for the purpose of providing computational thinking education to learners. It is an adaptive system that dynamically provides feedback to learners and automatically adjusts the difficulty according to the learner's computational thinking ability. However, because the game was designed based on rules, it cannot intelligently consider the computational thinking of learners or give feedback. In this study, game data collected through Autothikning is introduced, and game score prediction that reflects computational thinking is performed in order to increase the adaptability of the game by using it. To solve this problem, a comparative study was conducted on linear regression, decision tree, random forest, and support vector machine algorithms, which are most commonly used in regression problems. As a result of the study, the linear regression method showed the best performance in predicting game scores.

Ultimate Resisting Capacity of RC Columns Considering P-$\Delta$ Effect (P-$\Delta$ 효과를 고려한 RC 기둥의 극한저항력 산정)

  • 곽효경;김진국;김한수
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.15 no.1
    • /
    • pp.105-116
    • /
    • 2002
  • In this paper, an analytical model to predict the resisting capacity of slender RC columns is introduced. Material and geometric nonlinearities are taken into account, and the layer approach is adopted to simulate the different material properties across the sectional depth. On the basis of the obtained numerical analysis results, an improved design equation as a function of concrete strength, slenderness ratio, steel ratio and eccentricity for slender RC columns, which can be used effectively in the preliminary design stage, is introduced. Finally, P-M interaction diagrams constructed by the introduced equation are compared with the ACI method with the objective of establishing the relative efficiencies of the introduced equation.

A study on the forecast of container traffic using hybrid ARIMA-neural network model (하이브리드 ARIMA-신경망 모델을 통한 항만물동량 예측에 관한 연구)

  • Shin, Chang-Hoon;Kang, Jeong-Sick;Park, Soo-Nam;Lee, Ji-Hoon
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2007.12a
    • /
    • pp.259-260
    • /
    • 2007
  • The forecast of a container traffic has been very important for port plan and development Generally, statistic methods, such as regression analysis, ARIMA, have been much used for traffic forecasting. Recent research activities in forecasting with artificial neural networks(ANNs) suggest tint ANNs am be a promising alternative to the traditional linear methods. In this paper, a hybrid methodology that combines both ARIMA and ANN models is proposed to take advantage of the unique strength of ARIMA and ANN models in linear and nonlinear modeling. The results with port traffic data indicate tint effectiveness can differ according to the ch1racteristics of ports.

  • PDF