• Title/Summary/Keyword: tree age

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems / v.22 no.3 / pp.143-163 / 2016
  • The demographics of Internet users are the most basic and important inputs for target marketing and personalized advertising on digital marketing channels such as email, mobile, and social media. However, it has gradually become difficult to collect these demographics because users' activities are anonymous in many cases. Although a marketing department can obtain demographics through online or offline surveys, these approaches are expensive, slow, and likely to include false statements. Clickstream data is the record an Internet user leaves behind while visiting websites. As the user clicks anywhere on a webpage, the activity is logged in semi-structured website log files. Such data shows which pages users visited, how long they stayed, how often and when they usually visited, which sites they prefer, what keywords they used to find a site, whether they purchased anything, and so forth. For this reason, some researchers have tried to infer the demographics of Internet users from their clickstream data. They derived various independent variables likely to be correlated with the demographics, including search keywords, frequency and intensity by time, day, and month, variety of websites visited, and text information from the web pages visited. The demographic attributes to predict also vary from paper to paper and cover gender, age, job, location, income, education, marital status, and presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, have been used to build prediction models. However, this research has not yet identified which data mining method is appropriate for predicting each demographic variable. Moreover, the independent variables studied so far need to be reviewed, combined as needed, and evaluated in order to build the best prediction model.
The objective of this study is to choose the clickstream attributes most likely to be correlated with the demographics, based on the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. Drawing on previous research, 64 clickstream attributes are used to predict the demographic attributes. The overall process of predictive model building is composed of four steps. In the first step, we create user profiles that include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction of the clickstream variables to address the curse of dimensionality and the overfitting problem. We use three approaches, based on decision tree, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable using SVM, neural network, and logistic regression. The last step evaluates the alternative models in terms of accuracy and selects the best model. For the experiments, we used clickstream data covering 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for the prediction process, and 5-fold cross validation was conducted to enhance the reliability of the experiments. The experimental results verify that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best with decision tree based dimension reduction and a neural network, whereas the prediction of gender and marital status is most accurate when SVM is applied without dimension reduction.
We conclude that the online behaviors of Internet users, captured through clickstream data analysis, can be used effectively to predict their demographics and thus applied to digital marketing.
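The 5-fold cross validation mentioned in this abstract can be illustrated with a minimal sketch. The authors used IBM SPSS Modeler, so the code below is not their procedure; the function names, the shuffling seed, and the round-robin fold assignment are assumptions made only for illustration. It shows how 5,000 user profiles might be split so that each profile is held out exactly once.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Split range(n_samples) into k shuffled, near-equal folds (hypothetical helper)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at position i.
    return [indices[i::k] for i in range(k)]


def train_test_splits(n_samples, k=5, seed=0):
    """Yield one (train, test) pair of index lists per fold."""
    folds = k_fold_indices(n_samples, k, seed)
    for i, test in enumerate(folds):
        # Training set is the union of all folds except the held-out one.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# 5,000 users as in the abstract; each user lands in exactly one test fold.
splits = list(train_test_splits(5000, k=5))
```

Each candidate model would then be fitted on `train` and scored on `test` for every pair, and the five accuracy values averaged before models are compared.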

Estimation of Productivity for Quercus variabilis Stand by Forest Environmental Factors (삼림환경인자(森林環境因子)에 의한 굴참나무임분(林分)의 생산력추정(生産力推定))

  • Lee, Dong Sup;Chung, Young Gwan
    • Journal of Korean Society of Forest Science / v.75 no.1 / pp.1-18 / 1986
  • This study was initiated to estimate the productivity of Quercus variabilis stands. Its practical objective was to provide information for establishing a basis for selecting suitable sites for Quercus variabilis. Productivity, measured in terms of DBH, height, basal area, and stem volume, was hypothesized in each case to be a function of a group of factors. This study considered 32 factors, 20 of which were forest environmental factors such as tree age, latitude, and percent slope, and the rest of which were soil factors such as soil moisture, total nitrogen, and available $P_2O_5$. The data on the 4 productivity measurements of Quercus variabilis growth and the related factors were collected from 99 sample plots in Kyeongbook and Chungbook provinces. Some of the factors were discrete variables by nature and the others continuous variables. Each factor was classified into 3 or 4 categories, and the total number of categories eventually amounted to 110. Each category was then treated as an independent variable; that is, each was treated as a dummy variable and assigned a value of 1 or 0. However, the first category of each factor was deleted from the normal equation for statistical reasons. First, each of the 4 productivity measurements of Quercus variabilis growth was regressed on those 110 categories. Second, the partial correlation coefficients were measured between each of the 4 productivity measurements and the 32 individual factors. Finally, the relative scores were estimated in order to derive the category ranges. The results of these statistical analyses can be summarized as follows: 1) Growth measured in terms of height seems to be a more significant criterion for estimating the productivity of Quercus variabilis.
2) Productivity of forest on stocked land may be better estimated in terms of forest environmental factors, whereas that of unstocked land may be estimated in terms of physico-chemical factors of soil. 3) The factors that have a strongly positive relation to all growth factors of the tree are age group, effective soil depth, soil moisture, etc. This implies that these factors might be used effectively as criteria for selecting suitable sites for Quercus variabilis. 4) Parent rock, latitude, total nitrogen, age group, effective soil depth, soil moisture, organic matter, etc., had more significant category ranges for tree growth. Therefore, suitable sites for Quercus variabilis may be selected based on this information. In conclusion, the above results obtained by multivariable analysis can serve not only as important criteria for estimating the growth of Quercus variabilis but also as useful guidance for selecting suitable sites and performing the rational of Quercus variabilis forest.
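The dummy-variable coding described in this abstract (each category assigned 1 or 0, with the first category of each factor dropped) can be sketched as follows. The factor name and its three category labels are hypothetical, since the abstract does not list the actual class boundaries.

```python
def dummy_code(value, categories):
    """Return 0/1 indicators for categories[1:]; categories[0] is the dropped baseline."""
    if value not in categories:
        raise ValueError(f"unknown category: {value!r}")
    return [1 if value == category else 0 for category in categories[1:]]


# Hypothetical three-class factor; the real study classified each factor into 3 or 4 classes.
SLOPE_CLASSES = ["gentle", "moderate", "steep"]

# A plot in the baseline class is encoded as all zeros.
encoded = [dummy_code(v, SLOPE_CLASSES) for v in ["gentle", "steep", "moderate"]]
# encoded == [[0, 0], [0, 1], [1, 0]]
```

Dropping one category per factor avoids perfect collinearity with the intercept. If exactly one category is dropped for each of the 32 factors, 110 - 32 = 78 indicator columns would remain in the regression.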

Influences of Environmental Gradients on the Patterns of Vegetation Structure and Tree Age Distribution in the East Side of Cascade Range, Washington, USA (워싱턴주(州) 케스케이드산맥(山脈) 동(東)쪽 산림(山林)에서 환경구배(環境勾配)가 식생구조(植生構造)와 연령분포(年齡分布)에 미치는 영향(影響))

  • Woo, Su Young;Lee, Kyung Joon;Lee, Sang Don
    • Journal of Korean Society of Forest Science / v.85 no.1 / pp.107-119 / 1996
  • To understand vegetation changes along environmental gradients in the natural forests on the east side of the Cascade Range in Washington state, USA, line transects were used to sample six different forest environments in the Wenatchee National Forest, on north-facing and south-facing sites at 975, 1280, and 1700 m elevation. Data were analyzed using ordination by detrended correspondence analysis. Pseudotsuga menziesii was found to be one of the dominant species on all six sites regardless of elevation or aspect, while Pinus ponderosa was dominant on south slopes only. Abies grandis and A. lasiocarpa were dominant species on north slopes at elevations of 1280 and 1700 m, respectively. Moisture, as it related to aspect, was identified as one of the most important environmental gradients for explaining the variation in vegetation types. On north-facing slopes, where moisture was less limiting than on south-facing slopes and canopies could grow denser, elevation or competitive interaction was probably more important. Species diversity tended to decrease with increasing environmental severity, with south slopes having less diversity than north slopes due to extended water stress and harsher temperature extremes. The age structure differed between north-facing and south-facing slopes, as did light intensity, moisture, and climate. Large-scale disturbances (e.g., big fires or insects) were major causes of changes in age structure. Younger trees showed a closer relationship between size and age than adult trees. DBH values of shade-intolerant species on south-facing slopes were larger than those on north-facing slopes, which suggests that the aspect of a stand is the most important factor for age and size.

Detection of Site Environment and Estimation of Stand Yield in Mixed Forests Using National Forest Inventory (국가산림자원조사를 이용한 혼효림의 입지환경 탐색 및 임분수확량 추정)

  • Seongyeop Jeong;Jongsu Yim;Sunjung Lee;Jungeun Song;Hyokeun Park;JungBin Lee;Kyujin Yeom;Yeongmo Son
    • Journal of Korean Society of Forest Science / v.112 no.1 / pp.83-92 / 2023
  • This study was conducted to investigate the site environment of mixed forests in Korea and to estimate the growth and yield of stands using national forest inventory data. The growth of mixed forests was derived by applying the Chapman-Richards model to diameter at breast height (DBH), height, and basal area at breast height (BA), and the yield of mixed forests was derived by applying stepwise regression analysis with factors such as basal area at breast height, site index (SI), age, and standing tree density per ha. Mixed forests were found to be growing in various locations. By climate zone, more than half of them were distributed in the temperate central region. By altitude, about 62% were distributed at 101-400 m. The fitness indexes (FI) of the growth models for mixed forests, with stand age as the independent variable, were 0.32 for DBH, 0.22 for height, and 0.18 for basal area at breast height, which were somewhat low. However, considering the graphs and residuals between the estimated and measured values, the use of these estimation models is not expected to cause any particular problems. The yield prediction model of mixed forests was derived as follows: Stand volume = $-162.6859 + 6.3434 \cdot BA + 9.9214 \cdot SI + 0.7271 \cdot Age$, in which basal area at breast height (BA), site index (SI), and age were entered stepwise from among several growth factors; the coefficient of determination ($R^2$) of the equation was about 96%. Using our optimal growth and yield prediction models, a provisional stand yield table was created. This table for mixed forests was also used to derive the rotation age of highest volume production.
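The yield equation reported in this abstract can be evaluated directly, as in the sketch below. The coefficients are taken from the abstract; the input units are not stated there, so the example values (and the assumption that BA is per-hectare basal area and age is in years) are illustrative only. The Chapman-Richards function is given in its common three-parameter form, and its parameters `a`, `b`, `c` are hypothetical because the abstract does not report fitted values.

```python
import math


def stand_volume(ba, si, age):
    """Stand volume from the abstract's regression: basal area, site index, stand age."""
    return -162.6859 + 6.3434 * ba + 9.9214 * si + 0.7271 * age


def chapman_richards(age, a, b, c):
    """Common Chapman-Richards growth form: a * (1 - exp(-b * age)) ** c.

    a is the asymptote, b the rate, c the shape; all three are hypothetical here.
    """
    return a * (1.0 - math.exp(-b * age)) ** c


# Illustrative inputs only (not from the paper): BA = 30, SI = 12, age = 40.
example_volume = stand_volume(30, 12, 40)
```

Tabulating `stand_volume` over a grid of ages and site indexes is one way a stand yield table of the kind described could be laid out.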

Injury Responses of Landscape Woody Plants to Air Pollutants - Malondialdehyde content - (조경수목(造景樹木)의 대기오염물질(大氣汚染物質)에 대한 피해반응(被害反應)(III) - Malondialdehyde 함량(含量)을 중심으로 -)

  • Kim, Myung Hee;Lee, Soo Wook
    • Journal of Korean Society of Forest Science / v.83 no.1 / pp.25-31 / 1994
  • This study was conducted to investigate the sensitivity of woody plants growing in urban and industrial regions of Seoul and Taejon, Korea. Malondialdehyde (MDA) contents were analyzed in the foliage of Pinus densiflora, Pinus koraiensis, Ginkgo biloba, Liriodendron tulipifera, and Platanus occidentalis. In addition, MDA contents were analyzed in the foliage of tree seedlings, i.e., Pinus densiflora, Pinus koraiensis, Ginkgo biloba, and Liriodendron tulipifera, fumigated with $SO_2$ in a gas chamber 4 hours a day for six days. MDA contents of leaves of Ginkgo biloba, Platanus occidentalis, and Liriodendron tulipifera in polluted regions were higher than those in the unpolluted region, and among them Liriodendron tulipifera had the highest. MDA contents of Pinus densiflora and Pinus koraiensis increased with needle age. MDA contents of Liriodendron tulipifera seedlings increased with higher concentrations of $SO_2$, but MDA contents in the other seedlings showed no change with $SO_2$ treatment concentration. MDA contents in all species increased with the number of exposure days. In particular, Liriodendron tulipifera had a higher MDA content than the other species. In Liriodendron tulipifera, MDA production increased with exposure time until the fourth day and decreased thereafter.

A Convergence Study in the Severity-adjusted Mortality Ratio on inpatients with multiple chronic conditions (복합만성질환 입원환자의 중증도 보정 사망비에 대한 융복합 연구)

  • Seo, Young-Suk;Kang, Sung-Hong
    • Journal of Digital Convergence / v.13 no.12 / pp.245-257 / 2015
  • This study aimed to develop a predictive model for severity-adjusted mortality of inpatients with multiple chronic conditions and to analyze the factors behind the variation in the hospital standardized mortality ratio (HSMR), in order to propose a plan to reduce that variation. We collected data from the "Korean National Hospital Discharge In-depth Injury Survey" for 2008 to 2010 and selected a final study population of 110,700 patients who had a chronic disease as the principal diagnosis and who were over the age of 30 with more than 2 chronic diseases, including the principal diagnosis. We designed severity-adjusted mortality predictive models using data mining methods (logistic regression analysis, decision tree, and neural network). In this study, we used the decision tree model with the Elixhauser comorbidity index to predict the severity-adjusted mortality ratio. The HSMR of inpatients with multiple chronic conditions differed significantly by insurance type, number of hospital beds, and hospital location. Based on these results, methods should be found to manage the mortality ratio of inpatients with multiple chronic conditions efficiently at the national level. Efforts should therefore be made to improve the quality of medical treatment for inpatients with multiple chronic diseases and to reduce growing medical expenses.

Prediction of Carcass Yield by Ultrasound in Hanwoo (초음파 측정에 의한 한우의 도체육량 예측)

  • Rhee, Y. J.;Jeon, K. J.;Choi, S. B.;Seok, H. K.;Kim, S. J.;Lee, S. K.;Song, Y. H.
    • Journal of Animal Science and Technology / v.45 no.2 / pp.335-342 / 2003
  • This study was conducted to predict carcass yield traits using ultrasound before slaughter and to enhance the prediction accuracy of carcass yield grade by applying various strategies. For this experiment, 573 Hanwoo steers of 24 months of age were used. The differences between ultrasound results and carcass measures of BFT and LMA were $0.6 \pm 1.65$ mm and $0.7 \pm 5.56$ cm$^2$, respectively. The correlation coefficients between ultrasound results and carcass measures of BFT and LMA were 0.86 and 0.82, respectively (p<0.001). The yield grade prediction accuracies of four methods (the Korean yield grade index equation, fat depth alone, regression, and decision tree) were 80.3%, 81.3%, 80.1%, and 81.8%, respectively. We conclude that the decision tree method can easily predict yield grade and is also useful for increasing the prediction accuracy rate.

A Study on the Well-Dying Recognition and Decision of Death before and after Education Among University Students (대학생들의 죽음 교육 전과 후의 웰다잉 인식과 결정에 관한 연구)

  • Song, Hyeon-Dong;Ahn, Sang-Yoon;Kim, Yong-Ha;Hwang, Hye-Jeong;Lee, Seo-Hui;Kim, Kwang-Hwan
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.1 / pp.300-310 / 2018
  • The purpose of this study is to compare university students' Well-Dying awareness and decisions before and after taking a course in death studies. A questionnaire survey was conducted with university students, 93 before the course and 117 after it, who participated in death studies lectures in Daejeon Metropolitan City for 15 weeks from August to December 2016. The general characteristics surveyed were gender, age, grade, major, marital status, religion, family members living together, and health status. Four items on the perception of death, five items on the acceptance of death, seven items on death decisions, and twelve items on interest in and the importance of death education were configured as a reference scale. The statistical methods used were the chi-square test, the independent sample t-test, and decision tree analysis. Based on the decision tree, at the time of preparation for death (cancer patients, terminal patients, etc.) and for the elderly (65 years old or older), the education transition rate was 66.7%. After education, however, 65.3% of the respondents were in adult, middle and high school, under elementary school, university, and graduate school, which showed a significant difference. We therefore seek to establish the effectiveness of death education and to set directions for its period and content. The negative viewpoints and worries about implementing death education at elementary, middle, and high schools and universities are resolved, and death education will positively affect changes in students' attitudes.

Developing Dynamic DBH Growth Prediction Model by Thinning Intensity and Cycle - Based on Yield Table Data - (간벌강도 및 주기에 따른 동적 흉고직경 생장예측 모형개발 - 기존 수확표 자료를 기반으로 -)

  • Kim, Moonil;Lee, Woo-Kyun;Park, Taejin;Kwak, Hanbin;Byun, Jungyeon;Nam, Kijun;Lee, Kyung-Hak;Son, Yung-Mo;Won, Hyung-Kyu;Lee, Sang-Min
    • Journal of Korean Society of Forest Science / v.101 no.2 / pp.266-278 / 2012
  • The objective of this study was to develop a dynamic stand growth model that predicts diameter at breast height (DBH) growth by thinning intensity and cycle for major tree species of South Korea. The yield table constructed by the Korea Forest Service, a static stand growth model, was used to prepare dynamic stand growth models for 8 tree species. In the process of model development, the thinning type was set to thinning from below, and equations for predicting the DBH change after thinning at different intensities were generated. In addition, stand density (N/ha), age, and site index were adopted as explanatory variables for the DBH prediction model. Thereafter, using the model, DBH growth under various silvicultural regimes was predicted by integrating these equations across thinning intensities and cycles. The dynamic stand growth model of DBH developed in this study can improve understanding of the effects on forest growth and growing stock when thinning is performed. Furthermore, the results of this study are also applicable to quantitative assessment of carbon storage and sequestration capability.

Invasion of Korean Pine Seedlings Originated from Neighbour Plantations into the Natural Mature Deciduous Broad-leaved Forest in Gwangneung, Korea (광릉 천연활엽수 성숙림에서 주변 인공림으로부터 잣나무 치수의 침입 정착)

  • Kang, Ho Sang;Lim, Jong-Hwan;Chun, Jung Hwa;Lee, Im Kyun;Kim, Young Kul;Lee, Jae Ho
    • Journal of Korean Society of Forest Science / v.96 no.1 / pp.107-114 / 2007
  • The establishment of seedlings inside a natural forest from adjacent artificial forests can be an important factor in forest stand dynamics. This study was conducted to examine the invasion of seedlings of Korean pine (Pinus koraiensis), which is not native to this region, into the natural deciduous broad-leaved forest in Gwangneung, Korea. There was no mother tree in the 1 ha study site, while the number of naturally regenerated P. koraiensis seedlings was 345, and 56% of them were clumped with more than two seedlings at each point. Applying the image segmentation method to an IKONOS satellite image from January 2003, the distances from the center of the 1 ha study site to the nearest mother tree and to the nearest Korean pine plantation were 200 m and 270 m, respectively. The average height and root-collar diameter of the seedlings were 34 cm and 7 mm, respectively, and 207 seedlings (60%) were below 5 years old. The most abundant ranges of soil moisture and LAI (leaf area index) were 16 to 20% and 3.1 to 3.5, respectively. To understand the dynamics and seed dispersal pattern of Korean pine in the Gwangneung natural deciduous broad-leaved forests, additional studies are recommended, including not only long-term monitoring of the growth and mortality of naturally regenerated Korean pine seedlings but also the application of stable isotope analysis and molecular genetic techniques.