• Title/Summary/Keyword: random variable

Search Result 892, Processing Time 0.04 seconds

The Associated Factors of Health Examinations Behaviors among Some Elderly Persons in Urban and Rural Areas (일부 도시·농촌지역 고령자의 건강검진 수진행동에 관련된 요인)

  • Kim, Yong-Ik;Cho, Young-Chae
    • Journal of agricultural medicine and community health
    • /
    • v.29 no.1
    • /
    • pp.1-14
    • /
    • 2004
  • Objectives: We investigated the factors related to health examination behaviors, sociodemographic aspects and lifestyles of elderly persons with different social backgrounds, and compared sexual and regional differences in urban and rural elderlies. Methods: The total study subjects(464 individuals) from urban(236) and rural areas(228), recruited by a stratified cluster random sampling were interviewed and examined about their sociodemographic profiles, daily lifestyles, subjective health status, conditions concerning use of medical resources, hearing acuity, visual acuity and ADL(activity of daily living), and whether they receive health examination or not. For statistical analysis, Chi-square test was used for sexual and regional comparisons among the groups who have been given a health examination and the one who have not. Results: In urban areas, the rate of having underwent health examination was 54.5% in men and 46.9% in women, and in rural areas, it was 59.8% in men and 42.7% in women, showing its higher rate in men than in women in both areas. For regional differences between the group who have taken a health examination and the one who have not, there was a significant difference in terms of age, family pattern, current job, monthly household income, owning a house, drinking status, eating habit, subjective health status, whether they have taken outpatient medical service for the recent 3 months or not, anxiety for the health, and IADL conditions according to whether the community is rural or urban. In multiple regressions, the influential factors on the health examination behaviors were selected such as having their own house, their family doctor, amnesia, urinary incontinence and chronic disease in urban districts. But in rural districts, the variables were selected such as having or not of their family doctor, urinary incontinence, anxiety for the health, educational level, their own house and chronic disease. Conclusions: It is suggested that the approach to the health examination of an older patient requires substantial consideration of highly variable individual sociodemographic characteristics involving regional attributes as well as their daily life styles, subjective health status, status of performing health examination, physical health status and ADL conditions.

  • PDF

Predicting the Goshawk's habitat area using Species Distribution Modeling: Case Study area Chungcheongbuk-do, South Korea (종분포모형을 이용한 참매의 서식지 예측 -충청북도를 대상으로-)

  • Cho, Hae-Jin;Kim, Dal-Ho;Shin, Man-Seok;Kang, Tehan;Lee, Myungwoo
    • Korean Journal of Environment and Ecology
    • /
    • v.29 no.3
    • /
    • pp.333-343
    • /
    • 2015
  • This research aims at identifying the goshawk's possible and replaceable breeding ground by using the MaxEnt prediction model which has so far been insufficiently used in Korea, and providing evidence to expand possible protection areas for the goshawk's breeding for the future. The field research identified 10 goshawk's nests, and 23 appearance points confirmed during the 3rd round of environmental research were used for analysis. 4 geomorphic, 3 environmental, 7 distance, and 9 weather factors were used as model variables. The final environmental variables were selected through non-parametric verification between appearance and non-appearance coordinates identified by random sampling. The final predictive model (MaxEnt) was structured using 10 factors related to breeding ground and 7 factors related to appearance area selected by statistics verification. According to the results of the study, the factor that affected breeding point structure model the most was temperature seasonality, followed by distance from mixforest, density-class on the forest map and relief energy. The factor that affected appearance point structure model the most was temperature seasonality, followed by distance from rivers and ponds, distance from agricultural land and gradient. The nature of the goshawk's breeding environment and habit to breed inside forests were reflected in this modeling that targets breeding points. The northern central area which is about $189.5 km^2$(2.55 %) is expected to be suitable breeding ground. Large cities such as Cheongju and Chungju are located in the southern part of Chungcheongbuk-do whereas the northern part of Chungcheongbuk-do has evenly distributed forests and farmlands, which helps goshawks have a scope of influence and food source to breed. Appearance point modeling predicted an area of $3,071 km^2$(41.38 %) showing a wider ranging habitat than that of the breeding point modeling due to some limitations such as limited moving observation and non-consideration of seasonal changes. When targeting the breeding points, a specific predictive area can be deduced but it is difficult to check the points of nests and it is impossible to reflect the goshawk's behavioral area. On the other hand, when targeting appearance points, a wider ranging area can be covered but it is less accurate compared to predictive breeding point since simple movements and constant use status are not reflected. However, with these results, the goshawk's habitat can be predicted with reasonable accuracy. In particular, it is necessary to apply precise predictive breeding area data based on habitat modeling results when enforcing an environmental evaluation or establishing a development plan.

Risk Assessment of Pine Tree Dieback in Sogwang-Ri, Uljin (울진 소광리 금강소나무 고사발생 특성 분석 및 위험지역 평가)

  • Kim, Eun-Sook;Lee, Bora;Kim, Jaebeom;Cho, Nanghyun;Lim, Jong-Hwan
    • Journal of Korean Society of Forest Science
    • /
    • v.109 no.3
    • /
    • pp.259-270
    • /
    • 2020
  • Extreme weather events, such as heat and drought, have occurred frequently over the past two decades. This has led to continuous reports of cases of forest damage due to physiological stress, not pest damage. In 2014, pine trees were collectively damaged in the forest genetic resources reserve of Sogwang-ri, Uljin, South Korea. An investigation was launched to determine the causes of the dieback, so that a forest management plan could be prepared to deal with the current dieback, and to prevent future damage. This study aimedto 1) understand the topographic and structural characteristics of the area which experienced pine tree dieback, 2) identify the main causes of the dieback, and 3) predict future risk areas through the use of machine-learning techniques. A model for identifying risk areas was developed using 14 explanatory variables, including location, elevation, slope, and age class. When three machine-learning techniques-Decision Tree, Random Forest (RF), and Support Vector Machine (SVM) were applied to the model, RF and SVM showed higher predictability scores, with accuracies over 93%. Our analysis of the variable set showed that the topographical areas most vulnerable to pine dieback were those with high altitudes, high daily solar radiation, and limited water availability. We also found that, when it came to forest stand characteristics, pine trees with high vertical stand densities (5-15 m high) and higher age classes experienced a higher risk of dieback. The RF and SVM models predicted that 9.5% or 115 ha of the Geumgang Pine Forest are at high risk for pine dieback. Our study suggests the need for further investigation into the vulnerable areas of the Geumgang Pine Forest, and also for climate change adaptive forest management steps to protect those areas which remain undamaged.

Machine Learning Based MMS Point Cloud Semantic Segmentation (머신러닝 기반 MMS Point Cloud 의미론적 분할)

  • Bae, Jaegu;Seo, Dongju;Kim, Jinsoo
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_3
    • /
    • pp.939-951
    • /
    • 2022
  • The most important factor in designing autonomous driving systems is to recognize the exact location of the vehicle within the surrounding environment. To date, various sensors and navigation systems have been used for autonomous driving systems; however, all have limitations. Therefore, the need for high-definition (HD) maps that provide high-precision infrastructure information for safe and convenient autonomous driving is increasing. HD maps are drawn using three-dimensional point cloud data acquired through a mobile mapping system (MMS). However, this process requires manual work due to the large numbers of points and drawing layers, increasing the cost and effort associated with HD mapping. The objective of this study was to improve the efficiency of HD mapping by segmenting semantic information in an MMS point cloud into six classes: roads, curbs, sidewalks, medians, lanes, and other elements. Segmentation was performed using various machine learning techniques including random forest (RF), support vector machine (SVM), k-nearest neighbor (KNN), and gradient-boosting machine (GBM), and 11 variables including geometry, color, intensity, and other road design features. MMS point cloud data for a 130-m section of a five-lane road near Minam Station in Busan, were used to evaluate the segmentation models; the average F1 scores of the models were 95.43% for RF, 92.1% for SVM, 91.05% for GBM, and 82.63% for KNN. The RF model showed the best segmentation performance, with F1 scores of 99.3%, 95.5%, 94.5%, 93.5%, and 90.1% for roads, sidewalks, curbs, medians, and lanes, respectively. The variable importance results of the RF model showed high mean decrease accuracy and mean decrease gini for XY dist. and Z dist. variables related to road design, respectively. Thus, variables related to road design contributed significantly to the segmentation of semantic information. The results of this study demonstrate the applicability of segmentation of MMS point cloud data based on machine learning, and will help to reduce the cost and effort associated with HD mapping.

Determinants of Consumer Preference by type of Accommodation: Two Step Cluster Analysis (이단계 군집분석에 의한 농촌관광 편의시설 유형별 소비자 선호 결정요인)

  • Park, Duk-Byeong;Yoon, Yoo-Shik;Lee, Min-Soo
    • Journal of Global Scholars of Marketing Science
    • /
    • v.17 no.3
    • /
    • pp.1-19
    • /
    • 2007
  • 1. Purpose Rural tourism is made by individuals with different characteristics, needs and wants. It is important to have information on the characteristics and preferences of the consumers of the different types of existing rural accommodation. The stud aims to identify the determinants of consumer preference by type of accommodations. 2. Methodology 2.1 Sample Data were collected from 1000 people by telephone survey with three-stage stratified random sampling in seven metropolitan areas in Korea. Respondents were chosen by sampling internal on telephone book published in 2006. We surveyed from four to ten-thirty 0'clock afternoon so as to systematic sampling considering respondents' life cycle. 2.2 Two-step cluster Analysis Our study is accomplished through the use of a two-step cluster method to classify the accommodation in a reduced number of groups, so that each group constitutes a type. This method had been suggested as appropriate in clustering large data sets with mixed attributes. The method is based on a distance measure that enables data with both continuous and categorical attributes to be clustered. This is derived from a probabilistic model in which the distance between two clusters in equivalent to the decrease in log-likelihood function as a result of merging. 2.3 Multinomial Logit Analysis The estimation of a Multionmial Logit model determines the characteristics of tourist who is most likely to opt for each type of accommodation. The Multinomial Logit model constitutes an appropriate framework to explore and explain choice process where the choice set consists of more than two alternatives. Due to its ease and quick estimation of parameters, the Multinomial Logit model has been used for many empirical studies of choice in tourism. 3. Findings The auto-clustering algorithm indicated that a five-cluster solution was the best model, because it minimized the BIC value and the change in them between adjacent numbers of clusters. The accommodation establishments can be classified into five types: Traditional House, Typical Farmhouse, Farmstay house for group Tour, Log Cabin for Family, and Log Cabin for Individuals. Group 1 (Traditional House) includes mainly the large accommodation establishments, i.e. those with ondoll style room providing meals and one shower room on family tourist, of original construction style house. Group 2 (Typical Farmhouse) encompasses accommodation establishments of Ondoll rooms and each bathroom providing meals. It includes, in other words, the tourist accommodations Known as "rural houses." Group 3 (Farmstay House for Group) has accommodation establishments of Ondoll rooms not providing meals and self cooking facilities, large room size over five persons. Group 4 (Log Cabin for Family) includes mainly the popular accommodation establishments, i.e. those with Ondoll style room with on shower room on family tourist, of western styled log house. While the accommodations in this group are not defined as regards type of construction, the group does include all the original Korean style construction, Finally, group 5 (Log Cabin for Individuals)includes those accommodations that are bedroom western styled wooden house with each bathroom. First Multinomial Logit model is estimated including all the explicative variables considered and taking accommodation group 2 as base alternative. The results show that the variables and the estimated values of the parameters for the model giving the probability of each of the five different types of accommodation available in rural tourism village in Korea, according to the socio-economic and trip related characteristics of the individuals. An initial observation of the analysis reveals that none of variables income, the number of journey, distance, and residential style of house is explicative in the choice of rural accommodation. The age and accompany variables are significant for accommodation establishment of group 1. The education and rural residential experience variables are significant for accommodation establishment of groups 4 and 5. The expenditure and marital status variables are significant for accommodation establishment of group 4. The gender and occupation variable are significant for accommodation establishment of group 3. The loyalty variable is significant for accommodation establishment of groups 3 and 4. The study indicates that significant differences exist among the individuals who choose each type of accommodation at a destination. From this investigation is evident that several profiles of tourists can be attracted by a rural destination according to the types of existing accommodations at this destination. Besides, the tourist profiles may be used as the basis for investment policy and promotion for each type of accommodation, making use in each case of the variables that indicate a greater likelihood of influencing the tourist choice of accommodation.

  • PDF

Analyses of the Efficiency in Hospital Management (병원 단위비용 결정요인에 관한 연구)

  • Ro, Kong-Kyun;Lee, Seon
    • Korea Journal of Hospital Management
    • /
    • v.9 no.1
    • /
    • pp.66-94
    • /
    • 2004
  • The objective of this study is to examine how to maximize the efficiency of hospital management by minimizing the unit cost of hospital operation. For this purpose, this paper proposes to develop a model of the profit maximization based on the cost minimization dictum using the statistical tools of arriving at the maximum likelihood values. The preliminary survey data are collected from the annual statistics and their analyses published by Korea Health Industry Development Institute and Korean Hospital Association. The maximum likelihood value statistical analyses are conducted from the information on the cost (function) of each of 36 hospitals selected by the random stratified sampling method according to the size and location (urban or rural) of hospitals. We believe that, although the size of sample is relatively small, because of the sampling method used and the high response rate, the power of estimation of the results of the statistical analyses of the sample hospitals is acceptable. The conceptual framework of analyses is adopted from the various models of the determinants of hospital costs used by the previous studies. According to this framework, the study postulates that the unit cost of hospital operation is determined by the size, scope of service, technology (production function) as measured by capacity utilization, labor capital ratio and labor input-mix variables, and by exogeneous variables. The variables to represent the above cost determinants are selected by using the step-wise regression so that only the statistically significant variables may be utilized in analyzing how these variables impact on the hospital unit cost. The results of the analyses show that the models of hospital cost determinants adopted are well chosen. The various models analyzed have the (goodness of fit) overall determination (R2) which all turned out to be significant, regardless of the variables put in to represent the cost determinants. Specifically, the size and scope of service, no matter how it is measured, i. e., number of admissions per bed, number of ambulatory visits per bed, adjusted inpatient days and adjusted outpatients, have overall effects of reducing the hospital unit costs as measured by the cost per admission, per inpatient day, or office visit implying the existence of the economy of scale in the hospital operation. Thirdly, the technology used in operating a hospital has turned out to have its ramifications on the hospital unit cost similar to those postulated in the static theory of the firm. For example, the capacity utilization as represented by the inpatient days per employee tuned out to have statistically significant negative impacts on the unit cost of hospital operation, while payroll expenses per inpatient cost has a positive effect. The input-mix of hospital operation, as represented by the ratio of the number of doctor, nurse or medical staff per general employee, supports the known thesis that the specialized manpower costs more than the general employees. The labor/capital ratio as represented by the employees per 100 beds is shown to have a positive effect on the cost as expected. As for the exogeneous variable's impacts on the cost, when this variable is represented by the percent of urban 100 population at the location where the hospital is located, the regression analysis shows that the hospitals located in the urban area have a higher cost than those in the rural area. Finally, the case study of the sample hospitals offers a specific information to hospital administrators about how they share in terms of the cost they are incurring in comparison to other hospitals. For example, if his/her hospital is of small size and located in a city, he/she can compare the various costs of his/her hospital operation with those of other similar hospitals. Therefore, he/she may be able to find the reasons why the cost of his/her hospital operation has a higher or lower cost than other similar hospitals in what factors of the hospital cost determinants.

  • PDF

The development of anti-DR4 single-chain Fv (ScFv) antibody fused to Escherichia coli alkaline phosphatase (대장균의 alkaline phosphatase가 융합된 anti-DR4 single-chain Fv (ScFv) 항체의 개발)

  • Han, Seung Hee;Kim, Jin-Kyoo
    • Korean Journal of Microbiology
    • /
    • v.52 no.1
    • /
    • pp.10-17
    • /
    • 2016
  • Enzyme immunoassay to analyze specific binding activity of antibody to antigen uses horseradish peroxidase (HRP) or alkaline phosphatase (AP). Chemical methods are usually used for coupling of these enzymes to antibody, which is complicated and random cross-linking process. As results, it causes decreases or loss of functional activity of either antibody or enzyme. In addition, most enzyme assays use secondary antibody to detect antigen binding activity of primary antibody. Enzymes coupled to secondary antibody provide a binding signal by substrate-based color development, suggesting secondary antibody is required in enzyme immunoassay. Additional incubation time for binding of secondary antibody should also be necessary. More importantly, non-specific binding activity caused by secondary antibody should also be eliminated. In this study, we cloned AP isolated from Escherichia coli (E. coli) chromosome by PCR and fused to) hAY4 single-chain variable domain fragment (ScFv) specific to death receptor (DR4) which is a receptor for tumor necrosis factor ${\alpha}$ related apoptosis induced ligand (TRAIL). hAY4 ScFv-AP expressed in E. coli showed 73.8 kDa as a monomer in SDS-PAGE. However, this fusion protein shown in size-exclusion chromatography (SEC) exhibited 147.6 kDa as a dimer confirming that natural dimerization of AP by non-covalent association induced ScFv-AP dimerization. In several immunoassay such as ELISA, Western blot and immunocytochemistry, it showed antigen binding activity by color development of substrates catalyzed by AP directly fused to primary hAY4 ScFv without secondary antibody. In summary, hAY4 ScFv-AP fusion protein was successfully purified as a soluble dimeric form in E. coli and showed antigen binding activity in several immunoassays without addition of secondary antibody which sometimes causes time-consuming, expensive and non-specific false binding.

Relation of Social Security Network, Community Unity and Local Government Trust (지역사회 사회안전망구축과 지역사회결속 및 지방자치단체 신뢰의 관계)

  • Kim, Yeong-Nam;Kim, Chan-Sun
    • Korean Security Journal
    • /
    • no.42
    • /
    • pp.7-36
    • /
    • 2015
  • This study aims at analyzing difference of social Security network, Community unity and local government trust according to socio-demographical features, exploring the relation of social Security network, Community unity and local government trust according to socio-demographical features, presenting results between each variable as a model and verifying the property of mutual ones. This study sampled general citizens in Gwangju for about 15 days Aug. 15 through Aug. 30, 2014, distributed total 450 copies using cluster random sampling, gathered 438 persons, 412 persons of whom were used for analysis. This study verified the validity and credibility of the questionnaire through an experts' meeting, preliminary test, factor analysis and credibility analysis. The credibility of questionnaire was ${\alpha}=.809{\sim}{\alpha}=.890$. The inout data were analyzed by study purpose using SPSSWIN 18.0, as statistical techniques, factor analysis, credibility analysis, correlation analysis, independent sample t verification, ANOVA, multi-regression analysis, path analysis etc. were used. the findings obtained through the above study methods are as follows. First, building a social Security network has an effect on Community institution. That is, the more activated a, the higher awareness on institution. the more activated street CCTV facilities, anti-crime design, local government Security education, the higher the stability. Second, building a social Security network has an effect on trust of local government. That is, the activated local autonomous anti-crime activity, anti-crime design. local government's Security education, police public oder service, the more increased trust of policy, service management, busines performance. Third, Community unity has an effect on trust of local government. That is, the better Community institution is achieved, the higher trust of policy. Also the stabler Community institution, the higher trust of business performance. Fourth, building a social Security network has a direct or indirect effect on Community unity and local government trust. That is, social Security network has a direct effect on trust of local government, but it has a higher effect through Community unity of parameter. Such results showed that Community unity in Gwangju Region is an important factor, which means it is an important variable mediating building a social Security network and trust of local government. To win trust of local residents, we need to prepare for various cultural events and active communication space and build a social Security network for uniting them.

  • PDF

Estimation of Annual Trends and Environmental Effects on the Racing Records of Jeju Horses (제주마 주파기록에 대한 연도별 추세 및 환경효과 분석)

  • Lee, Jongan;Lee, Soo Hyun;Lee, Jae-Gu;Kim, Nam-Young;Choi, Jae-Young;Shin, Sang-Min;Choi, Jung-Woo;Cho, In-Cheol;Yang, Byoung-Chul
    • Journal of Life Science
    • /
    • v.31 no.9
    • /
    • pp.840-848
    • /
    • 2021
  • This study was conducted to estimate annual trends and the environmental effects in the racing records of Jeju horses. The Korean Racing Authority (KRA) collected 48,645 observations for 2,167 Jeju horses from 2002 to 2019. Racing records were preprocessed to eliminate errors that occur during the data collection. Racing times were adjusted for comparison between race distances. A stepwise Akaike information criterion (AIC) variable selection method was applied to select appropriate environment variables affecting racing records. The annual improvement of the race time was -0.242 seconds. The model with the lowest AIC value was established when variables were selected in the following order: year, budam classification, jockey ranking, trainer ranking, track condition, weather, age, and gender. The most suitable model was constructed when the jockey ranking and age variables were considered as random effects. Our findings have potential for application as basic data when building models for evaluating genetic abilities of Jeju horses.

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.