• Title/Summary/Keyword: Model Tree

Search Result 1,912, Processing Time 0.025 seconds

Suggestion of Urban Regeneration Type Recommendation System Based on Local Characteristics Using Text Mining (텍스트 마이닝을 활용한 지역 특성 기반 도시재생 유형 추천 시스템 제안)

  • Kim, Ikjun;Lee, Junho;Kim, Hyomin;Kang, Juyoung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.149-169
    • /
    • 2020
  • "The Urban Renewal New Deal project", one of the government's major national projects, is about developing underdeveloped areas by investing 50 trillion won in 100 locations on the first year and 500 over the next four years. This project is drawing keen attention from the media and local governments. However, the project model which fails to reflect the original characteristics of the area as it divides project area into five categories: "Our Neighborhood Restoration, Housing Maintenance Support Type, General Neighborhood Type, Central Urban Type, and Economic Base Type," According to keywords for successful urban regeneration in Korea, "resident participation," "regional specialization," "ministerial cooperation" and "public-private cooperation", when local governments propose urban regeneration projects to the government, they can see that it is most important to accurately understand the characteristics of the city and push ahead with the projects in a way that suits the characteristics of the city with the help of local residents and private companies. In addition, considering the gentrification problem, which is one of the side effects of urban regeneration projects, it is important to select and implement urban regeneration types suitable for the characteristics of the area. In order to supplement the limitations of the 'Urban Regeneration New Deal Project' methodology, this study aims to propose a system that recommends urban regeneration types suitable for urban regeneration sites by utilizing various machine learning algorithms, referring to the urban regeneration types of the '2025 Seoul Metropolitan Government Urban Regeneration Strategy Plan' promoted based on regional characteristics. There are four types of urban regeneration in Seoul: "Low-use Low-Level Development, Abandonment, Deteriorated Housing, and Specialization of Historical and Cultural Resources" (Shon and Park, 2017). In order to identify regional characteristics, approximately 100,000 text data were collected for 22 regions where the project was carried out for a total of four types of urban regeneration. Using the collected data, we drew key keywords for each region according to the type of urban regeneration and conducted topic modeling to explore whether there were differences between types. As a result, it was confirmed that a number of topics related to real estate and economy appeared in old residential areas, and in the case of declining and underdeveloped areas, topics reflecting the characteristics of areas where industrial activities were active in the past appeared. In the case of the historical and cultural resource area, since it is an area that contains traces of the past, many keywords related to the government appeared. Therefore, it was possible to confirm political topics and cultural topics resulting from various events. Finally, in the case of low-use and under-developed areas, many topics on real estate and accessibility are emerging, so accessibility is good. It mainly had the characteristics of a region where development is planned or is likely to be developed. Furthermore, a model was implemented that proposes urban regeneration types tailored to regional characteristics for regions other than Seoul. Machine learning technology was used to implement the model, and training data and test data were randomly extracted at an 8:2 ratio and used. In order to compare the performance between various models, the input variables are set in two ways: Count Vector and TF-IDF Vector, and as Classifier, there are 5 types of SVM (Support Vector Machine), Decision Tree, Random Forest, Logistic Regression, and Gradient Boosting. By applying it, performance comparison for a total of 10 models was conducted. The model with the highest performance was the Gradient Boosting method using TF-IDF Vector input data, and the accuracy was 97%. Therefore, the recommendation system proposed in this study is expected to recommend urban regeneration types based on the regional characteristics of new business sites in the process of carrying out urban regeneration projects."

Spatial Distribution Patterns and Prediction of Hotspot Area for Endangered Herpetofauna Species in Korea (국내 멸종위기양서·파충류의 공간적 분포형태와 주요 분포지역 예측에 대한 연구)

  • Do, Min Seock;Lee, Jin-Won;Jang, Hoan-Jin;Kim, Dae-In;Park, Jinwoo;Yoo, Jeong-Chil
    • Korean Journal of Environment and Ecology
    • /
    • v.31 no.4
    • /
    • pp.381-396
    • /
    • 2017
  • Understanding species distribution plays an important role in conservation as well as evolutionary biology. In this study, we applied a species distribution model to predict hotspot areas and habitat characteristics for endangered herpetofauna species in South Korea: the Korean Crevice Salamander (Karsenia koreana), Suweon-tree frog (Hyla suweonensis), Gold-spotted pond frog (Pelophylax chosenicus), Narrow-mouthed toad (Kaloula borealis), Korean ratsnake (Elaphe schrenckii), Mongolian racerunner (Eremias argus), Reeve's turtle (Mauremys reevesii) and Soft-shelled turtle (Pelodiscus sinensis). The Kori salamander (Hynobius yangi) and Black-headed snake (Sibynophis chinensis) were excluded from the analysis due to insufficient sample size. The results showed that the altitude was the most important environmental variable for their distribution, and the altitude at which these species were distributed correlated with the climate of that region. The predicted distribution area derived from the species distribution modelling adequately reflected the observation site used in this study as well as those reported in preceding studies. The average AUC value of the eigh species was relatively high ($0.845{\pm}0.08$), while the average omission rate value was relatively low ($0.087{\pm}0.01$). Therefore, the species overlaying model created for the endangered species is considered successful. When merging the distribution models, it was shown that five species shared their habitats in the coastal areas of Gyeonggi-do and Chungcheongnam-do, which are the western regions of the Korean Peninsula. Therefore, we suggest that protection should be a high priority in these area, and our overall results may serve as essential and fundamental data for the conservation of endangered amphibian and reptiles in Korea.

Change Detection of land-surface Environment in Gongju Areas Using Spatial Relationships between Land-surface Change and Geo-spatial Information (지표변화와 지리공간정보의 연관성 분석을 통한 공주지역 지표환경 변화 분석)

  • Jang Dong-Ho
    • Journal of the Korean Geographical Society
    • /
    • v.40 no.3 s.108
    • /
    • pp.296-309
    • /
    • 2005
  • In this study, we investigated the change of future land-surface and relationships of land-surface change with geo-spatial information, using a Bayesian prediction model based on a likelihood ratio function, for analysing the land-surface change of the Gongju area. We classified the land-surface satellite images, and then extracted the changing area using a way of post classification comparison. land-surface information related to the land-surface change is constructed in a GIS environment, and the map of land-surface change prediction is made using the likelihood ratio function. As the results of this study, the thematic maps which definitely influence land-surface change of rural or urban areas are elevation, water system, population density, roads, population moving, the number of establishments, land price, etc. Also, thematic maps which definitely influence the land-surface change of forests areas are elevation, slope, population density, population moving, land price, etc. As a result of land-surface change analysis, center proliferation of old and new downtown is composed near Gum-river, and the downtown area will spread around the local roads and interchange areas in the urban area. In case of agricultural areas, a small tributary of Gum-river or an area of local roads which are attached with adjacent areas showed the high probability of change. Most of the forest areas are located in southeast and from this result we can guess why the wide chestnut-tree cultivation complex is located in these areas and the capability of forest damage is very high. As a result of validation using a prediction rate curve, a capability of prediction of urban area is $80\%$, agriculture area is $55\%$, forest area is $40\%$ in higher $10\%$ of possibility which the land-surface change would occur. This integration model is unsatisfactory to Predict the forest area in the study area and thus as a future work, it is necessary to apply new thematic maps or prediction models In conclusion, we can expect that this way can be one of the most essential land-surface change studies in a few years.

A Study on Operation Strategy by Multi-variate Regression of Deagu Arboretum Visitor's Satisfaction (대구수목원 이용객 만족모델을 통한 운영 방안 연구)

  • Kang, Kee-Rae
    • Journal of Korean Society of Forest Science
    • /
    • v.101 no.1
    • /
    • pp.36-45
    • /
    • 2012
  • Education on the environment and plants offered by arboretum for today's people not only contribute to foster a better natural environment in urban region but also provide visitors with decent refreshment environment and beyond. In the study, the author undertook the observation on usage behavior and satisfaction model of arboretum visitors expect and investigated the facilities and programs to be offered by arboretum in order to propose the opinion regarding the service. For observation size of variables in a multiple regression analysis of variables is influencing satisfaction rankings walks the line of flow, the educational effect on the environment, cleanliness of the facility, visits pay, natural beauty, diversity of trees, accessibility and friendliness of staff, expansion of facilities in the arboretum and appeared as a complement. In case of visitor attribute, the residents living near the facility showed the highest visit frequency of more than 5 times, especially as part of taking a walk. This proves that the visit to arboretum is considered as part of everyday life, and thus a new program and walk path as well as movement route are needed to be developed for the visitors. In the question relating to the facilities and operation programs in Daegu Arboretum, particularly the requests by visitors, they responded that the establishment of cultural event, beautiful natural scenery, refreshment and convenience facilities is the most critical issue. In addition, the management on withered trees and bare lands is an urgent issue as well. In this sense, the Operation and Management Strategies based upon the visitor behaviors and model of satisfaction are needed to deal with the adoption of diverse events and festivals joined by local residents, ombudsman program, environmental program development for students and teachers within the region, negligent bare lands and withered tree replacement, and cafeteria facility improvement and supplement as well as the bench marking of other facilities than arboretums located in other regions. These items are thought to be sufficiently dealt with by Daegu Arboretum having no more external resources. It is recognized that the visitor satisfaction begins from a minor thing, and a small difference determines a great satisfaction, and thus the software approach rather than hardware one is in need.

Estimation of the Three-dimensional Vegetation Landscape of the Donhwamun Gate Area in Changdeokgung Palace through the Rubber Sheeting Transformation of (<동궐도(東闕圖)>의 러버쉬팅변환을 통한 창덕궁 돈화문 지역의 입체적 식생 경관 추정)

  • Lee, Jae-Yong
    • Korean Journal of Heritage: History & Science
    • /
    • v.51 no.2
    • /
    • pp.138-153
    • /
    • 2018
  • The purpose of this study was to analyze , which was made in the late Joseon Dynasty to specify the vegetation landscape of the Donhwamun Gate area in Changdeokgung Palace. The study results can be summarized as below. First, based on "Jieziyuan Huazhuan(芥子園畵傳)", the introductory book of tree expression delivered from China in the 17th century, allowed the classification criteria of the trees described in the picture to be established and helped identify their types. As a result of the classification, there were 10 species and 50 trees in the Donhwamun Gate area of . Second, it was possible to measure the real size of the trees described in the picture through the elevated drawing scale of . The height of the trees ranged from a minimum of 4.37 m to a maximum of 22.37 m. According to the measurement results, compared to the old trees currently living in Changdeokgung Palace, the trees described in the picture were found to be produced in almost actual size without exaggeration. Thus, the measured height of the trees turned out to be appropriate as baseline data for reproduction of the vegetation landscape. Third, through the Rubber Sheeting Transformation of , it was possible to make a ground plan for the planting of on the current digital topographic map. In particular, as the transformed area of was departmentalized and control points were added, the precision of transformation improved. It was possible to grasp the changed position of planting as well as the change in planting density through a ground plan of planting of . Lastly, it was possible to produce a three-dimensional vegetation landscape model by using the information of the shape of the trees and the ground plan for the planting of . Based on the three-dimensional model, it was easy to examine the characteristics of the three-dimensional view of the current vegetation via the view axis, skyline, and openness to and cover from the adjacent regions at the level of the eyes. This study is differentiated from others in that it verified the realism of and suggested the possibility of ascertaining the original form of the vegetation landscape described in the painting.

Felling Productivity in Korean Pine Stands by Using Chain Saw (체인톱을 이용한 잣나무의 벌도작업 공정 분석)

  • Han, Won Sung;Cho, Koo Hyun;Oh, Jae-Heun;Song, Tae-Young;Kim, Jae-Won;Shin, Man Yong
    • Journal of Korean Society of Forest Science
    • /
    • v.98 no.4
    • /
    • pp.451-457
    • /
    • 2009
  • This study was conducted to evaluate the felling productivity by chain saw in thinning operation of Korean pine (Pinus koraiensis) stands. Time study data were collected from 4 thinning site in Korean pine stands. This study derived a regression model to estimate the average felling cycle time for evaluating the productivity in felling, which was used to analyze the felling productivity by thinning period. In the study sites, the average felling cycle time per a tree was 463 sec/cycle and the productivity was $2.26m^3/hr$. Thinning period in Korean pine is divided into three groups by producing purposes; small-diameter log, medium-diameter log, and large-diameter log. And analyzed working time and productivity from thinning period fixed by producing purposes. For the small-diameter log producing purpose estimated to be thinning period operated once when the mean DBH was 16 cm and its productivity was $8.94m^3/man{\cdot}day$. For the medium-diameter and large-diameter log producing purposes, thinning period was twice and three times when the mean DBH of the 1st and 2nd thinning period was 16 cm and 21 cm, and its productivity was $9.06m^3/man{\cdot}day$ and $10.86m^3/man{\cdot}day$. The 30 cm in DBH and $15.12m^3/man{\cdot}day$ in productivity was operated 3rd thinning for the large-diameter log producing purposes.

Comparison of Association Rule Learning and Subgroup Discovery for Mining Traffic Accident Data (교통사고 데이터의 마이닝을 위한 연관규칙 학습기법과 서브그룹 발견기법의 비교)

  • Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.4
    • /
    • pp.1-16
    • /
    • 2015
  • Traffic accident is one of the major cause of death worldwide for the last several decades. According to the statistics of world health organization, approximately 1.24 million deaths occurred on the world's roads in 2010. In order to reduce future traffic accident, multipronged approaches have been adopted including traffic regulations, injury-reducing technologies, driving training program and so on. Records on traffic accidents are generated and maintained for this purpose. To make these records meaningful and effective, it is necessary to analyze relationship between traffic accident and related factors including vehicle design, road design, weather, driver behavior etc. Insight derived from these analysis can be used for accident prevention approaches. Traffic accident data mining is an activity to find useful knowledges about such relationship that is not well-known and user may interested in it. Many studies about mining accident data have been reported over the past two decades. Most of studies mainly focused on predict risk of accident using accident related factors. Supervised learning methods like decision tree, logistic regression, k-nearest neighbor, neural network are used for these prediction. However, derived prediction model from these algorithms are too complex to understand for human itself because the main purpose of these algorithms are prediction, not explanation of the data. Some of studies use unsupervised clustering algorithm to dividing the data into several groups, but derived group itself is still not easy to understand for human, so it is necessary to do some additional analytic works. Rule based learning methods are adequate when we want to derive comprehensive form of knowledge about the target domain. It derives a set of if-then rules that represent relationship between the target feature with other features. Rules are fairly easy for human to understand its meaning therefore it can help provide insight and comprehensible results for human. Association rule learning methods and subgroup discovery methods are representing rule based learning methods for descriptive task. These two algorithms have been used in a wide range of area from transaction analysis, accident data analysis, detection of statistically significant patient risk groups, discovering key person in social communities and so on. We use both the association rule learning method and the subgroup discovery method to discover useful patterns from a traffic accident dataset consisting of many features including profile of driver, location of accident, types of accident, information of vehicle, violation of regulation and so on. The association rule learning method, which is one of the unsupervised learning methods, searches for frequent item sets from the data and translates them into rules. In contrast, the subgroup discovery method is a kind of supervised learning method that discovers rules of user specified concepts satisfying certain degree of generality and unusualness. Depending on what aspect of the data we are focusing our attention to, we may combine different multiple relevant features of interest to make a synthetic target feature, and give it to the rule learning algorithms. After a set of rules is derived, some postprocessing steps are taken to make the ruleset more compact and easier to understand by removing some uninteresting or redundant rules. We conducted a set of experiments of mining our traffic accident data in both unsupervised mode and supervised mode for comparison of these rule based learning algorithms. Experiments with the traffic accident data reveals that the association rule learning, in its pure unsupervised mode, can discover some hidden relationship among the features. Under supervised learning setting with combinatorial target feature, however, the subgroup discovery method finds good rules much more easily than the association rule learning method that requires a lot of efforts to tune the parameters.

HACCP Model for Quality Control of Sushi Production in the Eine Japanese Restaurants in Korea (일본전문식당의 급식품질 개선을 위한 HACCP 시스템 적용 연구)

  • 김혜경;이복희;김인호;조경동
    • Journal of the East Asian Society of Dietary Life
    • /
    • v.13 no.1
    • /
    • pp.25-38
    • /
    • 2003
  • This study was conducted to establish the microbiological quality standards applying the HACCP system on sushi items of Japanese restaurant in Korea. The study evaluated hygienic conditions of kitchen and workers, pH time-temperature relationship, and microbial assessments during whole process of sushi making in 2001. Overall hygienic conditions were normal for both kitchen and for workers by 3 point scale, but hygienic controls against the cross-contamination were still needed. Each process of sushi making was performed under the risk of microbial contamination, since pH value of most of ingredients was over pH 4.6 and also production time(3.5~6 hrs) were long enough to cause problems. Microorganisms were high enough to cause foodborne illness ranged 8.0$\times$10$^2$~3.3$\times$10$^{6}$ CFU/g of TPC and 1.0$\times$10$^1$~1.6$\times$10$^3$CFU/g of coliforms, although TPC, coliforms and Staphylcoccus aureus were within the standard limits (TPC 10$^2$~10$^{6}$ CFU/g, coliforms 10$^3$CFU/g). However, Salmonella and Vibrio parahaemolyticus were not detected. High populations TPC and coliforms were also found in the cooks' hands and cooking utensils(TPC 10$^2$~10$^{6}$ CFU/100cm$^2$and Coliforms 10$^1$~10$^3$CFU/100cm$^2$). Based on the CCP decision tree analysis, the CCPs were the holding steps far six sushi production line except the tuna and the thawing step for tuna sushi. In conclusion, overall state of sushi production was fairly good but much improvement was still needed.

  • PDF

A Study on the Nature-friendly Management Regarding the User Pattern of Yangjae Stream (양재천의 이용특성을 고려한 환경친화적 관리방안에 관한 연구)

  • Kim Sun-Hee;Hong Suk-Hwan;Bae Jung-Nam
    • Korean Journal of Environment and Ecology
    • /
    • v.18 no.3
    • /
    • pp.306-315
    • /
    • 2004
  • Yangjae stream, stretching through Seocho-gu and Gangnam-gu, is a representative city stream with its environmentally friendly stream makeover project model, launched in 1995. The district of Gangnam-gu, the subject of this study, is under high pressure from the residents for its use as a huge residential areas close to the stream. The study has two main purposes. The first is to identify the condition and characteristic of utilization of Yangjae stream which is currently being increased in use by the stream restoration. Secondly, the study aims to suggest the environment-friendly management to accomplish arrangement of the naturally friendly stream based on the identification survey, The result from the user survey with 303 valid answer sheets show that the people from neighboring residential areas use this stream a lot doing exercising(51.8%) and taking a walk(24.4%) in their free time. Also regular use rate is high, and people are likely to use it alone(30.4%) or as a family(28.4%). With regard to the need of facility increase, even though the respondents required resting places in the shade(80.8%) most, overall, additional introduction of facilities was analyzed as unnecessary(78.8%). safety issue(22.0%) and a lack of convenience facilities(17.6%) and resting places in the shade(16.6%) are pointed as main problems while the users are generally satisfied(59.5%) with the stream. Improving walk-way and planting trees for shade on the slope were designed as a solution for these problems. For securing safety through improvement of walk-way, the scattering of pressure of current walk with building new walk using berms was presented. In order to increase safety on the walk-way(see above figure), the study proposes to build a new walk-way with berms to disperse excessive pressure. It also suggests the tree planting to provide shade in the stream and to make a provision for the planting of forest trees in the current law.

A Study on the Characteristics of Jobs in Academic Libraries According to Different Generations (대학도서관 업무의 시대별 변천에 따른 특성 연구)

  • Cho, Chul-Hyun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.26 no.1
    • /
    • pp.135-170
    • /
    • 2015
  • This study aimed to investigate the transition of academic libraries' jobs by developing a model based on a shift of library generations including Library 1.0, Library 2.0, and Library 3.0 corresponding to the shift of web generations and to explore generational characteristics of library duties as well. The research used three phases of procedure: literature review about different library generations; job analyses for academic libraries in South Korea and the U.S.A.; the Delphi technique in tree sequential order. The research findings were as follows. First of all, there were 170 duties that continued from Library 1.0 to Library 3.0. There were 58 duties which continued from Library 2.0 to Library 3.0 whereas three duties that continued from Library 1.0 to Library 2.0. In addition, three distinctive duties existed only in Library 1.0 whereas one unique duty was only in Library 2.0. Library 3.0 generated 25 new duties. Secondly, considering general characteristics which cover specific parts of individual duties, there was a significant increase in importance, difficulty, and frequency of library administration throughout the three generations. In terms of importance, difficulty, and frequency of collection development and management, there was a significant increase only from Library 2.0 to Library 3.0. Considering information organization, there was a significant decrease in importance from Library 1.0 to Library 2.0. In addition, there was a significant decrease in frequency and there was no significant difference in difficulty throughout the three generations. In the case of information service, while there was a significant increase in importance among three generations, there was a significant increase in difficulty only from Library 1.0 to Library 2.0. However, there was no generational difference in frequency. With the respect of information system development and management, there was a significant increase in importance and frequency throughout the three generations, but there was no significant difference in difficulty among three generations.