• Title/Summary/Keyword: classification modeling


The Model of Appraisal Method on Authentic Records (전자기록의 진본 평가 시스템 모형 연구)

  • Kim, Ik-Han
    • The Korean Journal of Archival Studies / no.14 / pp.91-117 / 2006
  • Electronic records need to be appraised for authenticity as well as for their value. There has been much discussion of how the value of records should be appraised, but little about how the authenticity of electronic records should be appraised. This article therefore models specific authenticity appraisal methods and shows the stages at which each method should or may be applied. At the Ingest stage, three checks are essential: integrity verification right after records creation in the organization that produced the records; quality and integrity verification of the transferred records in the organization that received them; and an integrity check between the SIP and the AIP in the organization that received and preserved the records. At the Preservation stage, an integrity check between identical AIPs stored separately on different media, validation of whether records have been damaged, and recovery of damaged records are needed. At the various Processing stages, suitability evaluation after changes to a record's management control metadata or classification, an integrity check after records migration, and periodic validation and integrity verification of DIPs are required. For these activities, appraisal methods including integrity verification, content consistency checks, suitability evaluation of record metadata, checks on the feasibility of unauthorized updates, and physical status validation should be applied to the electronic records management process.
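
The integrity checks named in this abstract (for example, between a SIP and the AIP derived from it, or between copies of the same AIP on different media) are commonly implemented by comparing cryptographic digests. The sketch below illustrates that idea only. The abstract does not name an algorithm, so the choice of SHA-256 and the function names `digest` and `integrity_check` are assumptions made for illustration.

```python
import hashlib


def digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of a record's bytes (algorithm is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def integrity_check(reference: bytes, copies: list) -> list:
    """Compare each stored copy against the reference record.

    Returns one boolean per copy: True if the copy is bit-identical
    to the reference, False if it differs (damaged or altered).
    """
    expected = digest(reference)
    return [digest(copy) == expected for copy in copies]


# Hypothetical usage: one intact copy and one altered copy.
results = integrity_check(b"record", [b"record", b"recorD"])
```

A digest comparison of this kind only detects change; it does not say whether a change was authorized, which is why the abstract lists other checks alongside it.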

Systematic Literature Review for HRD in Korea Franchise Business (국내 프랜차이즈 사업에서의 인적자원개발에 관한 체계적 문헌 고찰)

  • KIM, Eunsung;LEE, Sang-Seub
    • The Korean Journal of Franchise Management / v.10 no.2 / pp.33-47 / 2019
  • Purpose - The purpose of this study is to classify and analyze, from various angles and through a systematic literature review, how human resources development (HRD) has been researched in the domestic franchise business. The study is intended to suggest the direction that future HRD research in the franchise business should take. Research design, data, and methodology - This study is based on systematic literature review methodology. It went through the steps of subject language setting, literature search routing, search term selection, literature selection, literature classification, and literature analysis. The systematic literature review identified 59 peer-reviewed dissertations and scientific journal publications on the subject of HRD in the Korean franchise business. Result - The literature was analyzed by research method, research industry, research population, and dependent variable using the systematic review process. The literature from the 2000s mainly concerned the education and training of franchise employees in the beauty franchise business. In the literature since 2010, HRD was mainly studied for supervisors in the restaurant franchise business, with a focus on competence rather than education and training. In terms of research methods, statistical methods were mostly relatively simple until the 2000s, such as the t-test or one-way analysis of variance (ANOVA); after 2010, more in-depth and structural studies were conducted using multiple regression analysis, structural equation modeling, path analysis, multidimensional scaling, AHP, and so on. When classified by dependent variable, early research until the 2000s focused on the effect of education and training, the independent variable, on satisfaction with education programs, job satisfaction, and immersion.
On the other hand, studies conducted since 2010 have produced more complex results using various mediating variables, and have mainly examined outcomes related to management performance and relationship performance rather than satisfaction with the education itself. Conclusions - While the domestic franchise business is expanding in quantity, such as in the number of franchisors and franchisees, development in quality for the joint growth of franchisors and franchisees is still lacking. For franchisors and franchisees to keep growing together, they must identify and develop their current performance or expected capabilities through competency modeling at various targets and levels.

Evaluation of Priorities for Greening of Vacant Houses using Connectivity Modeling (연결성 모델링을 활용한 빈집 녹지화 우선순위 평가)

  • Lee, Hyun-Jung;Kim, Whee-Moon;Kim, Kyeong-Tae;Shin, Ji-Young;Park, Chang-Sug;Park, Hyun-Joo;Song, Won-Kyong
    • Journal of the Korean Society of Environmental Restoration Technology / v.25 no.1 / pp.25-38 / 2022
  • Urban problems are constantly occurring around the world due to rapid industrialization and population decline. In particular, as the number of vacant houses is gradually increasing as the population decreases, it is necessary to prepare countermeasures. A plan to utilize vacant houses has emerged to restore the natural environment of the urban ecosystem where forest destruction, damage to habitats of wild animals and plants, and disconnection have occurred due to large-scale development. Through connectivity analysis, it is possible to understand the overall ecosystem flow based on the movement of species and predict the effect when vacant houses are converted into green spaces. Therefore, this study analyzed the green area network to confirm the possibility of greening of vacant houses neglected in Jeonju based on circuit theory. Using Circuitscape and Least-cost path, we tried to identify the connectivity of green areas and propose an ecological axis based on the analysis. In order to apply the resistance values required for analysis based on previous studies, the 2020 subdivision land cover data were integrated into the major classification evaluation items. When the eight forests in the target site were analyzed as the standard, the overall connectivity and connectivity between forests in the area were high, so it is judged that the existing green areas can perform various functions, such as species movement and provision of habitats. Based on the results of the connectivity analysis, the importance of vacant houses was calculated and the top 20 vacant houses were identified, and it was confirmed that the higher the ranking, the more positive the degree of landscape connectivity was when converted to green areas. In addition, it was confirmed that the results of analyzing the least-cost path based on the resistance values such as connectivity analysis and the existing conceptual map showed some differences when comparing the ecological axes in the form. 
As a result of checking the vacant houses corresponding to the relevant axis based on the width standards of the main and sub-green areas, a total of 30 vacant houses were included in the 200m width and 6 vacant houses in the 80m width. It is judged that the conversion of vacant houses to green space can contribute to biodiversity conservation as well as connectivity between habitats of species as it is coupled with improved green space connectivity. In addition, it is expected to help solve the problem of vacant houses in the future by showing the possibility of using vacant houses.

Detection of Wildfire Smoke Plumes Using GEMS Images and Machine Learning (GEMS 영상과 기계학습을 이용한 산불 연기 탐지)

  • Jeong, Yemin;Kim, Seoyeon;Kim, Seung-Yeon;Yu, Jeong-Ah;Lee, Dong-Won;Lee, Yangwon
    • Korean Journal of Remote Sensing / v.38 no.5_3 / pp.967-977 / 2022
  • The occurrence and intensity of wildfires are increasing with climate change. Emissions from forest fire smoke are recognized as one of the major factors affecting air quality and the greenhouse effect. Satellite products and machine learning are essential for detecting forest fire smoke. Until now, research on forest fire smoke detection has been hampered by difficulties in cloud identification and by vague standards for smoke boundaries. The purpose of this study is to detect forest fire smoke using machine learning and Level 1 and Level 2 data from the Geostationary Environment Monitoring Spectrometer (GEMS), a Korean environmental satellite sensor. The forest fire in Gangwon-do in March 2022 was selected as a case. Smoke pixel classification modeling was performed by producing wildfire smoke label images and feeding GEMS Level 1 and Level 2 data into a random forest model. In the trained model, the input variables ranked in order of importance were Aerosol Optical Depth (AOD), the 380 nm and 340 nm radiance difference, Ultra-Violet Aerosol Index (UVAI), Visible Aerosol Index (VisAI), Single Scattering Albedo (SSA), formaldehyde (HCHO), nitrogen dioxide (NO2), 380 nm radiance, and 340 nm radiance. In addition, in the estimation of the forest fire smoke probability (0 ≤ p ≤ 1) for 2,704 pixels, the Mean Bias Error (MBE) was -0.002, the Mean Absolute Error (MAE) was 0.026, the Root Mean Square Error (RMSE) was 0.087, and the Correlation Coefficient (CC) was 0.981.
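
The four accuracy figures quoted in this abstract (MBE, MAE, RMSE, CC) have standard definitions, sketched below for a list of predicted and reference smoke probabilities. This is an illustration, not the authors' code: the function name `error_metrics`, the sign convention for MBE (predicted minus reference), and the use of the Pearson coefficient for CC are assumptions.

```python
import math


def error_metrics(pred, ref):
    """Compute MBE, MAE, RMSE and Pearson CC for paired values.

    pred: predicted probabilities; ref: reference (label) probabilities.
    MBE is taken as mean(pred - ref), which is an assumed sign convention.
    """
    n = len(pred)
    diffs = [p - r for p, r in zip(pred, ref)]
    mbe = sum(diffs) / n
    mae = sum(abs(d) for d in diffs) / n
    rmse = math.sqrt(sum(d * d for d in diffs) / n)

    # Pearson correlation coefficient between pred and ref.
    mean_p = sum(pred) / n
    mean_r = sum(ref) / n
    cov = sum((p - mean_p) * (r - mean_r) for p, r in zip(pred, ref))
    sd_p = math.sqrt(sum((p - mean_p) ** 2 for p in pred))
    sd_r = math.sqrt(sum((r - mean_r) ** 2 for r in ref))
    cc = cov / (sd_p * sd_r)

    return {"MBE": mbe, "MAE": mae, "RMSE": rmse, "CC": cc}


# Hypothetical values, not data from the study.
metrics = error_metrics([0.1, 0.4, 0.9], [0.0, 0.5, 1.0])
```

A near-zero MBE together with a larger MAE, as reported in the abstract, indicates that over- and under-estimates largely cancel rather than that individual errors are absent.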

Movie Recommended System base on Analysis for the User Review utilizing Ontology Visualization (온톨로지 시각화를 활용한 사용자 리뷰 분석 기반 영화 추천 시스템)

  • Mun, Seong Min;Kim, Gi Nam;Choi, Gyeong cheol;Lee, Kyung Won
    • Design Convergence Study / v.15 no.2 / pp.347-368 / 2016
  • Recent research on word of mouth (WOM) suggests that consumers use WOM information about products in their purchase process. This study proposes methods that use opinion mining and visualization to understand consumers' opinions of individual goods and markets. We develop a domain ontology based on reviews restricted to the "movie" category, because people who want to watch a movie increasingly refer to others' movie reviews, and we analyze it with opinion mining and visualization. Unlike other studies, we classify the attributes of evaluation factors and compile a dictionary of terms for those factors when building the ontology for analysis. We seek to show through the results whether the research method is valid. The results fall broadly into three parts. First, this research explains how to develop a domain ontology using keyword extraction and topic modeling. Second, we visualize the reviews of each movie to understand the overall audience opinion about specific movies. Third, we find clusters of products that received similar assessments according to the evaluation results. The case study shows three main clusters containing 130 movies, grouped according to audience opinion.

Robust Speech Recognition Algorithm of Voice Activated Powered Wheelchair for Severely Disabled Person (중증 장애우용 음성구동 휠체어를 위한 강인한 음성인식 알고리즘)

  • Suk, Soo-Young;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea / v.26 no.6 / pp.250-258 / 2007
  • Current speech recognition technology has achieved high performance with the development of hardware devices; however, it is insufficient for some applications where high reliability is required, such as voice control of powered wheelchairs for disabled persons. For a system that aims to operate a powered wheelchair safely by voice in a real environment, non-voice sounds such as the user's coughing, breathing, and spark-like mechanical noise should be rejected, and the wheelchair system needs to recognize speech commands affected by disability, which have specific pronunciation speeds and frequencies. In this paper, we propose a non-voice rejection method that performs voice/non-voice classification using both YIN-based fundamental frequency (F0) extraction and reliability in preprocessing. We adopted a multi-template dictionary and acoustic-modeling-based speaker adaptation to cope with the pronunciation variation of inarticulately uttered speech. In recognition tests conducted with data collected in a real environment, the proposed YIN-based fundamental frequency extraction showed a recall-precision rate of 95.1%, better than the 62% of the cepstrum-based method. A recognition test of the new system with the multi-template dictionary and MAP adaptation also showed a much higher accuracy of 99.5%, compared with 78.6% for the baseline system.

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.24 no.2 / pp.125-148 / 2018
  • Recently, as the demand for big data analysis increases, cases of analyzing unstructured data and using the results are also increasing. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in text because the amount of data is very large and it is relatively easy to collect compared with other unstructured and structured data. Among the various text analysis applications, the following have been actively studied: document classification, which classifies documents into predetermined categories; topic modeling, which extracts major topics from a large number of documents; sentiment analysis or opinion mining, which identifies emotions or opinions contained in texts; and text summarization, which summarizes the main contents of one or several documents. In particular, text summarization is actively applied in business through news summary services, privacy policy summary services, etc. In addition, much academic research has followed either the extraction approach, which selectively provides the main elements of the document, or the abstraction approach, which extracts elements of the document and composes new sentences by combining them. However, techniques for evaluating the quality of automatically summarized documents have not made as much progress as techniques for automatic text summarization. Most existing studies on the quality evaluation of summarization manually summarized documents, used these summaries as reference documents, and measured the similarity between the automatic summary and the reference document. Specifically, automatic summarization is performed on the full text through various techniques, and the result is compared with the reference document, which is an ideal summary, to measure the quality of the automatic summarization.
Reference documents are provided in two major ways. The most common is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in preparing the summary, writing the summary takes a lot of time and cost, and the evaluation result may differ depending on the subjectivity of the summarizer. To overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. As a representative attempt, a method has recently been devised that reduces the size of the full text and measures the similarity between the reduced full text and the automatic summary. In this method, the more often frequent terms of the full text appear in the summary, the better the quality of the summary is judged to be. However, since summarization essentially means reducing a large amount of content while minimizing omissions, it is unreasonable to say that a "good summary" based only on frequency is always a "good summary" in the essential sense. To overcome the limitations of this previous work on summarization evaluation, this study proposes an automatic quality evaluation method for text summarization based on the essential meaning of summarization. Specifically, succinctness is defined as an element indicating how little content is duplicated among the sentences of the summary, and completeness is defined as an element indicating how little of the content is left out of the summary. In this paper, we propose a method for automatic quality evaluation of text summarization based on the concepts of succinctness and completeness.
In order to evaluate the practical applicability of the proposed methodology, we extracted 29,671 sentences from TripAdvisor's hotel reviews, summarized the reviews for each hotel, and present the results of experiments evaluating the quality of the summaries according to the proposed methodology. We also provide a way to integrate completeness and succinctness, which are in a trade-off relationship, into an F-Score, and propose a method to perform optimal summarization by changing the sentence similarity threshold.
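
The abstract states that completeness and succinctness are combined into an F-Score but does not give the formula. A minimal sketch, assuming the usual F-measure form (a weighted harmonic mean, as used for precision and recall) with both scores in the range 0 to 1, might look like this. The function name `f_score` and the `beta` weight are hypothetical.

```python
def f_score(completeness: float, succinctness: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of two scores in [0, 1].

    With beta = 1 the two scores are weighted equally (the F1 form).
    This mirrors the standard F-measure; whether the paper uses exactly
    this form is an assumption.
    """
    b2 = beta * beta
    denominator = b2 * completeness + succinctness
    if denominator == 0:
        return 0.0  # both scores are zero
    return (1 + b2) * completeness * succinctness / denominator


# Hypothetical scores: a complete but repetitive summary.
score = f_score(1.0, 0.5)
```

Because the harmonic mean is pulled toward the lower of the two values, a summary cannot score well by maximizing one property at the expense of the other, which fits the trade-off the abstract describes.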

Suggestion of Urban Regeneration Type Recommendation System Based on Local Characteristics Using Text Mining (텍스트 마이닝을 활용한 지역 특성 기반 도시재생 유형 추천 시스템 제안)

  • Kim, Ikjun;Lee, Junho;Kim, Hyomin;Kang, Juyoung
    • Journal of Intelligence and Information Systems / v.26 no.3 / pp.149-169 / 2020
  • The "Urban Regeneration New Deal project", one of the government's major national projects, aims to develop underdeveloped areas by investing 50 trillion won in 100 locations in the first year and 500 over the next four years. This project is drawing keen attention from the media and local governments. However, the project model fails to reflect the original characteristics of each area because it divides project areas into five categories: "Our Neighborhood Restoration, Housing Maintenance Support Type, General Neighborhood Type, Central Urban Type, and Economic Base Type." Keywords for successful urban regeneration in Korea include "resident participation," "regional specialization," "ministerial cooperation," and "public-private cooperation." These suggest that, when local governments propose urban regeneration projects to the government, it is most important to understand the characteristics of the city accurately and to push ahead with the projects in a way that suits those characteristics, with the help of local residents and private companies. In addition, considering gentrification, one of the side effects of urban regeneration projects, it is important to select and implement urban regeneration types suited to the characteristics of the area. To supplement the limitations of the "Urban Regeneration New Deal project" methodology, this study proposes a system that recommends urban regeneration types suitable for urban regeneration sites using various machine learning algorithms, with reference to the urban regeneration types of the "2025 Seoul Metropolitan Government Urban Regeneration Strategy Plan," which was promoted on the basis of regional characteristics. There are four types of urban regeneration in Seoul: "Low-use Low-Level Development, Abandonment, Deteriorated Housing, and Specialization of Historical and Cultural Resources" (Shon and Park, 2017).
In order to identify regional characteristics, approximately 100,000 items of text data were collected for 22 regions where projects were carried out under the four types of urban regeneration. Using the collected data, we extracted key keywords for each region by type of urban regeneration and conducted topic modeling to explore whether there were differences between types. As a result, it was confirmed that a number of topics related to real estate and the economy appeared in old residential areas, and that in declining and underdeveloped areas, topics reflecting the characteristics of areas where industrial activities were active in the past appeared. In the historical and cultural resource areas, which contain traces of the past, many keywords related to the government appeared, so political topics and cultural topics arising from various events could be confirmed. Finally, in low-use and under-developed areas, many topics on real estate and accessibility appeared, indicating good accessibility; these areas mainly had the characteristics of regions where development is planned or likely. Furthermore, a model was implemented that proposes urban regeneration types tailored to regional characteristics for regions other than Seoul. Machine learning was used to implement the model, with training data and test data randomly extracted at an 8:2 ratio. To compare performance across models, the input variables were set in two ways, Count Vector and TF-IDF Vector, and five classifiers were applied: SVM (Support Vector Machine), Decision Tree, Random Forest, Logistic Regression, and Gradient Boosting. Performance was thus compared for a total of 10 models. The model with the highest performance was Gradient Boosting with TF-IDF Vector input data, with an accuracy of 97%.
Therefore, the recommendation system proposed in this study is expected to recommend urban regeneration types based on the regional characteristics of new business sites in the process of carrying out urban regeneration projects.
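
The experimental design in this abstract, two input representations crossed with five classifiers and an 8:2 random split, can be laid out as follows. This is a sketch of the experiment's structure only: it enumerates the configurations and performs the split, but trains nothing. The string labels and the helper names `model_grid` and `split_80_20` are hypothetical, as is the fixed random seed.

```python
import itertools
import random

# Labels taken from the abstract; the identifiers themselves are illustrative.
VECTORIZERS = ["count_vector", "tfidf_vector"]
CLASSIFIERS = [
    "svm",
    "decision_tree",
    "random_forest",
    "logistic_regression",
    "gradient_boosting",
]


def model_grid():
    """Return every (vectorizer, classifier) pair to be compared."""
    return list(itertools.product(VECTORIZERS, CLASSIFIERS))


def split_80_20(items, seed=0):
    """Randomly split items into training (80%) and test (20%) lists."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]


configs = model_grid()                       # 2 x 5 configurations
train_ids, test_ids = split_80_20(range(100))
```

Using one shared split for all configurations keeps the comparison fair, since every model is then scored on the same held-out documents.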

Spatial Distribution of Aging District in Taejeon Metropolitan City (대전광역시 노령화 지구의 공간적 분포 패턴)

  • Jeong, Hwan-Yeong;Ko, Sang-Im
    • Journal of the Korean association of regional geographers / v.6 no.2 / pp.1-19 / 2000
  • This study is to investigate and analyze regional patterns of aging in Taejeon Metropolitan city-the overpopulated area of Choong-Cheong Province-by cohort analysis method. According to the population structure transition caused by rapid social and economic changes, Korea has made a rapid progress in population aging since 1970. This trend is so rapid that we should prepare for and cope with aging society. It is not only slow to cope with it in our society, but also there are few studies on population aging of the geographical field in Korea. The data of this study are the reports of Population and Housing Censuses in 1975 and 1985 and General Population and Housing Censuses with 10% sample survey in 1995 taken by National Statistical Office. The research method is to sample as the aging district the area with high aged population rate where the populations over 60 reside among total population during the years of 1975, 1985, 1995 and to sample the special districts of decreasing population where the population decreases very much and the special districts of increasing population in which the population increases greatly, presuming that the reason why aged population rate increases is that non-elderly population high in mobility moves out. It is then verified and ascertained whether it is true or not with cohort analysis method by age. Finally regional patterns in the city are found through the classification and modeling by type based on the aging district, the special districts of decreasing population, and the special districts of increasing population. The characteristics of the regional patterns show that there is social population transition and that non-elderly population moves out. The aging district with the high aged population rate is divided into high-level keeping-up type, relative falling type below the average of Taejeon city in aging progress, and relative rising type above the average of the city. 
This district can be found at both the central area of the city and the suburbs because Taejeon city has the characteristic of over-bounded city. But it cannot be found at the new built-up area with the in-migration of large population. The special districts of decreasing population where the population continues to decrease can be said to be the population doughnuts found at the CBD and its neighboring inner area. On the other hand, the special districts of increasing population where the population continues to increase are located at the new built-up area of the northern part in Taejeon city. The special districts of decreasing population are overlapping with the aging district and higher in aged population rate by the out-migration of non-elderly population. The special districts of increasing population are not overlapping with the aging district and lower in aged population rate by the in-migration of non-elderly population. To clarify the distribution map of the aging district, the special districts of decreasing and increasing population and the aging district are divided into four groups such as the special districts of decreasing population group-the same one as the aging district, the special districts of decreasing population group, the special districts of increasing population group, and the other district. With the cohort analysis method by age used to investigate the definite increase and decrease of aging population through population transition of each group, it is found that the progress of population aging is closely related to the social population fluctuation, especially that aged population rate is higher with the out-migration of non-elderly population. This is to explain each model of CBD, inner area, and the suburbs after modeling the aging district, the special districts of decreasing population, and the special districts of increasing population in Taejeon city. 
On the assumption that the city area is a concentric circle, it is possible to divide it into three areas such as CBD(A), the inner area(B), and the suburbs(C). The special districts of increasing and decreasing population in the city are divided into three districts-the special districts of decreasing population(a), the special districts of increasing population(b), and the others(c). The aging district of this city is divided into the aging district($\alpha$) and the others($\beta$). And then modeling these districts, it is probable to find regional patterns in the city. $Aa{\alpha}$ and $Ac{\beta}$ patterns are found in the CBD, in which $Aa{\alpha}$ is the special district of decreasing population and is higher in aged population rate because of aged population low in mobility staying behind and out-migration of non-elderly population. $Ba{\alpha}$, $Ba{\beta}$, $Bb{\beta}$, and $Bc{\beta}$ patterns are found in the inner area, in which neighboring area $Ba{\alpha}$ pattern is located. $Bb{\beta}$ pattern is located at the new developing area of newly built apartment complex. $Cb{\beta}$, $Cc{\alpha}$, and $Cc{\beta}$ patterns are found in the suburbs, among which $Cc{\alpha}$ pattern is highest in population aging. It is likely that the $Cc{\beta}$ under housing land readjustment on a large scale will be the $Cb{\beta}$ pattern. As analyzed above, marriage and out-migration of new family, non-elderly population, with house purchase are main factors in accelerating population aging in the central area of the city. Population aging is responsible for the great increase of aged population with longer life expectancy by the low death rate, the out-migration of non-elderly population, and the age group of new aged population in the suburbs. It is necessary to investigate and analyze the regional patterns of population aging at the time when population problems caused by aging as well as longer life expectancy are now on the increase. 
I hope that this will help future studies on population aging in the field of geography in Korea. As population aging will be a major problem in our society in the future, local governments should make plans according to how far population aging has progressed in each regional group and prepare for it accordingly.
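
The pattern codes in this abstract combine three labels: a zone (A, B, C), a population-change class (a, b, c), and an aging class (α, β). As a rough illustration, the sketch below enumerates all possible codes and sets them against the patterns the abstract reports finding. The variable and function names are hypothetical; the list of observed patterns is copied from the abstract.

```python
from itertools import product

ZONES = {"A": "CBD", "B": "inner area", "C": "suburbs"}
CHANGE = {
    "a": "decreasing population",
    "b": "increasing population",
    "c": "others",
}
AGING = {"α": "aging district", "β": "others"}

# Every combination of zone, population change and aging class: 3 x 3 x 2.
ALL_PATTERNS = ["".join(code) for code in product(ZONES, CHANGE, AGING)]

# Patterns the abstract reports for the CBD, the inner area and the suburbs.
OBSERVED = ["Aaα", "Acβ", "Baα", "Baβ", "Bbβ", "Bcβ", "Cbβ", "Ccα", "Ccβ"]

# Combinations that are possible in principle but not reported.
UNOBSERVED = [p for p in ALL_PATTERNS if p not in OBSERVED]


def describe(code: str) -> str:
    """Expand a three-character pattern code into its three labels."""
    zone, change, aging = code
    return f"{ZONES[zone]} / {CHANGE[change]} / {AGING[aging]}"
```

For example, `describe("Aaα")` expands to the CBD district with decreasing population that is also an aging district, which the abstract identifies as the pattern driven by out-migration of the non-elderly population.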


DISEASE DIAGNOSED AND DESCRIBED BY NIRS

  • Tsenkova, Roumiana N.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference / 2001.06a / pp.1031-1031 / 2001
  • The mammary gland is made up of remarkably sensitive tissue, which is capable of producing a large volume of secretion, milk, under normal or healthy conditions. When bacteria enter the gland and establish an infection (mastitis), inflammation is initiated, accompanied by an influx of white cells from the blood stream, altered secretory function, and changes in the volume and composition of secretion. Cell numbers in milk are closely associated with inflammation and udder health. These somatic cell counts (SCC) are accepted as the international standard measurement of milk quality in dairy and for mastitis diagnosis. NIR spectra of unhomogenized composite milk samples from 14 cows (healthy and mastitic) were measured 7 days after parturition and during the next 30 days of lactation. Different multivariate analysis techniques were used to diagnose the disease at a very early stage and to determine how the spectral properties of milk vary with its composition and animal health. A PLS model for predicting somatic cell count (SCC) from NIR milk spectra was built. The best accuracy of determination for the 1100-2500 nm range was found using smoothed absorbance data and 10 PLS factors. The standard error of prediction for the independent validation set of samples was 0.382, the correlation coefficient 0.854, and the variation coefficient 7.63%. It was found that SCC determination from NIR milk spectra was indirect and based on the related changes in milk composition. From the spectral changes, we learned that when mastitis occurred, the most significant factors that simultaneously influenced milk spectra were alteration of milk proteins and changes in the ionic concentration of milk. This was consistent with the results we obtained later when we applied 2DCOS. Two-dimensional correlation analysis of NIR milk spectra was done to assess the changes in milk composition that occur when somatic cell count (SCC) levels vary.
The synchronous correlation map revealed that when SCC increases, protein levels increase while water and lactose levels decrease. Results from the analysis of the asynchronous plot indicated that changes in water and fat absorptions occur before those of other milk components. In addition, the technique was used to assess the changes in milk during a period when SCC levels do not vary appreciably. Results indicated that milk components are in equilibrium and no appreciable change in a given component was seen with respect to another. This was found in both healthy and mastitic animals. However, milk components were found to vary with SCC content regardless of the range considered. This important finding demonstrates that 2-D correlation analysis may be used to track even subtle changes in milk composition in individual cows. To find the right threshold for SCC when used for mastitis diagnosis at cow level, classification of milk samples was performed using soft independent modeling of class analogy (SIMCA) and different spectral data pretreatments. Two levels of SCC, 200 000 cells/mL and 300 000 cells/mL, were set up and compared as thresholds to discriminate between healthy and mastitic cows. The best detection accuracy was found with 200 000 cells/mL as the threshold for mastitis and smoothed absorbance data: 98% of the milk samples in the calibration set and 87% of the samples in the independent test set were correctly classified. When the spectral information was studied, it was found that successful mastitis diagnosis was based on revealing the spectral changes related to the corresponding changes in milk composition. NIRS combined with different ways of spectral data mining can provide a faster and nondestructive alternative to current methods for mastitis diagnosis and a new insight into disease understanding at the molecular level.
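
The threshold comparison in this abstract can be illustrated with a simple sketch: samples are labeled healthy or mastitic by their SCC against a chosen threshold, and a classifier's output is then scored against those labels. The SIMCA classification on NIR spectra that the study actually used is not reproduced here. The function names, the treatment of a sample exactly at the threshold as mastitic, and the sample values are assumptions.

```python
def label_by_scc(scc_values, threshold=200_000):
    """Label each sample as mastitic (True) if SCC >= threshold (cells/mL).

    Treating a sample exactly at the threshold as mastitic is an assumption.
    """
    return [scc >= threshold for scc in scc_values]


def accuracy(predicted, reference):
    """Fraction of samples whose predicted label matches the reference label."""
    correct = sum(p == r for p, r in zip(predicted, reference))
    return correct / len(reference)


# Hypothetical SCC values in cells/mL, not data from the study.
scc = [150_000, 250_000, 320_000]
labels_200k = label_by_scc(scc, threshold=200_000)
labels_300k = label_by_scc(scc, threshold=300_000)
```

Changing the threshold changes the reference labels themselves, so accuracies obtained at 200 000 and 300 000 cells/mL describe two different classification problems rather than two scores on one problem.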
