• Title/Summary/Keyword: Search Tree

Search Result 636, Processing Time 0.026 seconds

A Study on Updated Object Detection and Extraction of Underground Information (지하정보 변화객체 탐지 및 추출 연구)

  • Kim, Kwangsoo;Lee, Heyung-Sub;Kim, Juwan
    • Journal of Software Assessment and Valuation
    • /
    • v.16 no.2
    • /
    • pp.99-107
    • /
    • 2020
  • An underground integrated map is being built for underground safety management and is being updated periodically. The map update proceeds with the procedure of deleting all previously stored objects and saving newly entered objects. However, even unchanged objects are repeatedly stored, deleted, and stored. That causes the delay of the update time. In this study, in order to shorten the update time of the integrated map, an updated object and an unupdated object are separated, and only updated objects are reflected in the underground integrated map, and a system implementing this technology is described. For the updated object, an object comparison method using the center point of the object is used, and a quad tree is used to improve the search speed. The types of updated objects are classified into addition and deletion using the shape of the object, and change using its attributes. The proposed system consists of update object detection, extraction, conversion, storage, and history management modules. This system has the advantage of being able to update the integrated map about four times faster than the existing method based on the data used in the experiment, and has the advantage that it can be applied to both ground and underground facilities.

The Detection of Online Manipulated Reviews Using Machine Learning and GPT-3 (기계학습과 GPT3를 시용한 조작된 리뷰의 탐지)

  • Chernyaeva, Olga;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.347-364
    • /
    • 2022
  • Fraudulent companies or sellers strategically manipulate reviews to influence customers' purchase decisions; therefore, the reliability of reviews has become crucial for customer decision-making. Since customers increasingly rely on online reviews to search for more detailed information about products or services before purchasing, many researchers focus on detecting manipulated reviews. However, the main problem in detecting manipulated reviews is the difficulties with obtaining data with manipulated reviews to utilize machine learning techniques with sufficient data. Also, the number of manipulated reviews is insufficient compared with the number of non-manipulated reviews, so the class imbalance problem occurs. The class with fewer examples is under-represented and can hamper a model's accuracy, so machine learning methods suffer from the class imbalance problem and solving the class imbalance problem is important to build an accurate model for detecting manipulated reviews. Thus, we propose an OpenAI-based reviews generation model to solve the manipulated reviews imbalance problem, thereby enhancing the accuracy of manipulated reviews detection. In this research, we applied the novel autoregressive language model - GPT-3 to generate reviews based on manipulated reviews. Moreover, we found that applying GPT-3 model for oversampling manipulated reviews can recover a satisfactory portion of performance losses and shows better performance in classification (logit, decision tree, neural networks) than traditional oversampling models such as random oversampling and SMOTE.

Jeong Yak-Yong's Zhong-yong: The Habit of Moral Behavior Through Grasp (정약용의 중용: 장악을 통한 도덕적 행위의 습관화)

  • Gao, Ming-Wen;Mo, A-Yeong
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.8
    • /
    • pp.793-803
    • /
    • 2018
  • Since Confucius presentied 'zhong-yong' and Zi Si wrote Zhong-Yong (The Doctring of the Mean), specially since Zhu Xi edited Zhong-Yong as one of Si-Shu (The Four Books) and interpreted it, zhong-yong was not only recognized as the extreme of morality but also as a significant category of Confucianism. The purpose of this paper is to clarify how Jeong Yak-Yong criticized Zhu Xi's interpretation of zhong-yong, and furthermore, to search how Jung Yak-yong explain the zhong-yong by tree concepts of 'grasp', 'moral behavior', and 'habit'. Zhu Xi interpreted zong as a measured absolute middle of two-side and a non-launched original nature. Interpreted yong as a common truth. Therefore he interpreted zhong-yong as a common truth of neither excessive nor enough. Accordingly, Zhu Xi's zhong-yong can be understood as absolute zhong-yong without human's reflection and moral behavior. But Jeong Yak-Yong interpreted zhong as the state of a man's very hard-concentrate and grasp of situation, and interpreted yong as the state of a man's very hard-effort moral behavior and it's habit. Therefore he explained zhong-yong as a habit of moral behavior through grasp.

A Study on the Prediction of Uniaxial Compressive Strength Classification Using Slurry TBM Data and Random Forest (이수식 TBM 데이터와 랜덤포레스트를 이용한 일축압축강도 분류 예측에 관한 연구)

  • Tae-Ho Kang;Soon-Wook Choi;Chulho Lee;Soo-Ho Chang
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.547-560
    • /
    • 2023
  • Recently, research on predicting ground classification using machine learning techniques, TBM excavation data, and ground data is increasing. In this study, a multi-classification prediction study for uniaxial compressive strength (UCS) was conducted by applying random forest model based on a decision tree among machine learning techniques widely used in various fields to machine data and ground data acquired at three slurry shield TBM sites. For the classification prediction, the training and test data were divided into 7:3, and a grid search including 5-fold cross-validation was used to select the optimal parameter. As a result of classification learning for UCS using a random forest, the accuracy of the multi-classification prediction model was found to be high at both 0.983 and 0.982 in the training set and the test set, respectively. However, due to the imbalance in data distribution between classes, the recall was evaluated low in class 4. It is judged that additional research is needed to increase the amount of measured data of UCS acquired in various sites.

Identification of DNA Methylation Markers for NSCLC Using Hpall-Mspl Methylation Microarray (Hpall-Mspl Methylation Microarray를 이용한 비소세포폐암의 DNA Methylation Marker 발굴)

  • Kwon, Mi Hye;Lee, Go Eun;Kwon, Sun Jung;Choi, Eugene;Na, Moon Jun;Cho, Hyun Min;Kim, Young Jin;Sul, Hye Jung;Cho, Young Jun;Son, Ji Woong
    • Tuberculosis and Respiratory Diseases
    • /
    • v.65 no.6
    • /
    • pp.495-503
    • /
    • 2008
  • Background: Epigenetic alterations in certain genes are now known as at least important as genetic mutation in pathogenesis of cancer. Especially abnormal hypermethylation in or near promoter region of tumor suppressor genes (TSGs) are known to result in gene silencing and loss of gene function eventually. The authors tried to search for new lung cancer-specific TSGs which have CpG islands and HpaII sites, and are thought to be involved in carcinogenesis by epigenetic mechanism. Methods: Tumor tissue and corresponding adjacent normal tissue were obtained from 10 patients who diagnosed with non small cell lung cancer (NSCLC) and underwent surgery in Konyang university hospital in 2005. Methylation profiles of promoter region of 21 genes in tumor tissue & non-tumor tissue were examined with HpaII-MspI methylation microarray (Methyl-Scan DNA chip$^{(R)}$, Genomic tree, Inc, South Korea). The rates of hypermethylation were compared in tumor and non-tumor group, and as a normal control, we obtained lung tissue from two young patients with pneumothorax during bullectomies, methylation profiles were examined in the same way. Results: Among the 21 genes, 10 genes were commonly methylated in tumor, non-tumor, and control group. The 6 genes of APC, AR, RAR-b, HTR1B, EPHA3, and CFTR, among the rest of 11 genes were not methylated in control, and more frequently hypermethylated in tumor tissue than non-tumor tissue. Conclusion: In the present study, HTR1B, EPHA3, and CFTR are suggested as possible novel TSGs of NSCLC by epigenetic mechanism.

A Study on the Aspects and Characteristics of the Vegetation Maintenance Project at the Historic Site of Angkor, Cambodia -with the Focus on Preah Khan, Banteay Srei, and Ta Prohm Temples- (캄보디아 앙코르 유적에서 식생정비 사업의 양상과 특징에 관한 고찰 - 프레아 칸 사원·반테이 스레이 사원·타 프롬 사원을 중심으로 -)

  • Lee, Jae-Yong;Kim, Young-Mo
    • Korean Journal of Heritage: History & Science
    • /
    • v.51 no.1
    • /
    • pp.32-47
    • /
    • 2018
  • The purpose of this study is to examine the vegetation maintenance project that was conducted as a part of the Official Development Assistance (ODA) project for the historic site of Angkor, to analyze the aspects and characteristics of the project, and to derive implications for the establishment of future policies and strategies. First, the key words used in the vegetation maintenance project at the historic site of Angkor do not only refer to the concept of plants (and more specifically to 'trees') but also to the concept of heritage. In other words, the concept of heritage is not limited to architectural structures but is also intended to mean the vegetation and surroundings that form the historic site. Second, the expansion of the value of vegetation has contributed to the establishment of the basic principles of conservation based on the 'coexistence' between architectural structures and vegetation; here, vegetation has come to be recognized as an 'essential' element in the conservation of historic sites. Third, the range of vegetation maintenance has expanded from each tree to the surroundings of the temples, and vegetation maintenance came to adopt 'integrative' and 'active' directions to improve not only the growth environment of the vegetation but also the viewing environment experienced by visitors. This change means that it is necessary for the historic site maintenance project to comprehensively deal with the temples and their surrounding areas. Fourth, for the effective performance of the ODA project, the role of the International Coordinating Committee for the Safeguarding and Development of the Historic Site of Angkor (ICC-Angkor), under the influence of UNESCO, was expanded from an examination of the problems with the existing projects to a search for solutions to technical consultation and supervision. This implies that, in order to perform the ODA project in a way that is appropriate to the local conditions, it is important to reach gradual and phased agreements with ICC-Angkor.

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

The Search for Study on the Construction Process and Changes in the Landscape Plants of the Pasanseodang ('파산서당'의 영건과정과 조경식물 변화상 탐색)

  • Joo, Been;Choi, Hayoung;Shin, Sangsup
    • Korean Journal of Heritage: History & Science
    • /
    • v.51 no.1
    • /
    • pp.48-65
    • /
    • 2018
  • The authors of this paper aim to make a record of the construction process, its symbolic meaning, and the changes in the status of the landscape plants at the Pasanseodang according to the Report on the Pasanseodang written by Park Gyu-hyun in 1874. First, the construction of Samgahun Pavilion, which is located in Myo-ri, Habin-myun, Dalsung-gun, Daegu, took about 90 years and spanned the lifetimes of Park Sungsoo, an 11th-generation descendant of Park Paengnyun (1417~1456) through to Park Kyuhyun, a 14th-generation descendant. It was called the shape of dragon, with its head facing the tail (回龍顧尾形), in feng shui. Second, the village of Pahwoe was founded in 1769, the 45th year of the reign of King Yeongjo, by Park Sungsoo for the purpose of socializing with his friends at his thatched home, and was named after his own courtesy name (Samgahun). Park Kwangseok, the second son of Park Sungsoo, built the sarangchae in 1826 and the anchae in 1869 after his marriage (in 1783). Then, Park Kyuhyun, the grandson of Park Kwangseok, built the pond and planted it with lotus flowers, and built the Hayeopjeong in 1874. The Pasanseodang, as the precursor of the Hayeopjeong, may be related with the name of the hillside region behind Samgahun. Third, a quadrangular-shaped pond with a length of 21m and a width of 15m was also built and planted with lotus flowers. In the center of the pond is a small round island that reflects the world view of the Chosun dynasty, i.e. that the sky is round and the landmass is quadrangular. Meanwhile, the name of the Hayeopjeon reflects the value system of aristocrats who lived a life of leisure and artistic indulgence. They called the eastern room "Yeeyeonhun" (怡燕軒) and the western room "Mongyangjae" (蒙養齋), names which embody their wishes for a good life as a member of the nobility and a bright future for one's descendants. Fourth, in Confucian terms, the authors infer the points of view reflected in the kinds of trees that were planted according to Confucian norms (pine tree, lotus, bamboo), the living philosophy of sustainability (willow), the ideology of seclusion and the search for peace of mind (bamboo), and relief efforts for the poor and a life of practicality (chestnut, oak, wild walnut, lacquer). The authors assert that this way of planting trees was a highly effective design feature of landscape architecture that drew on the locational and symbolic significance of the Seodang. Fifth, the majority of the trees that were initially planted withered and were replaced with different species, except for the locust and lotus, at this point. Nevertheless, a review of the process of construction, symbolic meaning, and original architectural landscape of the Samgahun is of value in demonstrating the extended symbolic meaning of their descendants in terms of the practical loss of the function of the Seodang, the values of Feng Sui (red in the east, white in the west, based on the principles of Feng Sui), the function of repelling evils spirits (kalopanax, trifoliate orange), aesthetic and practical values (sweetbrier, apricot, pear, peach, and oriental oak trees), and the prosperity of the family and the timeless value of honest poverty (silk, crape myrtle, and yew trees).

Steel Plate Faults Diagnosis with S-MTS (S-MTS를 이용한 강판의 표면 결함 진단)

  • Kim, Joon-Young;Cha, Jae-Min;Shin, Junguk;Yeom, Choongsub
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.47-67
    • /
    • 2017
  • Steel plate faults is one of important factors to affect the quality and price of the steel plates. So far many steelmakers generally have used visual inspection method that could be based on an inspector's intuition or experience. Specifically, the inspector checks the steel plate faults by looking the surface of the steel plates. However, the accuracy of this method is critically low that it can cause errors above 30% in judgment. Therefore, accurate steel plate faults diagnosis system has been continuously required in the industry. In order to meet the needs, this study proposed a new steel plate faults diagnosis system using Simultaneous MTS (S-MTS), which is an advanced Mahalanobis Taguchi System (MTS) algorithm, to classify various surface defects of the steel plates. MTS has generally been used to solve binary classification problems in various fields, but MTS was not used for multiclass classification due to its low accuracy. The reason is that only one mahalanobis space is established in the MTS. In contrast, S-MTS is suitable for multi-class classification. That is, S-MTS establishes individual mahalanobis space for each class. 'Simultaneous' implies comparing mahalanobis distances at the same time. The proposed steel plate faults diagnosis system was developed in four main stages. In the first stage, after various reference groups and related variables are defined, data of the steel plate faults is collected and used to establish the individual mahalanobis space per the reference groups and construct the full measurement scale. In the second stage, the mahalanobis distances of test groups is calculated based on the established mahalanobis spaces of the reference groups. Then, appropriateness of the spaces is verified by examining the separability of the mahalanobis diatances. In the third stage, orthogonal arrays and Signal-to-Noise (SN) ratio of dynamic type are applied for variable optimization. Also, Overall SN ratio gain is derived from the SN ratio and SN ratio gain. If the derived overall SN ratio gain is negative, it means that the variable should be removed. However, the variable with the positive gain may be considered as worth keeping. Finally, in the fourth stage, the measurement scale that is composed of selected useful variables is reconstructed. Next, an experimental test should be implemented to verify the ability of multi-class classification and thus the accuracy of the classification is acquired. If the accuracy is acceptable, this diagnosis system can be used for future applications. Also, this study compared the accuracy of the proposed steel plate faults diagnosis system with that of other popular classification algorithms including Decision Tree, Multi Perception Neural Network (MLPNN), Logistic Regression (LR), Support Vector Machine (SVM), Tree Bagger Random Forest, Grid Search (GS), Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The steel plates faults dataset used in the study is taken from the University of California at Irvine (UCI) machine learning repository. As a result, the proposed steel plate faults diagnosis system based on S-MTS shows 90.79% of classification accuracy. The accuracy of the proposed diagnosis system is 6-27% higher than MLPNN, LR, GS, GA and PSO. Based on the fact that the accuracy of commercial systems is only about 75-80%, it means that the proposed system has enough classification performance to be applied in the industry. In addition, the proposed system can reduce the number of measurement sensors that are installed in the fields because of variable optimization process. These results show that the proposed system not only can have a good ability on the steel plate faults diagnosis but also reduce operation and maintenance cost. For our future work, it will be applied in the fields to validate actual effectiveness of the proposed system and plan to improve the accuracy based on the results.

A Literature Review and Classification of Recommender Systems on Academic Journals (추천시스템관련 학술논문 분석 및 분류)

  • Park, Deuk-Hee;Kim, Hyea-Kyeong;Choi, Il-Young;Kim, Jae-Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.139-152
    • /
    • 2011
  • Recommender systems have become an important research field since the emergence of the first paper on collaborative filtering in the mid-1990s. In general, recommender systems are defined as the supporting systems which help users to find information, products, or services (such as books, movies, music, digital products, web sites, and TV programs) by aggregating and analyzing suggestions from other users, which mean reviews from various authorities, and user attributes. However, as academic researches on recommender systems have increased significantly over the last ten years, more researches are required to be applicable in the real world situation. Because research field on recommender systems is still wide and less mature than other research fields. Accordingly, the existing articles on recommender systems need to be reviewed toward the next generation of recommender systems. However, it would be not easy to confine the recommender system researches to specific disciplines, considering the nature of the recommender system researches. So, we reviewed all articles on recommender systems from 37 journals which were published from 2001 to 2010. The 37 journals are selected from top 125 journals of the MIS Journal Rankings. Also, the literature search was based on the descriptors "Recommender system", "Recommendation system", "Personalization system", "Collaborative filtering" and "Contents filtering". The full text of each article was reviewed to eliminate the article that was not actually related to recommender systems. Many of articles were excluded because the articles such as Conference papers, master's and doctoral dissertations, textbook, unpublished working papers, non-English publication papers and news were unfit for our research. We classified articles by year of publication, journals, recommendation fields, and data mining techniques. The recommendation fields and data mining techniques of 187 articles are reviewed and classified into eight recommendation fields (book, document, image, movie, music, shopping, TV program, and others) and eight data mining techniques (association rule, clustering, decision tree, k-nearest neighbor, link analysis, neural network, regression, and other heuristic methods). The results represented in this paper have several significant implications. First, based on previous publication rates, the interest in the recommender system related research will grow significantly in the future. Second, 49 articles are related to movie recommendation whereas image and TV program recommendation are identified in only 6 articles. This result has been caused by the easy use of MovieLens data set. So, it is necessary to prepare data set of other fields. Third, recently social network analysis has been used in the various applications. However studies on recommender systems using social network analysis are deficient. Henceforth, we expect that new recommendation approaches using social network analysis will be developed in the recommender systems. So, it will be an interesting and further research area to evaluate the recommendation system researches using social method analysis. This result provides trend of recommender system researches by examining the published literature, and provides practitioners and researchers with insight and future direction on recommender systems. We hope that this research helps anyone who is interested in recommender systems research to gain insight for future research.