• Title/Summary/Keyword: Build Time

Search Result 1,850, Processing Time 0.029 seconds

Conditional Generative Adversarial Network based Collaborative Filtering Recommendation System (Conditional Generative Adversarial Network(CGAN) 기반 협업 필터링 추천 시스템)

  • Kang, Soyi;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.157-173
    • /
    • 2021
  • With the development of information technology, the amount of available information increases daily. However, having access to so much information makes it difficult for users to easily find the information they seek. Users want a visualized system that reduces information retrieval and learning time, saving them from personally reading and judging all available information. As a result, recommendation systems are an increasingly important technologies that are essential to the business. Collaborative filtering is used in various fields with excellent performance because recommendations are made based on similar user interests and preferences. However, limitations do exist. Sparsity occurs when user-item preference information is insufficient, and is the main limitation of collaborative filtering. The evaluation value of the user item matrix may be distorted by the data depending on the popularity of the product, or there may be new users who have not yet evaluated the value. The lack of historical data to identify consumer preferences is referred to as data sparsity, and various methods have been studied to address these problems. However, most attempts to solve the sparsity problem are not optimal because they can only be applied when additional data such as users' personal information, social networks, or characteristics of items are included. Another problem is that real-world score data are mostly biased to high scores, resulting in severe imbalances. One cause of this imbalance distribution is the purchasing bias, in which only users with high product ratings purchase products, so those with low ratings are less likely to purchase products and thus do not leave negative product reviews. Due to these characteristics, unlike most users' actual preferences, reviews by users who purchase products are more likely to be positive. Therefore, the actual rating data is over-learned in many classes with high incidence due to its biased characteristics, distorting the market. Applying collaborative filtering to these imbalanced data leads to poor recommendation performance due to excessive learning of biased classes. Traditional oversampling techniques to address this problem are likely to cause overfitting because they repeat the same data, which acts as noise in learning, reducing recommendation performance. In addition, pre-processing methods for most existing data imbalance problems are designed and used for binary classes. Binary class imbalance techniques are difficult to apply to multi-class problems because they cannot model multi-class problems, such as objects at cross-class boundaries or objects overlapping multiple classes. To solve this problem, research has been conducted to convert and apply multi-class problems to binary class problems. However, simplification of multi-class problems can cause potential classification errors when combined with the results of classifiers learned from other sub-problems, resulting in loss of important information about relationships beyond the selected items. Therefore, it is necessary to develop more effective methods to address multi-class imbalance problems. We propose a collaborative filtering model using CGAN to generate realistic virtual data to populate the empty user-item matrix. Conditional vector y identify distributions for minority classes and generate data reflecting their characteristics. Collaborative filtering then maximizes the performance of the recommendation system via hyperparameter tuning. This process should improve the accuracy of the model by addressing the sparsity problem of collaborative filtering implementations while mitigating data imbalances arising from real data. Our model has superior recommendation performance over existing oversampling techniques and existing real-world data with data sparsity. SMOTE, Borderline SMOTE, SVM-SMOTE, ADASYN, and GAN were used as comparative models and we demonstrate the highest prediction accuracy on the RMSE and MAE evaluation scales. Through this study, oversampling based on deep learning will be able to further refine the performance of recommendation systems using actual data and be used to build business recommendation systems.

Military science's understanding on Daodejing of Wangzhen (왕진의 『도덕경』에 대한 병학적 이해)

  • Kim, Tae-yong
    • The Journal of Korean Philosophical History
    • /
    • no.23
    • /
    • pp.295-316
    • /
    • 2008
  • This is to study the military philosophy of Wangzhen's Commentary on Daodejing written by a famous military officer in the end of Tang Dynasty, Wangzhen. Historically, many scholars consider Laozi's Daodejing as a book on military science. Wangzhen, however, is the only person to interpret Daodejing genealogically from a military perspective. Wangzhen thinks that the fact that human beings like competing naturally causes the constant competition in the world. Human beings are born with the most outstanding spirit among all beings. But, when God created human beings, they had greed and avarice in the middle of their heart. Accordingly, human beings look for a success and a profit, and follow the wicked way, leaving the right way. The contractions among each greed and avarice bring about small or big competitions. Human beings have greed and avarice. It means human beings have emotions. As a result, the competitions in the world are not able to disappear because human beings have emotions. To win the completion human beings use weapons. According to Wangzhen, the war is the most devilish deed due to the weapon's atrocious, dangerous quality. Yet, the world's interests are decided by how efficiently the weapons are used. Consequently, the weapon techniques are worthy and play an important role in the real world. Morality, however, should be in the first priority in ruling over a country and commanding the army. The national security and the war victory could be secured when civil and military affairs have a balance. Wangzhen thinks that Laozi emphasizes "Not-Competing" as a basic solution of competition. The competition is the root cause of war and disorder. Therefore, Not-Competing is the main idea of Daodejing. Not-Competiting is a basis of Wangzhen's military philosophy as well. For Wangzhen, Not-Competing is Wuwei. Wuwei has political and military meanings at the same time. Wangzhen build up the "Not-Competing" military philosophy by applying Loazi's Daodejing to his military philosophy.

Prediction of patent lifespan and analysis of influencing factors using machine learning (기계학습을 활용한 특허수명 예측 및 영향요인 분석)

  • Kim, Yongwoo;Kim, Min Gu;Kim, Young-Min
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.2
    • /
    • pp.147-170
    • /
    • 2022
  • Although the number of patent which is one of the core outputs of technological innovation continues to increase, the number of low-value patents also hugely increased. Therefore, efficient evaluation of patents has become important. Estimation of patent lifespan which represents private value of a patent, has been studied for a long time, but in most cases it relied on a linear model. Even if machine learning methods were used, interpretation or explanation of the relationship between explanatory variables and patent lifespan was insufficient. In this study, patent lifespan (number of renewals) is predicted based on the idea that patent lifespan represents the value of the patent. For the research, 4,033,414 patents applied between 1996 and 2017 and finally granted were collected from USPTO (US Patent and Trademark Office). To predict the patent lifespan, we use variables that can reflect the characteristics of the patent, the patent owner's characteristics, and the inventor's characteristics. We build four different models (Ridge Regression, Random Forest, Feed Forward Neural Network, Gradient Boosting Models) and perform hyperparameter tuning through 5-fold Cross Validation. Then, the performance of the generated models are evaluated, and the relative importance of predictors is also presented. In addition, based on the Gradient Boosting Model which have excellent performance, Accumulated Local Effects Plot is presented to visualize the relationship between predictors and patent lifespan. Finally, we apply Kernal SHAP (SHapley Additive exPlanations) to present the evaluation reason of individual patents, and discuss applicability to the patent evaluation system. This study has academic significance in that it cumulatively contributes to the existing patent life estimation research and supplements the limitations of existing patent life estimation studies based on linearity. It is academically meaningful that this study contributes cumulatively to the existing studies which estimate patent lifespan, and that it supplements the limitations of linear models. Also, it is practically meaningful to suggest a method for deriving the evaluation basis for individual patent value and examine the applicability to patent evaluation systems.

A Case Study on Servicism: Korea's Founding and Development Wisdom (서비스주의 국가사례 연구: 대한민국 건국과 발전 지혜)

  • Hyunsoo Kim
    • Journal of Service Research and Studies
    • /
    • v.12 no.2
    • /
    • pp.135-151
    • /
    • 2022
  • This study was conducted on a servicism basis as a study to build the management wisdom of Korea as an asset of human society. The case of the founding and development of the Republic of Korea, which developed into one of the world's leading powers with unprecedented rapid growth despite the devastating three-year war immediately after its founding, is an important subject of study. This study conducted a macroscopic analysis. In the process of carrying out the founding activities, it was confirmed that the original idea inherent in Koreans was the Taegeuk idea of the National Flag. The founders of the Republic of Korea consciously or unconsciously adopted the Taegeuk ideology as the founding ideology. It can be said that the Taegeuk ideology has led to greater success because it is based on the service philosophy, which is the common truth of mankind. In this study, the process of preparing for the founding of the Republic of Korea, which was founded by accepting a new maritime civilization from a country with a long continental civilization, the process of national foundation, and the process of operating the country after its founding were analyzed as a whole. In particular, it discovered the excellent wisdom of the founders who appropriately utilized the strengths of continental and maritime civilizations in necessary situations, and confirmed that the dynamic operating model of development through fierce competition between the two opponents was a success factor for high-speed national development. It requires great wisdom for Koreans with diverse ideological backgrounds and unique personalities to unite to establish and run a nation. This is because fierce competition and cooperation must be carried out at the same time. The great wisdom of the founding of the Republic of Korea was able to be demonstrated because the Taegeuk ideology was acting as a common basic ideology for Koreans. Taegeuk ideology, the source of wisdom that Korea has developed under constant tension, can be capitalized as human wisdom, and additional case studies are needed.

Preliminary Inspection Prediction Model to select the on-Site Inspected Foreign Food Facility using Multiple Correspondence Analysis (차원축소를 활용한 해외제조업체 대상 사전점검 예측 모형에 관한 연구)

  • Hae Jin Park;Jae Suk Choi;Sang Goo Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.121-142
    • /
    • 2023
  • As the number and weight of imported food are steadily increasing, safety management of imported food to prevent food safety accidents is becoming more important. The Ministry of Food and Drug Safety conducts on-site inspections of foreign food facilities before customs clearance as well as import inspection at the customs clearance stage. However, a data-based safety management plan for imported food is needed due to time, cost, and limited resources. In this study, we tried to increase the efficiency of the on-site inspection by preparing a machine learning prediction model that pre-selects the companies that are expected to fail before the on-site inspection. Basic information of 303,272 foreign food facilities and processing businesses collected in the Integrated Food Safety Information Network and 1,689 cases of on-site inspection information data collected from 2019 to April 2022 were collected. After preprocessing the data of foreign food facilities, only the data subject to on-site inspection were extracted using the foreign food facility_code. As a result, it consisted of a total of 1,689 data and 103 variables. For 103 variables, variables that were '0' were removed based on the Theil-U index, and after reducing by applying Multiple Correspondence Analysis, 49 characteristic variables were finally derived. We build eight different models and perform hyperparameter tuning through 5-fold cross validation. Then, the performance of the generated models are evaluated. The research purpose of selecting companies subject to on-site inspection is to maximize the recall, which is the probability of judging nonconforming companies as nonconforming. As a result of applying various algorithms of machine learning, the Random Forest model with the highest Recall_macro, AUROC, Average PR, F1-score, and Balanced Accuracy was evaluated as the best model. Finally, we apply Kernal SHAP (SHapley Additive exPlanations) to present the selection reason for nonconforming facilities of individual instances, and discuss applicability to the on-site inspection facility selection system. Based on the results of this study, it is expected that it will contribute to the efficient operation of limited resources such as manpower and budget by establishing an imported food management system through a data-based scientific risk management model.

The Genealogy of Forbidden Sound -Political Aesthetics of Ambiguity in the Criticism of Japanese Style in Korean Society of the 1960s (일본적인 것, 혹은 금지된 '소리'의 계보 -한일국교정상화 성립기 '왜색(倭色)' 비판담론과 양의성의 정치미학)

  • Jeong, Chang-Hoon
    • Journal of Popular Narrative
    • /
    • v.25 no.1
    • /
    • pp.349-392
    • /
    • 2019
  • In the 1960s of Korea, the normalization of diplomatic relations between Korea and Japan led to a sense of a vigorous anxiety and fear that "Japan will once again come to the Korean peninsula". As a reaction to this, the discourse on the criticism of 'Japanese Style' strongly emerged. If the prior discourse of criticism was to express the national antipathy toward the colonial remnants that had not yet been disposed of, the critical discourse of the 1960s was the wariness of the newly created 'Japanese Style' in popular culture, and to grasp it as a symptomatic phenomenon that 'evil-minded Japan' was revealed. Thus, this new logic of criticism of the 'Japanese Style' had a qualitative difference from the existing ones. It was accompanied by a willingness to inspect and censor the 'masses' that grew up as consumers of transnational 'mass culture' that flowed and chained in the geopolitical order under the Cold War system. Therefore, the topology of 'popular things=Japanese things=consuming things' reveals the paradox of moral demands that existed within Korean society in the 1960s. This was to solidify the divisive circulation structure that caused them to avoid direct contact with the other called 'Japan', but at the same time, get as close to it as ever. It is a repetitive obsession that pushes the other to another side through the moral segregation that strictly draws a line of demarcation between oneself and the other, but on the other hand is attracted to the object and pulls it back to its side. This paper intends to listen to the different voices that have arisen in the repetitive obsession to understand the significance of the dissonance that has been repeated in the contemporary era. This will be an examination of the paradoxical object of Japan that has been repeatedly asked to build the internal control principle of Korean society, or to hide the oppressive and violent side of the power, and that can neither be accepted nor destroyed completely as part of oneself.

Development of Strategies to Improve Water Quality of the Yeongsan River in Connection with Adaptation to Climate Change (기후변화의 적응과 연계한 영산강 수질개선대책 개발)

  • Yong Woon Lee;Won Mo Yang;Gwang Duck Song;Yong Uk Ryu;Hak Young Lee
    • Korean Journal of Ecology and Environment
    • /
    • v.56 no.3
    • /
    • pp.187-195
    • /
    • 2023
  • Almost all of the water from agricultural dams located to the upper of the Yeongsan river is supplied as irrigation water for farmland and thus is not discharged to the main stream of the river. Also, most of the irrigation water does not return to the river after use, adding to the lack of flow in the main stream. As a result, the water quality and aquatic health of the river have become the poorest among the four major rivers in Korea. Therefore, in this study, several strategies for water quality improvement of the river were developed considering pollution reduction and flow rate increase, and their effect analysis was performed using a water quality model. The results of this study showed that the target water quality of the Yeongsan river could be achieved if flow increase strategies (FISs) are intensively pursued in parallel with pollution reduction. The reason is because the water quality of the river has been steadily improved through pollution reduction but this method is now nearing the limit. In addition, rainfall-related FISs such as dam construction and water distribution adjustment may be less effective or lost if a megadrought continues due to climate change and then rainfall does not occur for a long time. Therefore, in the future, if the application conditions for the FISs are similar, the seawater desalination facility, which is independent of rainfall, should be considered as the priority installation target among the FISs. The reason is that seawater desalination facilities can replace the water supply function of dams, which are difficult to newly build in Korea, and can be useful as a climate change adaptation facility by preventing water-related disasters in the event of a long-term megadrought.

Case study on flood water level prediction accuracy of LSTM model according to condition of reference hydrological station combination (참조 수문관측소 구성 조건에 따른 LSTM 모형 홍수위예측 정확도 검토 사례 연구)

  • Lee, Seungho;Kim, Sooyoung;Jung, Jaewon;Yoon, Kwang Seok
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.12
    • /
    • pp.981-992
    • /
    • 2023
  • Due to recent global climate change, the scale of flood damage is increasing as rainfall is concentrated and its intensity increases. Rain on a scale that has not been observed in the past may fall, and long-term rainy seasons that have not been recorded may occur. These damages are also concentrated in ASEAN countries, and many people in ASEAN countries are affected, along with frequent occurrences of flooding due to typhoons and torrential rains. In particular, the Bandung region which is located in the Upper Chitarum River basin in Indonesia has topographical characteristics in the form of a basin, making it very vulnerable to flooding. Accordingly, through the Official Development Assistance (ODA), a flood forecasting and warning system was established for the Upper Citarium River basin in 2017 and is currently in operation. Nevertheless, the Upper Citarium River basin is still exposed to the risk of human and property damage in the event of a flood, so efforts to reduce damage through fast and accurate flood forecasting are continuously needed. Therefore, in this study an artificial intelligence-based river flood water level forecasting model for Dayeu Kolot as a target station was developed by using 10-minute hydrological data from 4 rainfall stations and 1 water level station. Using 10-minute hydrological observation data from 6 stations from January 2017 to January 2021, learning, verification, and testing were performed for lead time such as 0.5, 1, 2, 3, 4, 5 and 6 hour and LSTM was applied as an artificial intelligence algorithm. As a result of the study, good results were shown in model fit and error for all lead times, and as a result of reviewing the prediction accuracy according to the learning dataset conditions, it is expected to be used to build an efficient artificial intelligence-based model as it secures prediction accuracy similar to that of using all observation stations even when there are few reference stations.

A Study on the Establishment of Comparison System between the Statement of Military Reports and Related Laws (군(軍) 보고서 등장 문장과 관련 법령 간 비교 시스템 구축 방안 연구)

  • Jung, Jiin;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.109-125
    • /
    • 2020
  • The Ministry of National Defense is pushing for the Defense Acquisition Program to build strong defense capabilities, and it spends more than 10 trillion won annually on defense improvement. As the Defense Acquisition Program is directly related to the security of the nation as well as the lives and property of the people, it must be carried out very transparently and efficiently by experts. However, the excessive diversification of laws and regulations related to the Defense Acquisition Program has made it challenging for many working-level officials to carry out the Defense Acquisition Program smoothly. It is even known that many people realize that there are related regulations that they were unaware of until they push ahead with their work. In addition, the statutory statements related to the Defense Acquisition Program have the tendency to cause serious issues even if only a single expression is wrong within the sentence. Despite this, efforts to establish a sentence comparison system to correct this issue in real time have been minimal. Therefore, this paper tries to propose a "Comparison System between the Statement of Military Reports and Related Laws" implementation plan that uses the Siamese Network-based artificial neural network, a model in the field of natural language processing (NLP), to observe the similarity between sentences that are likely to appear in the Defense Acquisition Program related documents and those from related statutory provisions to determine and classify the risk of illegality and to make users aware of the consequences. Various artificial neural network models (Bi-LSTM, Self-Attention, D_Bi-LSTM) were studied using 3,442 pairs of "Original Sentence"(described in actual statutes) and "Edited Sentence"(edited sentences derived from "Original Sentence"). Among many Defense Acquisition Program related statutes, DEFENSE ACQUISITION PROGRAM ACT, ENFORCEMENT RULE OF THE DEFENSE ACQUISITION PROGRAM ACT, and ENFORCEMENT DECREE OF THE DEFENSE ACQUISITION PROGRAM ACT were selected. Furthermore, "Original Sentence" has the 83 provisions that actually appear in the Act. "Original Sentence" has the main 83 clauses most accessible to working-level officials in their work. "Edited Sentence" is comprised of 30 to 50 similar sentences that are likely to appear modified in the county report for each clause("Original Sentence"). During the creation of the edited sentences, the original sentences were modified using 12 certain rules, and these sentences were produced in proportion to the number of such rules, as it was the case for the original sentences. After conducting 1 : 1 sentence similarity performance evaluation experiments, it was possible to classify each "Edited Sentence" as legal or illegal with considerable accuracy. In addition, the "Edited Sentence" dataset used to train the neural network models contains a variety of actual statutory statements("Original Sentence"), which are characterized by the 12 rules. On the other hand, the models are not able to effectively classify other sentences, which appear in actual military reports, when only the "Original Sentence" and "Edited Sentence" dataset have been fed to them. The dataset is not ample enough for the model to recognize other incoming new sentences. Hence, the performance of the model was reassessed by writing an additional 120 new sentences that have better resemblance to those in the actual military report and still have association with the original sentences. Thereafter, we were able to check that the models' performances surpassed a certain level even when they were trained merely with "Original Sentence" and "Edited Sentence" data. If sufficient model learning is achieved through the improvement and expansion of the full set of learning data with the addition of the actual report appearance sentences, the models will be able to better classify other sentences coming from military reports as legal or illegal. Based on the experimental results, this study confirms the possibility and value of building "Real-Time Automated Comparison System Between Military Documents and Related Laws". The research conducted in this experiment can verify which specific clause, of several that appear in related law clause is most similar to the sentence that appears in the Defense Acquisition Program-related military reports. This helps determine whether the contents in the military report sentences are at the risk of illegality when they are compared with those in the law clauses.

An Exploratory Study on the Competition Patterns Between Internet Sites in Korea (한국 인터넷사이트들의 산업별 경쟁유형에 대한 탐색적 연구)

  • Park, Yoonseo;Kim, Yongsik
    • Asia Marketing Journal
    • /
    • v.12 no.4
    • /
    • pp.79-111
    • /
    • 2011
  • Digital economy has grown rapidly so that the new business area called 'Internet business' has been dramatically extended as time goes on. However, in the case of Internet business, market shares of individual companies seem to fluctuate very extremely. Thus marketing managers who operate the Internet sites have seriously observed the competition structure of the Internet business market and carefully analyzed the competitors' behavior in order to achieve their own business goals in the market. The newly created Internet business might differ from the offline ones in management styles, because it has totally different business circumstances when compared with the existing offline businesses. Thus, there should be a lot of researches for finding the solutions about what the features of Internet business are and how the management style of those Internet business companies should be changed. Most marketing literatures related to the Internet business have focused on individual business markets. Specifically, many researchers have studied the Internet portal sites and the Internet shopping mall sites, which are the most general forms of Internet business. On the other hand, this study focuses on the entire Internet business industry to understand the competitive circumstance of online market. This approach makes it possible not only to have a broader view to comprehend overall e-business industry, but also to understand the differences in competition structures among Internet business markets. We used time-series data of Internet connection rates by consumers as the basic data to figure out the competition patterns in the Internet business markets. Specifically, the data for this research was obtained from one of Internet ranking sites, 'Fian'. The Internet business ranking data is obtained based on web surfing record of some pre-selected sample group where the possibility of double-count for page-views is controlled by method of same IP check. The ranking site offers several data which are very useful for comparison and analysis of competitive sites. The Fian site divides the Internet business areas into 34 area and offers market shares of big 5 sites which are on high rank in each category daily. We collected the daily market share data about Internet sites on each area from April 22, 2008 to August 5, 2008, where some errors of data was found and 30 business area data were finally used for our research after the data purification. This study performed several empirical analyses in focusing on market shares of each site to understand the competition among sites in Internet business of Korea. We tried to perform more statistically precise analysis for looking into business fields with similar competitive structures by applying the cluster analysis to the data. The research results are as follows. First, the leading sites in each area were classified into three groups based on averages and standard deviations of daily market shares. The first group includes the sites with the lowest market shares, which give more increased convenience to consumers by offering the Internet sites as complimentary services for existing offline services. The second group includes sites with medium level of market shares, where the site users are limited to specific small group. The third group includes sites with the highest market shares, which usually require online registration in advance and have difficulty in switching to another site. Second, we analyzed the second place sites in each business area because it may help us understand the competitive power of the strongest competitor against the leading site. The second place sites in each business area were classified into four groups based on averages and standard deviations of daily market shares. The four groups are the sites showing consistent inferiority compared to the leading sites, the sites with relatively high volatility and medium level of shares, the sites with relatively low volatility and medium level of shares, the sites with relatively low volatility and high level of shares whose gaps are not big compared to the leading sites. Except 'web agency' area, these second place sites show relatively stable shares below 0.1 point of standard deviation. Third, we also classified the types of relative strength between leading sites and the second place sites by applying the cluster analysis to the gap values of market shares between two sites. They were also classified into four groups, the sites with the relatively lowest gaps even though the values of standard deviation are various, the sites with under the average level of gaps, the sites with over the average level of gaps, the sites with the relatively higher gaps and lower volatility. Then we also found that while the areas with relatively bigger gap values usually have smaller standard deviation values, the areas with very small differences between the first and the second sites have a wider range of standard deviation values. The practical and theoretical implications of this study are as follows. First, the result of this study might provide the current market participants with the useful information to understand the competitive circumstance of the market and build the effective new business strategy for the market success. Also it might be useful to help new potential companies find a new business area and set up successful competitive strategies. Second, it might help Internet marketing researchers take a macro view of the overall Internet market so that make possible to begin the new studies on overall Internet market beyond individual Internet market studies.

  • PDF