• Title/Summary/Keyword: 생성모형

Search Result 1,366, Processing Time 0.026 seconds

The Study of 'Comments on Current Events' in Hwangsong Sinmun (『황성신문(皇城新聞)』의 '시평(時評)' 연구 -「비설(飛屑)」, 「국외냉평(局外冷評)」을 중심으로-)

  • 반재유
    • 한국학연구
    • /
    • no.49
    • /
    • pp.223-242
    • /
    • 2018
  • Starting to occur during modern age, 'Comments on Current Events', or comments on current events, refers to the works which deliver the criticism or reputation about the issues of the time. In Hwangsong Sinmun, as new writing models conflicted with traditional narrative practices, complicated works with various metaphor and symbols were published serially. They are critical materials, which allow you to probe the way in which earlier traditional narrative changed through modern media. This article takes note of a pair of them: Biseol, Gukoenaengpyeong Unlike conventional serials, Biseol and Gukoenaengpyeong, which were 'Comments on Current Events' serials in Hwangsong Sinmun, have something in common with 'Narrative Editorials' for some reasons: interrogatory style which were frequently being used in editorials accounts for a large share of those two serials, their contents with a vein of humor also imply critical social issues, and they make separate comments additionally. In addition, we can see how the style of 'Sowha' in late Joseon Dynasty developed into short narrative through a new medium, the modern newspaper. After all, the two serials are important materials which can identify the way in which modern 'Sowha' was created and established as an epic literature, 'Comments on Current Events'.

Study of Rainfall-Runoff Variation by Grid Size and Critical Area (격자크기와 임계면적에 따른 홍수유출특성 변화)

  • Ahn, Seung-Seop;Lee, Jeung-Seok;Jung, Do-Joon;Han, Ho-Chul
    • Journal of Environmental Science International
    • /
    • v.16 no.4
    • /
    • pp.523-532
    • /
    • 2007
  • This study utilized the 1/25,000 topographic map of the upper area from the Geum-ho watermark located at the middle of Geum-ho river from the National Geographic Information Institute. For the analysis, first, the influence of the size of critical area to the hydro topographic factors was examined changing grid size to $10m{\times}10m,\;30m{\times}30m\;and\;50m{\times}50m$, and the critical area for the formation of a river to $0.01km^2{\sim}0.50km^2$. It is known from the examination result of watershed morphology according to the grid size that the smaller grid size, the better resolution and accuracy. And it is found, from the analysis result of the degree of the river according to the minimum critical area for each grid size, that the grid size does not affect on the degree of the river, and the number of rivers with 2nd and higher degree does not show remarkable difference while there is big difference in the number of 1st degree rivers. From the results above, it is thought that the critical area of $0.15km^2{\sim}0.20km^2$ is appropriate for formation of a river being irrelevant to the grid size in extraction of hydro topographic parameters that are used in the runoff analysis model using topographic maps. Therefore, the GIUH model applied analysis results by use of the river level difference law proposed in this study for the explanation on the outflow response-changing characters according to the decision of a critical value of a minimum level difference river, showed that, since an ogival occurrence time and an ogival flow volume are very significant in a flood occurrence in case of not undertow facilities, the researcher could obtain a good result for the forecast of river outflow when considering a convenient application of the model and an easy acquisition of data, so it's judged that this model is proper as an algorism for the decision of a critical value of a river basin.

Conditional Generative Adversarial Network based Collaborative Filtering Recommendation System (Conditional Generative Adversarial Network(CGAN) 기반 협업 필터링 추천 시스템)

  • Kang, Soyi;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.157-173
    • /
    • 2021
  • With the development of information technology, the amount of available information increases daily. However, having access to so much information makes it difficult for users to easily find the information they seek. Users want a visualized system that reduces information retrieval and learning time, saving them from personally reading and judging all available information. As a result, recommendation systems are an increasingly important technologies that are essential to the business. Collaborative filtering is used in various fields with excellent performance because recommendations are made based on similar user interests and preferences. However, limitations do exist. Sparsity occurs when user-item preference information is insufficient, and is the main limitation of collaborative filtering. The evaluation value of the user item matrix may be distorted by the data depending on the popularity of the product, or there may be new users who have not yet evaluated the value. The lack of historical data to identify consumer preferences is referred to as data sparsity, and various methods have been studied to address these problems. However, most attempts to solve the sparsity problem are not optimal because they can only be applied when additional data such as users' personal information, social networks, or characteristics of items are included. Another problem is that real-world score data are mostly biased to high scores, resulting in severe imbalances. One cause of this imbalance distribution is the purchasing bias, in which only users with high product ratings purchase products, so those with low ratings are less likely to purchase products and thus do not leave negative product reviews. Due to these characteristics, unlike most users' actual preferences, reviews by users who purchase products are more likely to be positive. Therefore, the actual rating data is over-learned in many classes with high incidence due to its biased characteristics, distorting the market. Applying collaborative filtering to these imbalanced data leads to poor recommendation performance due to excessive learning of biased classes. Traditional oversampling techniques to address this problem are likely to cause overfitting because they repeat the same data, which acts as noise in learning, reducing recommendation performance. In addition, pre-processing methods for most existing data imbalance problems are designed and used for binary classes. Binary class imbalance techniques are difficult to apply to multi-class problems because they cannot model multi-class problems, such as objects at cross-class boundaries or objects overlapping multiple classes. To solve this problem, research has been conducted to convert and apply multi-class problems to binary class problems. However, simplification of multi-class problems can cause potential classification errors when combined with the results of classifiers learned from other sub-problems, resulting in loss of important information about relationships beyond the selected items. Therefore, it is necessary to develop more effective methods to address multi-class imbalance problems. We propose a collaborative filtering model using CGAN to generate realistic virtual data to populate the empty user-item matrix. Conditional vector y identify distributions for minority classes and generate data reflecting their characteristics. Collaborative filtering then maximizes the performance of the recommendation system via hyperparameter tuning. This process should improve the accuracy of the model by addressing the sparsity problem of collaborative filtering implementations while mitigating data imbalances arising from real data. Our model has superior recommendation performance over existing oversampling techniques and existing real-world data with data sparsity. SMOTE, Borderline SMOTE, SVM-SMOTE, ADASYN, and GAN were used as comparative models and we demonstrate the highest prediction accuracy on the RMSE and MAE evaluation scales. Through this study, oversampling based on deep learning will be able to further refine the performance of recommendation systems using actual data and be used to build business recommendation systems.

A Coexistence Model in a Dynamic Platform with ICT-based Multi-Value Chains: focusing on Healthcare Service (ICT 기반 다중 가치사슬의 동적 플랫폼에서의 공존 모형: 의료서비스를 중심으로)

  • Lee, Hyun Jung;Chang, Yong Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.69-93
    • /
    • 2017
  • The development of ICT has leaded the diversification and changes of supplies and demands in markets. It also caused the creations of a variety of values which are differentiated from those in the existing market. Therefore, a new-type market is created, which can include multi-value chains which are from ICT-based created markets as well as the existing markets. We defined the platform as the new-type market. In the platform, the multi-value chains can be coexisted with multi-values. In true market, when a new-type value chain entered into an existing market, it is general that it can be conflicted with the existing value chain in the market. The conflicted problem among multi-value chains in a market is caused by the sharing of limited market resources like suppliers, consumers, services or products among the value chains. In other words, if there are multi-value chains in the platform, then it is possible to have conflictions, overlapping, creations or losses of values among the value chains. To solve the problem, we introduce coexistence factors to reduce the conflictions to reach market equilibrium in the platform. In the other hand, it is possible to lead the creations of differentiated values from the existing market and to augment the total market values in the platform. In the early era of ICT development, ICT was introduced for improvement of efficiency and effectiveness of the value chains in the existing market. However, according to the changed role of ICT from the supporter to the promotor of the market, ICT became to lead the variations of the value chains and creations of various values in the markets. For instance, Uber Taxi created a new value chain with ICT-based new-type service or products with new resources like new suppliers and consumers. When Uber and Traditional Taxi services are playing at the same time in Taxi service platform, it is possible to create values or make conflictions among values between the new and old value chains. In this research, like Uber and traditional taxi services, if there are conflictions among the multi-value chains, then it is necessary to minimize the conflictions in the platform for the coexistence of multi-value chains which can create the value-added values in the platform. So, it is important to predict and discuss the possible conflicted problems between new and old value chains. The confliction should be solved to reach market equilibrium with multi-value chains in the platform. That is, we discuss the possibility of the coexistence of multi-value chains in the platform which are comprised of a variety of suppliers and customers. To do this, especially we are focusing on the healthcare markets. Nowadays healthcare markets are popularized in global market as well as domestic. Therefore, there are a lot of and a variety of healthcare services like Traditional-, Tele-, or Intelligent- healthcare services and so on. It shows that there are multi-suppliers, -consumers and -services as components of each different value chain in the same platform. The platform can be shared by different values that are created or overlapped by confliction and loss of values in the value chains. In this research, as was said, we focused on the healthcare services to show if a platform can be shared by different value chains like traditional-, tele-healthcare and intelligent-healthcare services and products. Additionally, we try to show if it is possible to increase the value of each value chain as well as the total value of the platform. As the result, it is possible to increase of each value of each value chain as well as the total value in the platform. Finally, we propose a coexistence model to overcome such problems and showed the possibility of coexistence between the value chains through experimentation.

Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.111-126
    • /
    • 2017
  • Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.

Water Balance Projection Using Climate Change Scenarios in the Korean Peninsula (기후변화 시나리오를 활용한 미래 한반도 물수급 전망)

  • Kim, Cho-Rong;Kim, Young-Oh;Seo, Seung Beom;Choi, Su-Woong
    • Journal of Korea Water Resources Association
    • /
    • v.46 no.8
    • /
    • pp.807-819
    • /
    • 2013
  • This study proposes a new methodology for future water balance projection considering climate change by assigning a weight to each scenario instead of inputting future streamflows based on GCMs into a water balance model directly. K-nearest neighbor algorithm was employed to assign weights and streamflows in non-flood period (October to the following June) was selected as the criterion for assigning weights. GCM-driven precipitation was input to TANK model to simulate future streamflow scenarios and Quantile Mapping was applied to correct bias between GCM hindcast and historical data. Based on these bias-corrected streamflows, different weights were assigned to each streamflow scenarios to calculate water shortage for the projection periods; 2020s (2010~2039), 2050s (2040~2069), and 2080s (2070~2099). As a result by applying the proposed methodology to project water shortage over the Korean Peninsula, average water shortage for 2020s is projected to increase to 10~32% comparing to the basis (1967~2003). In addition, according to getting decreased in streamflows in non-flood period gradually by 2080s, average water shortage for 2080s is projected to increase up to 97% (516.5 million $m^3/yr$) as maximum comparing to the basis. While the existing research on climate change gives radical increase in future water shortage, the results projected by the weighting method shows conservative change. This study has significance in the applicability of water balance projection regarding climate change, keeping the existing framework of national water resources planning and this lessens the confusion for decision-makers in water sectors.

Characteristics of Corrosion and Water Quality in Simulated Reclaimed Water Distribution Pipelines (모형 재이용관을 이용한 하수재이용수의 부식 및 수질영향 연구)

  • Kang, Sung-Won;Lee, Jai-Young;Lee, Hyun-Dong;Kim, Gi-Eun;Kwak, Pill-Jae
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.34 no.7
    • /
    • pp.473-479
    • /
    • 2012
  • Water reuse has been highlighted as a representative alternative to solve the lacking water resource. This study carried out a study on the pipe corrosion and water quality change which can occur through the supply of reclaimed water, using a simulated reclaimed water distribution pipeline. Galvanized steel pipe (GSP), cast iron pipe (CIP), stainless steel pipe (STSP) and PVC pipe (PVCP) were used for the pipe materials. Reclaimed water(RW) and tap water(TW) were respectively supplied into simulated reclaimed water distribution pipelines. As a result of performing a loop test to supply reclaimed water to simulated reclaimed water distribution pipelines, the weight reduction of pipe coupons showed the sequence of CIP > GSP > STSP ${\approx}$ PVCP. In addition, reclaimed water showed a high corrosion rate comparing to that of tap water. In case of CIP, the initial corrosion rate showed 3.511 mdd(milligrams per square decimeter per day) for reclaimed water and 2.064 mdd for tap water and the corrosion rate for 90 days showed 0.833 mdd for reclaimed water and 0.294 mdd for tap water. Also in case of GSP, the initial corrosion rate showed 2.703 mdd for reclaimed water and 2.499 mdd for tap water and the corrosion rate for 90 days showed 0.349 mdd for reclaimed water and 0.248 mdd for tap water, which was a tendency similar to that appeared in CIP with a tendency to reduce the corrosion rate. As a result of water quality changes of reclaimed water at pipe materials to carry out the loop test, there was higher conversion ratio of ammonia into nitrate in CIP and GSP with higher corrosion rate than that in STSP and PVCP where no corrosion has occurred. The highest denitrification rate of nitrate could be observed from CIP with the most particles generated from corrosion. In CIP, it could be confirmed that there was MIC (Microbiologically Induced Corrosion) as a result of EDS (Energy Dispersive X-ray spectrometer System) analysis results.

Temporal and Spatial Characteristics of Sediment Yields from the Chungju Dam Upstream Watershed (충주댐 상류유역의 유사 발생에 대한 시공간적인 특성)

  • Kim, Chul-Gyum;Lee, Jeong-Eun;Kim, Nam-Won
    • Journal of Korea Water Resources Association
    • /
    • v.40 no.11
    • /
    • pp.887-898
    • /
    • 2007
  • A physically based semi-distributed model, SWAT was applied to the Chungju Dam upstream watershed in order to investigate the spatial and temporal characteristics of watershed sediment yields. For this, general features of the SWAT and sediment simulation algorithm within the model were described briefly, and watershed sediment modeling system was constructed after calibration and validation of parameters related to the runoff and sediment. With this modeling system, temporal and spatial variation of soil loss and sediment yields according to watershed scales, land uses, and reaches was analyzed. Sediment yield rates with drainage areas resulted in $0.5{\sim}0.6ton/ha/yr$ excluding some upstream sub-watersheds and showed around 0.51 ton/ha/yr above the areas of $1,000km^2$. Annual average soil loss according to land use represented the higher values in upland areas, but relatively lower in paddy and forest areas which were similar to the previous results from other researchers. Among the upstream reaches, Pyeongchanggang and Jucheongang showed higher sediment yields which was thought to be caused by larger area and higher fraction of upland than other upstream sub-areas. Monthly sediment yields at the main outlet showed same trend with seasonal rainfall distribution, that is, approximately 62% of annual yield was generated during July to August and the amount was about 208 ton/yr. From the results, we could obtain the uniform value of sediment yield rate and could roughly evaluate the effect of soil loss with land uses, and also could analyze the temporal and spatial characteristics of sediment yields from each reach and monthly variation for the Chungju Dam upstream watershed.

Improving Performance of Recommendation Systems Using Topic Modeling (사용자 관심 이슈 분석을 통한 추천시스템 성능 향상 방안)

  • Choi, Seongi;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.101-116
    • /
    • 2015
  • Recently, due to the development of smart devices and social media, vast amounts of information with the various forms were accumulated. Particularly, considerable research efforts are being directed towards analyzing unstructured big data to resolve various social problems. Accordingly, focus of data-driven decision-making is being moved from structured data analysis to unstructured one. Also, in the field of recommendation system, which is the typical area of data-driven decision-making, the need of using unstructured data has been steadily increased to improve system performance. Approaches to improve the performance of recommendation systems can be found in two aspects- improving algorithms and acquiring useful data with high quality. Traditionally, most efforts to improve the performance of recommendation system were made by the former approach, while the latter approach has not attracted much attention relatively. In this sense, efforts to utilize unstructured data from variable sources are very timely and necessary. Particularly, as the interests of users are directly connected with their needs, identifying the interests of the user through unstructured big data analysis can be a crew for improving performance of recommendation systems. In this sense, this study proposes the methodology of improving recommendation system by measuring interests of the user. Specially, this study proposes the method to quantify interests of the user by analyzing user's internet usage patterns, and to predict user's repurchase based upon the discovered preferences. There are two important modules in this study. The first module predicts repurchase probability of each category through analyzing users' purchase history. We include the first module to our research scope for comparing the accuracy of traditional purchase-based prediction model to our new model presented in the second module. This procedure extracts purchase history of users. The core part of our methodology is in the second module. This module extracts users' interests by analyzing news articles the users have read. The second module constructs a correspondence matrix between topics and news articles by performing topic modeling on real world news articles. And then, the module analyzes users' news access patterns and then constructs a correspondence matrix between articles and users. After that, by merging the results of the previous processes in the second module, we can obtain a correspondence matrix between users and topics. This matrix describes users' interests in a structured manner. Finally, by using the matrix, the second module builds a model for predicting repurchase probability of each category. In this paper, we also provide experimental results of our performance evaluation. The outline of data used our experiments is as follows. We acquired web transaction data of 5,000 panels from a company that is specialized to analyzing ranks of internet sites. At first we extracted 15,000 URLs of news articles published from July 2012 to June 2013 from the original data and we crawled main contents of the news articles. After that we selected 2,615 users who have read at least one of the extracted news articles. Among the 2,615 users, we discovered that the number of target users who purchase at least one items from our target shopping mall 'G' is 359. In the experiments, we analyzed purchase history and news access records of the 359 internet users. From the performance evaluation, we found that our prediction model using both users' interests and purchase history outperforms a prediction model using only users' purchase history from a view point of misclassification ratio. In detail, our model outperformed the traditional one in appliance, beauty, computer, culture, digital, fashion, and sports categories when artificial neural network based models were used. Similarly, our model outperformed the traditional one in beauty, computer, digital, fashion, food, and furniture categories when decision tree based models were used although the improvement is very small.

Preliminary Inspection Prediction Model to select the on-Site Inspected Foreign Food Facility using Multiple Correspondence Analysis (차원축소를 활용한 해외제조업체 대상 사전점검 예측 모형에 관한 연구)

  • Hae Jin Park;Jae Suk Choi;Sang Goo Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.121-142
    • /
    • 2023
  • As the number and weight of imported food are steadily increasing, safety management of imported food to prevent food safety accidents is becoming more important. The Ministry of Food and Drug Safety conducts on-site inspections of foreign food facilities before customs clearance as well as import inspection at the customs clearance stage. However, a data-based safety management plan for imported food is needed due to time, cost, and limited resources. In this study, we tried to increase the efficiency of the on-site inspection by preparing a machine learning prediction model that pre-selects the companies that are expected to fail before the on-site inspection. Basic information of 303,272 foreign food facilities and processing businesses collected in the Integrated Food Safety Information Network and 1,689 cases of on-site inspection information data collected from 2019 to April 2022 were collected. After preprocessing the data of foreign food facilities, only the data subject to on-site inspection were extracted using the foreign food facility_code. As a result, it consisted of a total of 1,689 data and 103 variables. For 103 variables, variables that were '0' were removed based on the Theil-U index, and after reducing by applying Multiple Correspondence Analysis, 49 characteristic variables were finally derived. We build eight different models and perform hyperparameter tuning through 5-fold cross validation. Then, the performance of the generated models are evaluated. The research purpose of selecting companies subject to on-site inspection is to maximize the recall, which is the probability of judging nonconforming companies as nonconforming. As a result of applying various algorithms of machine learning, the Random Forest model with the highest Recall_macro, AUROC, Average PR, F1-score, and Balanced Accuracy was evaluated as the best model. Finally, we apply Kernal SHAP (SHapley Additive exPlanations) to present the selection reason for nonconforming facilities of individual instances, and discuss applicability to the on-site inspection facility selection system. Based on the results of this study, it is expected that it will contribute to the efficient operation of limited resources such as manpower and budget by establishing an imported food management system through a data-based scientific risk management model.